Learning the 3D Pose of Vehicles from 2D Vehicle Patches

C. Koetsier; T. Peters; M. Sester

doi:10.5194/isprs-archives-XLIII-B2-2020-683-2020

Details

Original language	English
Pages (from-to)	683-688
Number of pages	6
Journal	International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives
Volume	43
Issue number	B2
Publication status	Published - 12 Aug 2020
Event	2020 24th ISPRS Congress - Technical Commission II - Nice, Virtual, France Duration: 31 Aug 2020 → 2 Sept 2020

Abstract

Estimating vehicle poses is crucial for generating precise movement trajectories from (surveillance) camera data. Additionally for real time applications this task has to be solved in an efficient way. In this paper we introduce a deep convolutional neural network for pose estimation of vehicles from image patches. For a given 2D image patch our approach estimates the 2D coordinates of the image representing the exact center ground point (cx, cy) and the orientation of the vehicle - represented by the elevation angle (e) of the camera with respect to the vehicle's center ground point and the azimuth rotation (a) of the vehicle with respect to the camera. To train a accurate model a large and diverse training dataset is needed. Collecting and labeling such large amount of data is very time consuming and expensive. Due to the lack of a sufficient amount of training data we show furthermore, that also rendered 3D vehicle models with artificial generated textures are nearly adequate for training.

Keywords

Deep Learning, Pose Estimation, Surveillance Video Analysis, Trajectory Analysis, Trajectory Extraction

ASJC Scopus subject areas

Computer Science(all)
Information Systems
Social Sciences(all)
Geography, Planning and Development

Cite this

Learning the 3D Pose of Vehicles from 2D Vehicle Patches. / Koetsier, C.; Peters, T.; Sester, M.
In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, Vol. 43, No. B2, 12.08.2020, p. 683-688.

Research output: Contribution to journal › Conference article › Research › peer review

Koetsier, C, Peters, T & Sester, M 2020, 'Learning the 3D Pose of Vehicles from 2D Vehicle Patches', International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, vol. 43, no. B2, pp. 683-688. https://doi.org/10.5194/isprs-archives-XLIII-B2-2020-683-2020

Koetsier, C., Peters, T., & Sester, M. (2020). Learning the 3D Pose of Vehicles from 2D Vehicle Patches. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, 43(B2), 683-688. https://doi.org/10.5194/isprs-archives-XLIII-B2-2020-683-2020

Koetsier C, Peters T, Sester M. Learning the 3D Pose of Vehicles from 2D Vehicle Patches. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives. 2020 Aug 12;43(B2):683-688. doi: 10.5194/isprs-archives-XLIII-B2-2020-683-2020

Koetsier, C. ; Peters, T. ; Sester, M. / Learning the 3D Pose of Vehicles from 2D Vehicle Patches. In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives. 2020 ; Vol. 43, No. B2. pp. 683-688.

Download

@article{5c35092b80474923800e575360dcb918,

title = "Learning the 3D Pose of Vehicles from 2D Vehicle Patches",

abstract = "Estimating vehicle poses is crucial for generating precise movement trajectories from (surveillance) camera data. Additionally for real time applications this task has to be solved in an efficient way. In this paper we introduce a deep convolutional neural network for pose estimation of vehicles from image patches. For a given 2D image patch our approach estimates the 2D coordinates of the image representing the exact center ground point (cx, cy) and the orientation of the vehicle - represented by the elevation angle (e) of the camera with respect to the vehicle's center ground point and the azimuth rotation (a) of the vehicle with respect to the camera. To train a accurate model a large and diverse training dataset is needed. Collecting and labeling such large amount of data is very time consuming and expensive. Due to the lack of a sufficient amount of training data we show furthermore, that also rendered 3D vehicle models with artificial generated textures are nearly adequate for training.",

keywords = "Deep Learning, Pose Estimation, Surveillance Video Analysis, Trajectory Analysis, Trajectory Extraction",

author = "C. Koetsier and T. Peters and M. Sester",

year = "2020",

month = aug,

day = "12",

doi = "10.5194/isprs-archives-XLIII-B2-2020-683-2020",

language = "English",

volume = "43",

pages = "683--688",

number = "B2",

note = "2020 24th ISPRS Congress - Technical Commission II ; Conference date: 31-08-2020 Through 02-09-2020",

}

Download

TY - JOUR

T1 - Learning the 3D Pose of Vehicles from 2D Vehicle Patches

AU - Koetsier, C.

AU - Peters, T.

AU - Sester, M.

PY - 2020/8/12

Y1 - 2020/8/12

N2 - Estimating vehicle poses is crucial for generating precise movement trajectories from (surveillance) camera data. Additionally for real time applications this task has to be solved in an efficient way. In this paper we introduce a deep convolutional neural network for pose estimation of vehicles from image patches. For a given 2D image patch our approach estimates the 2D coordinates of the image representing the exact center ground point (cx, cy) and the orientation of the vehicle - represented by the elevation angle (e) of the camera with respect to the vehicle's center ground point and the azimuth rotation (a) of the vehicle with respect to the camera. To train a accurate model a large and diverse training dataset is needed. Collecting and labeling such large amount of data is very time consuming and expensive. Due to the lack of a sufficient amount of training data we show furthermore, that also rendered 3D vehicle models with artificial generated textures are nearly adequate for training.

AB - Estimating vehicle poses is crucial for generating precise movement trajectories from (surveillance) camera data. Additionally for real time applications this task has to be solved in an efficient way. In this paper we introduce a deep convolutional neural network for pose estimation of vehicles from image patches. For a given 2D image patch our approach estimates the 2D coordinates of the image representing the exact center ground point (cx, cy) and the orientation of the vehicle - represented by the elevation angle (e) of the camera with respect to the vehicle's center ground point and the azimuth rotation (a) of the vehicle with respect to the camera. To train a accurate model a large and diverse training dataset is needed. Collecting and labeling such large amount of data is very time consuming and expensive. Due to the lack of a sufficient amount of training data we show furthermore, that also rendered 3D vehicle models with artificial generated textures are nearly adequate for training.

KW - Deep Learning

KW - Pose Estimation

KW - Surveillance Video Analysis

KW - Trajectory Analysis

KW - Trajectory Extraction

UR - http://www.scopus.com/inward/record.url?scp=85091098644&partnerID=8YFLogxK

U2 - 10.5194/isprs-archives-XLIII-B2-2020-683-2020

DO - 10.5194/isprs-archives-XLIII-B2-2020-683-2020

M3 - Conference article

AN - SCOPUS:85091098644

VL - 43

SP - 683

EP - 688

JO - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives

JF - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives

SN - 1682-1750

IS - B2

T2 - 2020 24th ISPRS Congress - Technical Commission II

Y2 - 31 August 2020 through 2 September 2020

ER -

Research@Leibniz University

Learning the 3D Pose of Vehicles from 2D Vehicle Patches

Authors

Research Organisations

Details

Abstract

Keywords

ASJC Scopus subject areas

Cite this

By the same author(s)

3D Uncertain Implicit Surface Mapping Using GMM and GP

Gap completion in point cloud scene occluded by vehicles using SGC-Net

The Challenge of Data Analytics with Climate-neutral Urban Mobility: (Vision Paper)

LAformer: Trajectory Prediction for Autonomous Driving with Lane-Aware Scene Constraints

CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception