Pose estimation and 3D reconstruction of vehicles from stereo-images using a subcategory-aware shape prior

Max Coenen; Franz Rottensteiner

doi:10.48550/arXiv.2107.10898

Details

Originalsprache	Englisch
Seiten (von - bis)	27-47
Seitenumfang	21
Fachzeitschrift	ISPRS Journal of Photogrammetry and Remote Sensing
Jahrgang	181
Frühes Online-Datum	14 Sept. 2021
Publikationsstatus	Veröffentlicht - Nov. 2021

Abstract

The 3D reconstruction of objects is a prerequisite for many highly relevant applications of computer vision such as mobile robotics or autonomous driving. To deal with the inverse problem of reconstructing 3D objects from their 2D projections, a common strategy is to incorporate prior object knowledge into the reconstruction approach by establishing a 3D model and aligning it to the 2D image plane. However, current approaches are limited due to inadequate shape priors and the insufficiency of the derived image observations for a reliable alignment with the 3D model. The goal of this paper is to show how 3D object reconstruction can profit from a more sophisticated shape prior and from a combined incorporation of different observation types inferred from the images. We introduce a subcategory-aware deformable vehicle model that makes use of a prediction of the vehicle type for a more appropriate regularisation of the vehicle shape. A multi-branch CNN is presented to derive predictions of the vehicle type and orientation. This information is also introduced as prior information for model fitting. Furthermore, the CNN extracts vehicle keypoints and wireframes, which are well-suited for model-to-image association and model fitting. The task of pose estimation and reconstruction is addressed by a versatile probabilistic model. Extensive experiments are conducted using two challenging real-world data sets on both of which the benefit of the developed shape prior can be shown. A comparison to state-of-the-art methods for vehicle pose estimation shows that the proposed approach performs on par or better, confirming the suitability of the developed shape prior and probabilistic model for vehicle reconstruction.

ASJC Scopus Sachgebiete

Physik und Astronomie (insg.)
Atom- und Molekularphysik sowie Optik
Ingenieurwesen (insg.)
Ingenieurwesen (sonstige)
Informatik (insg.)
Angewandte Informatik
Erdkunde und Planetologie (insg.)
Computer in den Geowissenschaften

Zitieren

Pose estimation and 3D reconstruction of vehicles from stereo-images using a subcategory-aware shape prior. / Coenen, Max; Rottensteiner, Franz.
in: ISPRS Journal of Photogrammetry and Remote Sensing, Jahrgang 181, 11.2021, S. 27-47.

Publikation: Beitrag in Fachzeitschrift › Artikel › Forschung › Peer-Review

Coenen, M & Rottensteiner, F 2021, 'Pose estimation and 3D reconstruction of vehicles from stereo-images using a subcategory-aware shape prior', ISPRS Journal of Photogrammetry and Remote Sensing, Jg. 181, S. 27-47. https://doi.org/10.48550/arXiv.2107.10898, https://doi.org/10.1016/j.isprsjprs.2021.07.006

Coenen, M., & Rottensteiner, F. (2021). Pose estimation and 3D reconstruction of vehicles from stereo-images using a subcategory-aware shape prior. ISPRS Journal of Photogrammetry and Remote Sensing, 181, 27-47. https://doi.org/10.48550/arXiv.2107.10898, https://doi.org/10.1016/j.isprsjprs.2021.07.006

Coenen M, Rottensteiner F. Pose estimation and 3D reconstruction of vehicles from stereo-images using a subcategory-aware shape prior. ISPRS Journal of Photogrammetry and Remote Sensing. 2021 Nov;181:27-47. Epub 2021 Sep 14. doi: 10.48550/arXiv.2107.10898, 10.1016/j.isprsjprs.2021.07.006

Coenen, Max ; Rottensteiner, Franz. / Pose estimation and 3D reconstruction of vehicles from stereo-images using a subcategory-aware shape prior. in: ISPRS Journal of Photogrammetry and Remote Sensing. 2021 ; Jahrgang 181. S. 27-47.

Download

@article{ce0d30f62c6b43b28192f19f3e306edf,

title = "Pose estimation and 3D reconstruction of vehicles from stereo-images using a subcategory-aware shape prior",

abstract = "The 3D reconstruction of objects is a prerequisite for many highly relevant applications of computer vision such as mobile robotics or autonomous driving. To deal with the inverse problem of reconstructing 3D objects from their 2D projections, a common strategy is to incorporate prior object knowledge into the reconstruction approach by establishing a 3D model and aligning it to the 2D image plane. However, current approaches are limited due to inadequate shape priors and the insufficiency of the derived image observations for a reliable alignment with the 3D model. The goal of this paper is to show how 3D object reconstruction can profit from a more sophisticated shape prior and from a combined incorporation of different observation types inferred from the images. We introduce a subcategory-aware deformable vehicle model that makes use of a prediction of the vehicle type for a more appropriate regularisation of the vehicle shape. A multi-branch CNN is presented to derive predictions of the vehicle type and orientation. This information is also introduced as prior information for model fitting. Furthermore, the CNN extracts vehicle keypoints and wireframes, which are well-suited for model-to-image association and model fitting. The task of pose estimation and reconstruction is addressed by a versatile probabilistic model. Extensive experiments are conducted using two challenging real-world data sets on both of which the benefit of the developed shape prior can be shown. A comparison to state-of-the-art methods for vehicle pose estimation shows that the proposed approach performs on par or better, confirming the suitability of the developed shape prior and probabilistic model for vehicle reconstruction.",

keywords = "3D vehicle reconstruction, Active shape model, Multi-branch CNN, Pose estimation, Vehicle detection",

author = "Max Coenen and Franz Rottensteiner",

note = "Funding Information: This work was supported by the German Research Foundation (DFG) as part of the Research Training Group i.c.sens [ GRK2159 ].",

year = "2021",

month = nov,

doi = "10.48550/arXiv.2107.10898",

language = "English",

volume = "181",

pages = "27--47",

journal = "ISPRS Journal of Photogrammetry and Remote Sensing",

issn = "0924-2716",

publisher = "Elsevier",

}

Download

TY - JOUR

T1 - Pose estimation and 3D reconstruction of vehicles from stereo-images using a subcategory-aware shape prior

AU - Coenen, Max

AU - Rottensteiner, Franz

N1 - Funding Information: This work was supported by the German Research Foundation (DFG) as part of the Research Training Group i.c.sens [ GRK2159 ].

PY - 2021/11

Y1 - 2021/11

N2 - The 3D reconstruction of objects is a prerequisite for many highly relevant applications of computer vision such as mobile robotics or autonomous driving. To deal with the inverse problem of reconstructing 3D objects from their 2D projections, a common strategy is to incorporate prior object knowledge into the reconstruction approach by establishing a 3D model and aligning it to the 2D image plane. However, current approaches are limited due to inadequate shape priors and the insufficiency of the derived image observations for a reliable alignment with the 3D model. The goal of this paper is to show how 3D object reconstruction can profit from a more sophisticated shape prior and from a combined incorporation of different observation types inferred from the images. We introduce a subcategory-aware deformable vehicle model that makes use of a prediction of the vehicle type for a more appropriate regularisation of the vehicle shape. A multi-branch CNN is presented to derive predictions of the vehicle type and orientation. This information is also introduced as prior information for model fitting. Furthermore, the CNN extracts vehicle keypoints and wireframes, which are well-suited for model-to-image association and model fitting. The task of pose estimation and reconstruction is addressed by a versatile probabilistic model. Extensive experiments are conducted using two challenging real-world data sets on both of which the benefit of the developed shape prior can be shown. A comparison to state-of-the-art methods for vehicle pose estimation shows that the proposed approach performs on par or better, confirming the suitability of the developed shape prior and probabilistic model for vehicle reconstruction.

AB - The 3D reconstruction of objects is a prerequisite for many highly relevant applications of computer vision such as mobile robotics or autonomous driving. To deal with the inverse problem of reconstructing 3D objects from their 2D projections, a common strategy is to incorporate prior object knowledge into the reconstruction approach by establishing a 3D model and aligning it to the 2D image plane. However, current approaches are limited due to inadequate shape priors and the insufficiency of the derived image observations for a reliable alignment with the 3D model. The goal of this paper is to show how 3D object reconstruction can profit from a more sophisticated shape prior and from a combined incorporation of different observation types inferred from the images. We introduce a subcategory-aware deformable vehicle model that makes use of a prediction of the vehicle type for a more appropriate regularisation of the vehicle shape. A multi-branch CNN is presented to derive predictions of the vehicle type and orientation. This information is also introduced as prior information for model fitting. Furthermore, the CNN extracts vehicle keypoints and wireframes, which are well-suited for model-to-image association and model fitting. The task of pose estimation and reconstruction is addressed by a versatile probabilistic model. Extensive experiments are conducted using two challenging real-world data sets on both of which the benefit of the developed shape prior can be shown. A comparison to state-of-the-art methods for vehicle pose estimation shows that the proposed approach performs on par or better, confirming the suitability of the developed shape prior and probabilistic model for vehicle reconstruction.

KW - 3D vehicle reconstruction

KW - Active shape model

KW - Multi-branch CNN

KW - Pose estimation

KW - Vehicle detection

UR - http://www.scopus.com/inward/record.url?scp=85114784807&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2107.10898

DO - 10.48550/arXiv.2107.10898

M3 - Article

AN - SCOPUS:85114784807

VL - 181

SP - 27

EP - 47

JO - ISPRS Journal of Photogrammetry and Remote Sensing

JF - ISPRS Journal of Photogrammetry and Remote Sensing

SN - 0924-2716

ER -

Research@Leibniz University

Pose estimation and 3D reconstruction of vehicles from stereo-images using a subcategory-aware shape prior

Autorschaft

Organisationseinheiten

Details

Abstract

ASJC Scopus Sachgebiete

Zitieren

Von denselben Autoren

Digital Slump Flow: Image-based assessment of fresh concrete homogeneity as part of the slump flow test

Image-based quality control of fresh concrete based on semantic segmentation algorithms

Fresh Concrete Properties from Stereoscopic Image Sequences

ReCyCONtrol project consortium – Key technologies for the digital revolution in concrete construction

Mechanisms of air bubble rise in cement suspensions studied by X-ray analysis

Digital Slump Flow: Image-based assessment of fresh concrete homogeneity as part of the slump flow test

Image-based quality control of fresh concrete based on semantic segmentation algorithms

Fresh Concrete Properties from Stereoscopic Image Sequences

ReCyCONtrol project consortium – Key technologies for the digital revolution in concrete construction

Mechanisms of air bubble rise in cement suspensions studied by X-ray analysis

Digital Slump Flow: Image-based assessment of fresh concrete homogeneity as part of the slump flow test