Pose estimation and 3D reconstruction of vehicles from stereo-images using a subcategory-aware shape prior

Research output: Contribution to journal › Article › Research › peer review

Authors

Max Coenen, Franz Rottensteiner

Details

Original language: English
Pages (from-to): 27-47
Number of pages: 21
Journal: ISPRS Journal of Photogrammetry and Remote Sensing
Volume: 181
Early online date: 14 Sept 2021
Publication status: Published - Nov 2021

Abstract

The 3D reconstruction of objects is a prerequisite for many highly relevant applications of computer vision such as mobile robotics or autonomous driving. To deal with the inverse problem of reconstructing 3D objects from their 2D projections, a common strategy is to incorporate prior object knowledge into the reconstruction approach by establishing a 3D model and aligning it to the 2D image plane. However, current approaches are limited due to inadequate shape priors and the insufficiency of the derived image observations for a reliable alignment with the 3D model. The goal of this paper is to show how 3D object reconstruction can profit from a more sophisticated shape prior and from a combined incorporation of different observation types inferred from the images. We introduce a subcategory-aware deformable vehicle model that makes use of a prediction of the vehicle type for a more appropriate regularisation of the vehicle shape. A multi-branch CNN is presented to derive predictions of the vehicle type and orientation. This information is also introduced as prior information for model fitting. Furthermore, the CNN extracts vehicle keypoints and wireframes, which are well-suited for model-to-image association and model fitting. The task of pose estimation and reconstruction is addressed by a versatile probabilistic model. Extensive experiments are conducted using two challenging real-world data sets on both of which the benefit of the developed shape prior can be shown. A comparison to state-of-the-art methods for vehicle pose estimation shows that the proposed approach performs on par or better, confirming the suitability of the developed shape prior and probabilistic model for vehicle reconstruction.

Keywords

    3D vehicle reconstruction, Active shape model, Multi-branch CNN, Pose estimation, Vehicle detection

Cite this

Pose estimation and 3D reconstruction of vehicles from stereo-images using a subcategory-aware shape prior. / Coenen, Max; Rottensteiner, Franz.
In: ISPRS Journal of Photogrammetry and Remote Sensing, Vol. 181, 11.2021, p. 27-47.

Coenen M, Rottensteiner F. Pose estimation and 3D reconstruction of vehicles from stereo-images using a subcategory-aware shape prior. ISPRS Journal of Photogrammetry and Remote Sensing. 2021 Nov;181:27-47. Epub 2021 Sept 14. doi: 10.48550/arXiv.2107.10898, 10.1016/j.isprsjprs.2021.07.006
@article{ce0d30f62c6b43b28192f19f3e306edf,
title = "Pose estimation and 3D reconstruction of vehicles from stereo-images using a subcategory-aware shape prior",
abstract = "The 3D reconstruction of objects is a prerequisite for many highly relevant applications of computer vision such as mobile robotics or autonomous driving. To deal with the inverse problem of reconstructing 3D objects from their 2D projections, a common strategy is to incorporate prior object knowledge into the reconstruction approach by establishing a 3D model and aligning it to the 2D image plane. However, current approaches are limited due to inadequate shape priors and the insufficiency of the derived image observations for a reliable alignment with the 3D model. The goal of this paper is to show how 3D object reconstruction can profit from a more sophisticated shape prior and from a combined incorporation of different observation types inferred from the images. We introduce a subcategory-aware deformable vehicle model that makes use of a prediction of the vehicle type for a more appropriate regularisation of the vehicle shape. A multi-branch CNN is presented to derive predictions of the vehicle type and orientation. This information is also introduced as prior information for model fitting. Furthermore, the CNN extracts vehicle keypoints and wireframes, which are well-suited for model-to-image association and model fitting. The task of pose estimation and reconstruction is addressed by a versatile probabilistic model. Extensive experiments are conducted using two challenging real-world data sets on both of which the benefit of the developed shape prior can be shown. A comparison to state-of-the-art methods for vehicle pose estimation shows that the proposed approach performs on par or better, confirming the suitability of the developed shape prior and probabilistic model for vehicle reconstruction.",
keywords = "3D vehicle reconstruction, Active shape model, Multi-branch CNN, Pose estimation, Vehicle detection",
author = "Max Coenen and Franz Rottensteiner",
note = "Funding Information: This work was supported by the German Research Foundation (DFG) as part of the Research Training Group i.c.sens [ GRK2159 ].",
year = "2021",
month = nov,
doi = "10.48550/arXiv.2107.10898",
language = "English",
volume = "181",
pages = "27--47",
journal = "ISPRS Journal of Photogrammetry and Remote Sensing",
issn = "0924-2716",
publisher = "Elsevier",
}

TY - JOUR
T1 - Pose estimation and 3D reconstruction of vehicles from stereo-images using a subcategory-aware shape prior
AU - Coenen, Max
AU - Rottensteiner, Franz
N1 - Funding Information: This work was supported by the German Research Foundation (DFG) as part of the Research Training Group i.c.sens [ GRK2159 ].
PY - 2021/11
Y1 - 2021/11
AB - The 3D reconstruction of objects is a prerequisite for many highly relevant applications of computer vision such as mobile robotics or autonomous driving. To deal with the inverse problem of reconstructing 3D objects from their 2D projections, a common strategy is to incorporate prior object knowledge into the reconstruction approach by establishing a 3D model and aligning it to the 2D image plane. However, current approaches are limited due to inadequate shape priors and the insufficiency of the derived image observations for a reliable alignment with the 3D model. The goal of this paper is to show how 3D object reconstruction can profit from a more sophisticated shape prior and from a combined incorporation of different observation types inferred from the images. We introduce a subcategory-aware deformable vehicle model that makes use of a prediction of the vehicle type for a more appropriate regularisation of the vehicle shape. A multi-branch CNN is presented to derive predictions of the vehicle type and orientation. This information is also introduced as prior information for model fitting. Furthermore, the CNN extracts vehicle keypoints and wireframes, which are well-suited for model-to-image association and model fitting. The task of pose estimation and reconstruction is addressed by a versatile probabilistic model. Extensive experiments are conducted using two challenging real-world data sets on both of which the benefit of the developed shape prior can be shown. A comparison to state-of-the-art methods for vehicle pose estimation shows that the proposed approach performs on par or better, confirming the suitability of the developed shape prior and probabilistic model for vehicle reconstruction.
KW - 3D vehicle reconstruction
KW - Active shape model
KW - Multi-branch CNN
KW - Pose estimation
KW - Vehicle detection
UR - http://www.scopus.com/inward/record.url?scp=85114784807&partnerID=8YFLogxK
U2 - 10.48550/arXiv.2107.10898
DO - 10.48550/arXiv.2107.10898
M3 - Article
AN - SCOPUS:85114784807
VL - 181
SP - 27
EP - 47
JO - ISPRS Journal of Photogrammetry and Remote Sensing
JF - ISPRS Journal of Photogrammetry and Remote Sensing
SN - 0924-2716
ER -
