Probabilistic Vehicle Reconstruction Using a Multi-Task CNN

Max Coenen; Franz Rottensteiner

doi:10.48550/arXiv.2102.10681

Details

Original language	English
Title of host publication	2019 International Conference on Computer Vision (ICCVW)
Subtitle of host publication	Proceedings
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	822-831
Number of pages	10
ISBN (electronic)	978-1-7281-5023-9
ISBN (print)	978-1-7281-5024-6
Publication status	Published - 30 Oct 2019
Event	2019 IEEE/CVF 17th International Conference on Computer Vision Workshop (ICCVW) - Seoul, Korea, Republic of Duration: 27 Oct 2019 → 28 Oct 2019

Publication series

Name	International Conference on Computer Vision Workshops (ICCV)
Volume	2019
ISSN (Print)	2473-9936
ISSN (electronic)	2473-9944

Abstract

The retrieval of the 3D pose and shape of objects from images is an ill-posed problem. A common way to object reconstruction is to match entities such as keypoints, edges, or contours of a deformable 3D model, used as shape prior, to their corresponding entities inferred from the image. However, such approaches are highly sensitive to model initialisation, imprecise keypoint localisations and/or illumination conditions. In this paper, we present a probabilistic approach for shape-aware 3D vehicle reconstruction from stereo images that leverages the outputs of a novel multi-task CNN. Specifically, we train a CNN that outputs probability distributions for the vehicle's orientation and for both, vehicle keypoints and wireframe edges. Together with 3D stereo information we integrate the predicted distributions into a common probabilistic framework. We believe that the CNN-based detection of wireframe edges reduces the sensitivity to illumination conditions and object contrast and that using the raw probability maps instead of inferring keypoint positions reduces the sensitivity to keypoint localisation errors. We show that our method achieves state-of-the-art results, evaluating our method on the challenging KITTI benchmark and on our own new 'Stereo-Vehicle' dataset.

Keywords

3D scene understanding, Multi task CNN, Pose estimation, Vehicle reconstruction

ASJC Scopus subject areas

Computer Science(all)
Computer Science Applications
Computer Science(all)
Computer Vision and Pattern Recognition

Cite this

Probabilistic Vehicle Reconstruction Using a Multi-Task CNN. / Coenen, Max; Rottensteiner, Franz.
2019 International Conference on Computer Vision (ICCVW): Proceedings. Institute of Electrical and Electronics Engineers Inc., 2019. p. 822-831 (International Conference on Computer Vision Workshops (ICCV); Vol. 2019).

Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review

Coenen, M & Rottensteiner, F 2019, Probabilistic Vehicle Reconstruction Using a Multi-Task CNN. in 2019 International Conference on Computer Vision (ICCVW): Proceedings. International Conference on Computer Vision Workshops (ICCV), vol. 2019, Institute of Electrical and Electronics Engineers Inc., pp. 822-831, 2019 IEEE/CVF 17th International Conference on Computer Vision Workshop (ICCVW), Seoul, Korea, Republic of, 27 Oct 2019. https://doi.org/10.48550/arXiv.2102.10681, https://doi.org/10.1109/ICCVW.2019.00110

Coenen, M., & Rottensteiner, F. (2019). Probabilistic Vehicle Reconstruction Using a Multi-Task CNN. In 2019 International Conference on Computer Vision (ICCVW): Proceedings (pp. 822-831). (International Conference on Computer Vision Workshops (ICCV); Vol. 2019). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.48550/arXiv.2102.10681, https://doi.org/10.1109/ICCVW.2019.00110

Coenen M, Rottensteiner F. Probabilistic Vehicle Reconstruction Using a Multi-Task CNN. In 2019 International Conference on Computer Vision (ICCVW): Proceedings. Institute of Electrical and Electronics Engineers Inc. 2019. p. 822-831. (International Conference on Computer Vision Workshops (ICCV)). doi: 10.48550/arXiv.2102.10681, 10.1109/ICCVW.2019.00110

Coenen, Max ; Rottensteiner, Franz. / Probabilistic Vehicle Reconstruction Using a Multi-Task CNN. 2019 International Conference on Computer Vision (ICCVW): Proceedings. Institute of Electrical and Electronics Engineers Inc., 2019. pp. 822-831 (International Conference on Computer Vision Workshops (ICCV)).

Download

@inproceedings{4537ab2525d74c94bb79391728b3f36b,

title = "Probabilistic Vehicle Reconstruction Using a Multi-Task CNN",

abstract = "The retrieval of the 3D pose and shape of objects from images is an ill-posed problem. A common way to object reconstruction is to match entities such as keypoints, edges, or contours of a deformable 3D model, used as shape prior, to their corresponding entities inferred from the image. However, such approaches are highly sensitive to model initialisation, imprecise keypoint localisations and/or illumination conditions. In this paper, we present a probabilistic approach for shape-aware 3D vehicle reconstruction from stereo images that leverages the outputs of a novel multi-task CNN. Specifically, we train a CNN that outputs probability distributions for the vehicle's orientation and for both, vehicle keypoints and wireframe edges. Together with 3D stereo information we integrate the predicted distributions into a common probabilistic framework. We believe that the CNN-based detection of wireframe edges reduces the sensitivity to illumination conditions and object contrast and that using the raw probability maps instead of inferring keypoint positions reduces the sensitivity to keypoint localisation errors. We show that our method achieves state-of-the-art results, evaluating our method on the challenging KITTI benchmark and on our own new 'Stereo-Vehicle' dataset.",

keywords = "3D scene understanding, Multi task CNN, Pose estimation, Vehicle reconstruction",

author = "Max Coenen and Franz Rottensteiner",

note = "Funding information: This work was supported by the German Research Foun- dation (DFG) as a part of the Research Training Group i.c.sens [GRK2159].; 2019 IEEE/CVF 17th International Conference on Computer Vision Workshop (ICCVW), ICCVW ; Conference date: 27-10-2019 Through 28-10-2019",

year = "2019",

month = oct,

day = "30",

doi = "10.48550/arXiv.2102.10681",

language = "English",

isbn = "978-1-7281-5024-6",

series = "International Conference on Computer Vision Workshops (ICCV)",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "822--831",

booktitle = "2019 International Conference on Computer Vision (ICCVW)",

address = "United States",

}

Download

TY - GEN

T1 - Probabilistic Vehicle Reconstruction Using a Multi-Task CNN

AU - Coenen, Max

AU - Rottensteiner, Franz

N1 - Funding information: This work was supported by the German Research Foun- dation (DFG) as a part of the Research Training Group i.c.sens [GRK2159].

PY - 2019/10/30

Y1 - 2019/10/30

N2 - The retrieval of the 3D pose and shape of objects from images is an ill-posed problem. A common way to object reconstruction is to match entities such as keypoints, edges, or contours of a deformable 3D model, used as shape prior, to their corresponding entities inferred from the image. However, such approaches are highly sensitive to model initialisation, imprecise keypoint localisations and/or illumination conditions. In this paper, we present a probabilistic approach for shape-aware 3D vehicle reconstruction from stereo images that leverages the outputs of a novel multi-task CNN. Specifically, we train a CNN that outputs probability distributions for the vehicle's orientation and for both, vehicle keypoints and wireframe edges. Together with 3D stereo information we integrate the predicted distributions into a common probabilistic framework. We believe that the CNN-based detection of wireframe edges reduces the sensitivity to illumination conditions and object contrast and that using the raw probability maps instead of inferring keypoint positions reduces the sensitivity to keypoint localisation errors. We show that our method achieves state-of-the-art results, evaluating our method on the challenging KITTI benchmark and on our own new 'Stereo-Vehicle' dataset.

AB - The retrieval of the 3D pose and shape of objects from images is an ill-posed problem. A common way to object reconstruction is to match entities such as keypoints, edges, or contours of a deformable 3D model, used as shape prior, to their corresponding entities inferred from the image. However, such approaches are highly sensitive to model initialisation, imprecise keypoint localisations and/or illumination conditions. In this paper, we present a probabilistic approach for shape-aware 3D vehicle reconstruction from stereo images that leverages the outputs of a novel multi-task CNN. Specifically, we train a CNN that outputs probability distributions for the vehicle's orientation and for both, vehicle keypoints and wireframe edges. Together with 3D stereo information we integrate the predicted distributions into a common probabilistic framework. We believe that the CNN-based detection of wireframe edges reduces the sensitivity to illumination conditions and object contrast and that using the raw probability maps instead of inferring keypoint positions reduces the sensitivity to keypoint localisation errors. We show that our method achieves state-of-the-art results, evaluating our method on the challenging KITTI benchmark and on our own new 'Stereo-Vehicle' dataset.

KW - 3D scene understanding

KW - Multi task CNN

KW - Pose estimation

KW - Vehicle reconstruction

UR - http://www.scopus.com/inward/record.url?scp=85082443659&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2102.10681

DO - 10.48550/arXiv.2102.10681

M3 - Conference contribution

AN - SCOPUS:85082443659

SN - 978-1-7281-5024-6

T3 - International Conference on Computer Vision Workshops (ICCV)

SP - 822

EP - 831

BT - 2019 International Conference on Computer Vision (ICCVW)

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2019 IEEE/CVF 17th International Conference on Computer Vision Workshop (ICCVW)

Y2 - 27 October 2019 through 28 October 2019

ER -

Research@Leibniz University

Probabilistic Vehicle Reconstruction Using a Multi-Task CNN

Authors

Research Organisations

Details

Publication series

Abstract

Keywords

ASJC Scopus subject areas

Cite this

By the same author(s)

Digital Slump Flow: Image-based assessment of fresh concrete homogeneity as part of the slump flow test

Image-based quality control of fresh concrete based on semantic segmentation algorithms

Fresh Concrete Properties from Stereoscopic Image Sequences

ReCyCONtrol project consortium – Key technologies for the digital revolution in concrete construction

Mechanisms of air bubble rise in cement suspensions studied by X-ray analysis

Digital Slump Flow: Image-based assessment of fresh concrete homogeneity as part of the slump flow test

Image-based quality control of fresh concrete based on semantic segmentation algorithms

Fresh Concrete Properties from Stereoscopic Image Sequences

ReCyCONtrol project consortium – Key technologies for the digital revolution in concrete construction

Mechanisms of air bubble rise in cement suspensions studied by X-ray analysis

Digital Slump Flow: Image-based assessment of fresh concrete homogeneity as part of the slump flow test