Probabilistic Vehicle Reconstruction Using a Multi-Task CNN

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

View graph of relations

Details

Original languageEnglish
Title of host publication2019 International Conference on Computer Vision (ICCVW)
Subtitle of host publicationProceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages822-831
Number of pages10
ISBN (electronic)978-1-7281-5023-9
ISBN (print)978-1-7281-5024-6
Publication statusPublished - 30 Oct 2019
Event2019 IEEE/CVF 17th International Conference on Computer Vision Workshop (ICCVW) - Seoul, Korea, Republic of
Duration: 27 Oct 201928 Oct 2019

Publication series

NameInternational Conference on Computer Vision Workshops (ICCV)
Volume2019
ISSN (Print)2473-9936
ISSN (electronic)2473-9944

Abstract

The retrieval of the 3D pose and shape of objects from images is an ill-posed problem. A common way to object reconstruction is to match entities such as keypoints, edges, or contours of a deformable 3D model, used as shape prior, to their corresponding entities inferred from the image. However, such approaches are highly sensitive to model initialisation, imprecise keypoint localisations and/or illumination conditions. In this paper, we present a probabilistic approach for shape-aware 3D vehicle reconstruction from stereo images that leverages the outputs of a novel multi-task CNN. Specifically, we train a CNN that outputs probability distributions for the vehicle's orientation and for both, vehicle keypoints and wireframe edges. Together with 3D stereo information we integrate the predicted distributions into a common probabilistic framework. We believe that the CNN-based detection of wireframe edges reduces the sensitivity to illumination conditions and object contrast and that using the raw probability maps instead of inferring keypoint positions reduces the sensitivity to keypoint localisation errors. We show that our method achieves state-of-the-art results, evaluating our method on the challenging KITTI benchmark and on our own new 'Stereo-Vehicle' dataset.

Keywords

    3D scene understanding, Multi task CNN, Pose estimation, Vehicle reconstruction

ASJC Scopus subject areas

Cite this

Probabilistic Vehicle Reconstruction Using a Multi-Task CNN. / Coenen, Max; Rottensteiner, Franz.
2019 International Conference on Computer Vision (ICCVW): Proceedings. Institute of Electrical and Electronics Engineers Inc., 2019. p. 822-831 (International Conference on Computer Vision Workshops (ICCV); Vol. 2019).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Coenen, M & Rottensteiner, F 2019, Probabilistic Vehicle Reconstruction Using a Multi-Task CNN. in 2019 International Conference on Computer Vision (ICCVW): Proceedings. International Conference on Computer Vision Workshops (ICCV), vol. 2019, Institute of Electrical and Electronics Engineers Inc., pp. 822-831, 2019 IEEE/CVF 17th International Conference on Computer Vision Workshop (ICCVW), Seoul, Korea, Republic of, 27 Oct 2019. https://doi.org/10.48550/arXiv.2102.10681, https://doi.org/10.1109/ICCVW.2019.00110
Coenen, M., & Rottensteiner, F. (2019). Probabilistic Vehicle Reconstruction Using a Multi-Task CNN. In 2019 International Conference on Computer Vision (ICCVW): Proceedings (pp. 822-831). (International Conference on Computer Vision Workshops (ICCV); Vol. 2019). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.48550/arXiv.2102.10681, https://doi.org/10.1109/ICCVW.2019.00110
Coenen M, Rottensteiner F. Probabilistic Vehicle Reconstruction Using a Multi-Task CNN. In 2019 International Conference on Computer Vision (ICCVW): Proceedings. Institute of Electrical and Electronics Engineers Inc. 2019. p. 822-831. (International Conference on Computer Vision Workshops (ICCV)). doi: 10.48550/arXiv.2102.10681, 10.1109/ICCVW.2019.00110
Coenen, Max ; Rottensteiner, Franz. / Probabilistic Vehicle Reconstruction Using a Multi-Task CNN. 2019 International Conference on Computer Vision (ICCVW): Proceedings. Institute of Electrical and Electronics Engineers Inc., 2019. pp. 822-831 (International Conference on Computer Vision Workshops (ICCV)).
Download
@inproceedings{4537ab2525d74c94bb79391728b3f36b,
title = "Probabilistic Vehicle Reconstruction Using a Multi-Task CNN",
abstract = "The retrieval of the 3D pose and shape of objects from images is an ill-posed problem. A common way to object reconstruction is to match entities such as keypoints, edges, or contours of a deformable 3D model, used as shape prior, to their corresponding entities inferred from the image. However, such approaches are highly sensitive to model initialisation, imprecise keypoint localisations and/or illumination conditions. In this paper, we present a probabilistic approach for shape-aware 3D vehicle reconstruction from stereo images that leverages the outputs of a novel multi-task CNN. Specifically, we train a CNN that outputs probability distributions for the vehicle's orientation and for both, vehicle keypoints and wireframe edges. Together with 3D stereo information we integrate the predicted distributions into a common probabilistic framework. We believe that the CNN-based detection of wireframe edges reduces the sensitivity to illumination conditions and object contrast and that using the raw probability maps instead of inferring keypoint positions reduces the sensitivity to keypoint localisation errors. We show that our method achieves state-of-the-art results, evaluating our method on the challenging KITTI benchmark and on our own new 'Stereo-Vehicle' dataset.",
keywords = "3D scene understanding, Multi task CNN, Pose estimation, Vehicle reconstruction",
author = "Max Coenen and Franz Rottensteiner",
note = "Funding information: This work was supported by the German Research Foun- dation (DFG) as a part of the Research Training Group i.c.sens [GRK2159].; 2019 IEEE/CVF 17th International Conference on Computer Vision Workshop (ICCVW), ICCVW ; Conference date: 27-10-2019 Through 28-10-2019",
year = "2019",
month = oct,
day = "30",
doi = "10.48550/arXiv.2102.10681",
language = "English",
isbn = "978-1-7281-5024-6",
series = "International Conference on Computer Vision Workshops (ICCV)",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "822--831",
booktitle = "2019 International Conference on Computer Vision (ICCVW)",
address = "United States",

}

Download

TY - GEN

T1 - Probabilistic Vehicle Reconstruction Using a Multi-Task CNN

AU - Coenen, Max

AU - Rottensteiner, Franz

N1 - Funding information: This work was supported by the German Research Foun- dation (DFG) as a part of the Research Training Group i.c.sens [GRK2159].

PY - 2019/10/30

Y1 - 2019/10/30

N2 - The retrieval of the 3D pose and shape of objects from images is an ill-posed problem. A common way to object reconstruction is to match entities such as keypoints, edges, or contours of a deformable 3D model, used as shape prior, to their corresponding entities inferred from the image. However, such approaches are highly sensitive to model initialisation, imprecise keypoint localisations and/or illumination conditions. In this paper, we present a probabilistic approach for shape-aware 3D vehicle reconstruction from stereo images that leverages the outputs of a novel multi-task CNN. Specifically, we train a CNN that outputs probability distributions for the vehicle's orientation and for both, vehicle keypoints and wireframe edges. Together with 3D stereo information we integrate the predicted distributions into a common probabilistic framework. We believe that the CNN-based detection of wireframe edges reduces the sensitivity to illumination conditions and object contrast and that using the raw probability maps instead of inferring keypoint positions reduces the sensitivity to keypoint localisation errors. We show that our method achieves state-of-the-art results, evaluating our method on the challenging KITTI benchmark and on our own new 'Stereo-Vehicle' dataset.

AB - The retrieval of the 3D pose and shape of objects from images is an ill-posed problem. A common way to object reconstruction is to match entities such as keypoints, edges, or contours of a deformable 3D model, used as shape prior, to their corresponding entities inferred from the image. However, such approaches are highly sensitive to model initialisation, imprecise keypoint localisations and/or illumination conditions. In this paper, we present a probabilistic approach for shape-aware 3D vehicle reconstruction from stereo images that leverages the outputs of a novel multi-task CNN. Specifically, we train a CNN that outputs probability distributions for the vehicle's orientation and for both, vehicle keypoints and wireframe edges. Together with 3D stereo information we integrate the predicted distributions into a common probabilistic framework. We believe that the CNN-based detection of wireframe edges reduces the sensitivity to illumination conditions and object contrast and that using the raw probability maps instead of inferring keypoint positions reduces the sensitivity to keypoint localisation errors. We show that our method achieves state-of-the-art results, evaluating our method on the challenging KITTI benchmark and on our own new 'Stereo-Vehicle' dataset.

KW - 3D scene understanding

KW - Multi task CNN

KW - Pose estimation

KW - Vehicle reconstruction

UR - http://www.scopus.com/inward/record.url?scp=85082443659&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2102.10681

DO - 10.48550/arXiv.2102.10681

M3 - Conference contribution

AN - SCOPUS:85082443659

SN - 978-1-7281-5024-6

T3 - International Conference on Computer Vision Workshops (ICCV)

SP - 822

EP - 831

BT - 2019 International Conference on Computer Vision (ICCVW)

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2019 IEEE/CVF 17th International Conference on Computer Vision Workshop (ICCVW)

Y2 - 27 October 2019 through 28 October 2019

ER -

By the same author(s)