Loading [MathJax]/extensions/tex2jax.js

Analyzing Results of Depth Estimation Models with Monocular Criteria

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Jonas Theiner
  • Nils Nommensen
  • Jim Rhotert
  • Matthias Springstein
  • Eric Muller-Budack
  • Ralph Ewerth

Research Organisations

External Research Organisations

  • German National Library of Science and Technology (TIB)

Details

Original languageEnglish
Title of host publication2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops
Subtitle of host publicationCVPRW
PublisherIEEE Computer Society
Pages3739-3743
Number of pages5
ISBN (electronic)9798350302493
ISBN (print)979-8-3503-0250-9
Publication statusPublished - 2023
Event2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPRW 2023 - Vancouver, Canada
Duration: 17 Jun 202324 Jun 2023

Publication series

NameIEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops
Volume2023-June
ISSN (Print)2160-7508
ISSN (electronic)2160-7516

Abstract

Monocular depth estimation is an essential but ill-posed (computer) vision task. While human visual perception of depth relies on several monocular depth clues, such as occlusion of objects, relative height, usual object size, linear perspective, deep learning models have to implicitly learn these cues from labeled training data to determine depth. In this paper, we investigate whether monocular depth criteria from human vision are violated for certain image instances given a model's predictions. We consider the task of depth estimation as a ranking problem, i.e., for a given pair of points, we estimate which point is nearer to the camera. In particular, we model four monocular depth criteria to automatically predict a subset of point pairs and infer their depth relation. Our experiments show that the implemented depth criteria achieve comparable performance to deep learning models. This allows the investigation of models with regard to the plausibility of predictions by finding image instances where the prediction is incorrect according to modeled human visual perception.

ASJC Scopus subject areas

Cite this

Analyzing Results of Depth Estimation Models with Monocular Criteria. / Theiner, Jonas; Nommensen, Nils; Rhotert, Jim et al.
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops: CVPRW. IEEE Computer Society, 2023. p. 3739-3743 (IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops; Vol. 2023-June).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Theiner, J, Nommensen, N, Rhotert, J, Springstein, M, Muller-Budack, E & Ewerth, R 2023, Analyzing Results of Depth Estimation Models with Monocular Criteria. in 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops: CVPRW. IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, vol. 2023-June, IEEE Computer Society, pp. 3739-3743, 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPRW 2023, Vancouver, British Columbia, Canada, 17 Jun 2023. https://doi.org/10.1109/CVPRW59228.2023.00385
Theiner, J., Nommensen, N., Rhotert, J., Springstein, M., Muller-Budack, E., & Ewerth, R. (2023). Analyzing Results of Depth Estimation Models with Monocular Criteria. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops: CVPRW (pp. 3739-3743). (IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops; Vol. 2023-June). IEEE Computer Society. https://doi.org/10.1109/CVPRW59228.2023.00385
Theiner J, Nommensen N, Rhotert J, Springstein M, Muller-Budack E, Ewerth R. Analyzing Results of Depth Estimation Models with Monocular Criteria. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops: CVPRW. IEEE Computer Society. 2023. p. 3739-3743. (IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops). doi: 10.1109/CVPRW59228.2023.00385
Theiner, Jonas ; Nommensen, Nils ; Rhotert, Jim et al. / Analyzing Results of Depth Estimation Models with Monocular Criteria. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops: CVPRW. IEEE Computer Society, 2023. pp. 3739-3743 (IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops).
Download
@inproceedings{68d9e08448dc4bcd943f05cefb82b8ce,
title = "Analyzing Results of Depth Estimation Models with Monocular Criteria",
abstract = "Monocular depth estimation is an essential but ill-posed (computer) vision task. While human visual perception of depth relies on several monocular depth clues, such as occlusion of objects, relative height, usual object size, linear perspective, deep learning models have to implicitly learn these cues from labeled training data to determine depth. In this paper, we investigate whether monocular depth criteria from human vision are violated for certain image instances given a model's predictions. We consider the task of depth estimation as a ranking problem, i.e., for a given pair of points, we estimate which point is nearer to the camera. In particular, we model four monocular depth criteria to automatically predict a subset of point pairs and infer their depth relation. Our experiments show that the implemented depth criteria achieve comparable performance to deep learning models. This allows the investigation of models with regard to the plausibility of predictions by finding image instances where the prediction is incorrect according to modeled human visual perception.",
author = "Jonas Theiner and Nils Nommensen and Jim Rhotert and Matthias Springstein and Eric Muller-Budack and Ralph Ewerth",
note = "Funding Information: Acknowledgment This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – project number 420493178. ; 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPRW 2023, CVPR 2023 ; Conference date: 17-06-2023 Through 24-06-2023",
year = "2023",
doi = "10.1109/CVPRW59228.2023.00385",
language = "English",
isbn = "979-8-3503-0250-9",
series = "IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops",
publisher = "IEEE Computer Society",
pages = "3739--3743",
booktitle = "2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops",
address = "United States",

}

Download

TY - GEN

T1 - Analyzing Results of Depth Estimation Models with Monocular Criteria

AU - Theiner, Jonas

AU - Nommensen, Nils

AU - Rhotert, Jim

AU - Springstein, Matthias

AU - Muller-Budack, Eric

AU - Ewerth, Ralph

N1 - Funding Information: Acknowledgment This work was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – project number 420493178.

PY - 2023

Y1 - 2023

N2 - Monocular depth estimation is an essential but ill-posed (computer) vision task. While human visual perception of depth relies on several monocular depth clues, such as occlusion of objects, relative height, usual object size, linear perspective, deep learning models have to implicitly learn these cues from labeled training data to determine depth. In this paper, we investigate whether monocular depth criteria from human vision are violated for certain image instances given a model's predictions. We consider the task of depth estimation as a ranking problem, i.e., for a given pair of points, we estimate which point is nearer to the camera. In particular, we model four monocular depth criteria to automatically predict a subset of point pairs and infer their depth relation. Our experiments show that the implemented depth criteria achieve comparable performance to deep learning models. This allows the investigation of models with regard to the plausibility of predictions by finding image instances where the prediction is incorrect according to modeled human visual perception.

AB - Monocular depth estimation is an essential but ill-posed (computer) vision task. While human visual perception of depth relies on several monocular depth clues, such as occlusion of objects, relative height, usual object size, linear perspective, deep learning models have to implicitly learn these cues from labeled training data to determine depth. In this paper, we investigate whether monocular depth criteria from human vision are violated for certain image instances given a model's predictions. We consider the task of depth estimation as a ranking problem, i.e., for a given pair of points, we estimate which point is nearer to the camera. In particular, we model four monocular depth criteria to automatically predict a subset of point pairs and infer their depth relation. Our experiments show that the implemented depth criteria achieve comparable performance to deep learning models. This allows the investigation of models with regard to the plausibility of predictions by finding image instances where the prediction is incorrect according to modeled human visual perception.

UR - http://www.scopus.com/inward/record.url?scp=85170820212&partnerID=8YFLogxK

U2 - 10.1109/CVPRW59228.2023.00385

DO - 10.1109/CVPRW59228.2023.00385

M3 - Conference contribution

AN - SCOPUS:85170820212

SN - 979-8-3503-0250-9

T3 - IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops

SP - 3739

EP - 3743

BT - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops

PB - IEEE Computer Society

T2 - 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPRW 2023

Y2 - 17 June 2023 through 24 June 2023

ER -