Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching

Research output: Contribution to journalArticleResearchpeer review

Authors

External Research Organisations

  • Clausthal University of Technology
View graph of relations

Details

Original languageEnglish
Pages (from-to)365-380
Number of pages16
JournalPFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science
Volume91
Issue number5
Early online date28 Jul 2023
Publication statusPublished - Oct 2023

Abstract

Dense depth information can be reconstructed from stereo images using conventional hand-crafted as well as deep learning-based approaches. While deep-learning methods often show superior results compared to hand-crafted ones, they commonly learn geometric principles underlying the matching task from scratch and neglect that these principles have already been intensively studied and were considered explicitly in various models with great success in the past. In consequence, a broad range of principles and associated features need to be learned, limiting the possibility to focus on important details to also succeed in challenging image regions, such as close to depth discontinuities, thin objects and in weakly textured areas. To overcome this limitation, in this work, a hybrid technique, i.e., a combination of conventional hand-crafted and deep learning-based methods, is presented, addressing the task of dense stereo matching. More precisely, the input RGB stereo images are supplemented by a fourth image channel containing feature information obtained with a method based on expert knowledge. In addition, the assumption that edges in an image and discontinuities in the corresponding depth map coincide is modeled explicitly, allowing to predict the probability of being located next to a depth discontinuity per pixel. This information is used to guide the matching process and helps to sharpen correct depth discontinuities and to avoid the false prediction of such discontinuities, especially in weakly textured areas. The performance of the proposed method is investigated on three different data sets, including studies on the influence of the two methodological components as well as on the generalization capability. The results demonstrate that the presented hybrid approach can help to mitigate common limitations of deep learning-based methods and improves the quality of the estimated depth maps.

Keywords

    3D reconstruction, Depth estimation, Hybrid technique, Image matching

ASJC Scopus subject areas

Cite this

Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching. / Iqbal, Waseem; Paffenholz, Jens André; Mehltretter, Max.
In: PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science, Vol. 91, No. 5, 10.2023, p. 365-380.

Research output: Contribution to journalArticleResearchpeer review

Iqbal, W, Paffenholz, JA & Mehltretter, M 2023, 'Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching', PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science, vol. 91, no. 5, pp. 365-380. https://doi.org/10.1007/s41064-023-00252-0
Iqbal, W., Paffenholz, J. A., & Mehltretter, M. (2023). Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching. PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science, 91(5), 365-380. https://doi.org/10.1007/s41064-023-00252-0
Iqbal W, Paffenholz JA, Mehltretter M. Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching. PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science. 2023 Oct;91(5):365-380. Epub 2023 Jul 28. doi: 10.1007/s41064-023-00252-0
Iqbal, Waseem ; Paffenholz, Jens André ; Mehltretter, Max. / Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching. In: PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science. 2023 ; Vol. 91, No. 5. pp. 365-380.
Download
@article{6c62f433009f4586b2ba2fae3a6cbe5d,
title = "Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching",
abstract = "Dense depth information can be reconstructed from stereo images using conventional hand-crafted as well as deep learning-based approaches. While deep-learning methods often show superior results compared to hand-crafted ones, they commonly learn geometric principles underlying the matching task from scratch and neglect that these principles have already been intensively studied and were considered explicitly in various models with great success in the past. In consequence, a broad range of principles and associated features need to be learned, limiting the possibility to focus on important details to also succeed in challenging image regions, such as close to depth discontinuities, thin objects and in weakly textured areas. To overcome this limitation, in this work, a hybrid technique, i.e., a combination of conventional hand-crafted and deep learning-based methods, is presented, addressing the task of dense stereo matching. More precisely, the input RGB stereo images are supplemented by a fourth image channel containing feature information obtained with a method based on expert knowledge. In addition, the assumption that edges in an image and discontinuities in the corresponding depth map coincide is modeled explicitly, allowing to predict the probability of being located next to a depth discontinuity per pixel. This information is used to guide the matching process and helps to sharpen correct depth discontinuities and to avoid the false prediction of such discontinuities, especially in weakly textured areas. The performance of the proposed method is investigated on three different data sets, including studies on the influence of the two methodological components as well as on the generalization capability. The results demonstrate that the presented hybrid approach can help to mitigate common limitations of deep learning-based methods and improves the quality of the estimated depth maps.",
keywords = "3D reconstruction, Depth estimation, Hybrid technique, Image matching",
author = "Waseem Iqbal and Paffenholz, {Jens Andr{\'e}} and Max Mehltretter",
note = "Funding Information: Open Access funding enabled and organized by Projekt DEAL",
year = "2023",
month = oct,
doi = "10.1007/s41064-023-00252-0",
language = "English",
volume = "91",
pages = "365--380",
number = "5",

}

Download

TY - JOUR

T1 - Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching

AU - Iqbal, Waseem

AU - Paffenholz, Jens André

AU - Mehltretter, Max

N1 - Funding Information: Open Access funding enabled and organized by Projekt DEAL

PY - 2023/10

Y1 - 2023/10

N2 - Dense depth information can be reconstructed from stereo images using conventional hand-crafted as well as deep learning-based approaches. While deep-learning methods often show superior results compared to hand-crafted ones, they commonly learn geometric principles underlying the matching task from scratch and neglect that these principles have already been intensively studied and were considered explicitly in various models with great success in the past. In consequence, a broad range of principles and associated features need to be learned, limiting the possibility to focus on important details to also succeed in challenging image regions, such as close to depth discontinuities, thin objects and in weakly textured areas. To overcome this limitation, in this work, a hybrid technique, i.e., a combination of conventional hand-crafted and deep learning-based methods, is presented, addressing the task of dense stereo matching. More precisely, the input RGB stereo images are supplemented by a fourth image channel containing feature information obtained with a method based on expert knowledge. In addition, the assumption that edges in an image and discontinuities in the corresponding depth map coincide is modeled explicitly, allowing to predict the probability of being located next to a depth discontinuity per pixel. This information is used to guide the matching process and helps to sharpen correct depth discontinuities and to avoid the false prediction of such discontinuities, especially in weakly textured areas. The performance of the proposed method is investigated on three different data sets, including studies on the influence of the two methodological components as well as on the generalization capability. The results demonstrate that the presented hybrid approach can help to mitigate common limitations of deep learning-based methods and improves the quality of the estimated depth maps.

AB - Dense depth information can be reconstructed from stereo images using conventional hand-crafted as well as deep learning-based approaches. While deep-learning methods often show superior results compared to hand-crafted ones, they commonly learn geometric principles underlying the matching task from scratch and neglect that these principles have already been intensively studied and were considered explicitly in various models with great success in the past. In consequence, a broad range of principles and associated features need to be learned, limiting the possibility to focus on important details to also succeed in challenging image regions, such as close to depth discontinuities, thin objects and in weakly textured areas. To overcome this limitation, in this work, a hybrid technique, i.e., a combination of conventional hand-crafted and deep learning-based methods, is presented, addressing the task of dense stereo matching. More precisely, the input RGB stereo images are supplemented by a fourth image channel containing feature information obtained with a method based on expert knowledge. In addition, the assumption that edges in an image and discontinuities in the corresponding depth map coincide is modeled explicitly, allowing to predict the probability of being located next to a depth discontinuity per pixel. This information is used to guide the matching process and helps to sharpen correct depth discontinuities and to avoid the false prediction of such discontinuities, especially in weakly textured areas. The performance of the proposed method is investigated on three different data sets, including studies on the influence of the two methodological components as well as on the generalization capability. The results demonstrate that the presented hybrid approach can help to mitigate common limitations of deep learning-based methods and improves the quality of the estimated depth maps.

KW - 3D reconstruction

KW - Depth estimation

KW - Hybrid technique

KW - Image matching

UR - http://www.scopus.com/inward/record.url?scp=85165968631&partnerID=8YFLogxK

U2 - 10.1007/s41064-023-00252-0

DO - 10.1007/s41064-023-00252-0

M3 - Article

AN - SCOPUS:85165968631

VL - 91

SP - 365

EP - 380

JO - PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science

JF - PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science

SN - 2512-2789

IS - 5

ER -

By the same author(s)