Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Autoren

Externe Organisationen

  • Technische Universität Clausthal
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Seiten (von - bis)365-380
Seitenumfang16
FachzeitschriftPFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science
Jahrgang91
Ausgabenummer5
Frühes Online-Datum28 Juli 2023
PublikationsstatusVeröffentlicht - Okt. 2023

Abstract

Dense depth information can be reconstructed from stereo images using conventional hand-crafted as well as deep learning-based approaches. While deep-learning methods often show superior results compared to hand-crafted ones, they commonly learn geometric principles underlying the matching task from scratch and neglect that these principles have already been intensively studied and were considered explicitly in various models with great success in the past. In consequence, a broad range of principles and associated features need to be learned, limiting the possibility to focus on important details to also succeed in challenging image regions, such as close to depth discontinuities, thin objects and in weakly textured areas. To overcome this limitation, in this work, a hybrid technique, i.e., a combination of conventional hand-crafted and deep learning-based methods, is presented, addressing the task of dense stereo matching. More precisely, the input RGB stereo images are supplemented by a fourth image channel containing feature information obtained with a method based on expert knowledge. In addition, the assumption that edges in an image and discontinuities in the corresponding depth map coincide is modeled explicitly, allowing to predict the probability of being located next to a depth discontinuity per pixel. This information is used to guide the matching process and helps to sharpen correct depth discontinuities and to avoid the false prediction of such discontinuities, especially in weakly textured areas. The performance of the proposed method is investigated on three different data sets, including studies on the influence of the two methodological components as well as on the generalization capability. The results demonstrate that the presented hybrid approach can help to mitigate common limitations of deep learning-based methods and improves the quality of the estimated depth maps.

ASJC Scopus Sachgebiete

Zitieren

Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching. / Iqbal, Waseem; Paffenholz, Jens André; Mehltretter, Max.
in: PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science, Jahrgang 91, Nr. 5, 10.2023, S. 365-380.

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Iqbal, W, Paffenholz, JA & Mehltretter, M 2023, 'Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching', PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science, Jg. 91, Nr. 5, S. 365-380. https://doi.org/10.1007/s41064-023-00252-0
Iqbal, W., Paffenholz, J. A., & Mehltretter, M. (2023). Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching. PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science, 91(5), 365-380. https://doi.org/10.1007/s41064-023-00252-0
Iqbal W, Paffenholz JA, Mehltretter M. Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching. PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science. 2023 Okt;91(5):365-380. Epub 2023 Jul 28. doi: 10.1007/s41064-023-00252-0
Iqbal, Waseem ; Paffenholz, Jens André ; Mehltretter, Max. / Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching. in: PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science. 2023 ; Jahrgang 91, Nr. 5. S. 365-380.
Download
@article{6c62f433009f4586b2ba2fae3a6cbe5d,
title = "Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching",
abstract = "Dense depth information can be reconstructed from stereo images using conventional hand-crafted as well as deep learning-based approaches. While deep-learning methods often show superior results compared to hand-crafted ones, they commonly learn geometric principles underlying the matching task from scratch and neglect that these principles have already been intensively studied and were considered explicitly in various models with great success in the past. In consequence, a broad range of principles and associated features need to be learned, limiting the possibility to focus on important details to also succeed in challenging image regions, such as close to depth discontinuities, thin objects and in weakly textured areas. To overcome this limitation, in this work, a hybrid technique, i.e., a combination of conventional hand-crafted and deep learning-based methods, is presented, addressing the task of dense stereo matching. More precisely, the input RGB stereo images are supplemented by a fourth image channel containing feature information obtained with a method based on expert knowledge. In addition, the assumption that edges in an image and discontinuities in the corresponding depth map coincide is modeled explicitly, allowing to predict the probability of being located next to a depth discontinuity per pixel. This information is used to guide the matching process and helps to sharpen correct depth discontinuities and to avoid the false prediction of such discontinuities, especially in weakly textured areas. The performance of the proposed method is investigated on three different data sets, including studies on the influence of the two methodological components as well as on the generalization capability. The results demonstrate that the presented hybrid approach can help to mitigate common limitations of deep learning-based methods and improves the quality of the estimated depth maps.",
keywords = "3D reconstruction, Depth estimation, Hybrid technique, Image matching",
author = "Waseem Iqbal and Paffenholz, {Jens Andr{\'e}} and Max Mehltretter",
note = "Funding Information: Open Access funding enabled and organized by Projekt DEAL",
year = "2023",
month = oct,
doi = "10.1007/s41064-023-00252-0",
language = "English",
volume = "91",
pages = "365--380",
number = "5",

}

Download

TY - JOUR

T1 - Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching

AU - Iqbal, Waseem

AU - Paffenholz, Jens André

AU - Mehltretter, Max

N1 - Funding Information: Open Access funding enabled and organized by Projekt DEAL

PY - 2023/10

Y1 - 2023/10

N2 - Dense depth information can be reconstructed from stereo images using conventional hand-crafted as well as deep learning-based approaches. While deep-learning methods often show superior results compared to hand-crafted ones, they commonly learn geometric principles underlying the matching task from scratch and neglect that these principles have already been intensively studied and were considered explicitly in various models with great success in the past. In consequence, a broad range of principles and associated features need to be learned, limiting the possibility to focus on important details to also succeed in challenging image regions, such as close to depth discontinuities, thin objects and in weakly textured areas. To overcome this limitation, in this work, a hybrid technique, i.e., a combination of conventional hand-crafted and deep learning-based methods, is presented, addressing the task of dense stereo matching. More precisely, the input RGB stereo images are supplemented by a fourth image channel containing feature information obtained with a method based on expert knowledge. In addition, the assumption that edges in an image and discontinuities in the corresponding depth map coincide is modeled explicitly, allowing to predict the probability of being located next to a depth discontinuity per pixel. This information is used to guide the matching process and helps to sharpen correct depth discontinuities and to avoid the false prediction of such discontinuities, especially in weakly textured areas. The performance of the proposed method is investigated on three different data sets, including studies on the influence of the two methodological components as well as on the generalization capability. The results demonstrate that the presented hybrid approach can help to mitigate common limitations of deep learning-based methods and improves the quality of the estimated depth maps.

AB - Dense depth information can be reconstructed from stereo images using conventional hand-crafted as well as deep learning-based approaches. While deep-learning methods often show superior results compared to hand-crafted ones, they commonly learn geometric principles underlying the matching task from scratch and neglect that these principles have already been intensively studied and were considered explicitly in various models with great success in the past. In consequence, a broad range of principles and associated features need to be learned, limiting the possibility to focus on important details to also succeed in challenging image regions, such as close to depth discontinuities, thin objects and in weakly textured areas. To overcome this limitation, in this work, a hybrid technique, i.e., a combination of conventional hand-crafted and deep learning-based methods, is presented, addressing the task of dense stereo matching. More precisely, the input RGB stereo images are supplemented by a fourth image channel containing feature information obtained with a method based on expert knowledge. In addition, the assumption that edges in an image and discontinuities in the corresponding depth map coincide is modeled explicitly, allowing to predict the probability of being located next to a depth discontinuity per pixel. This information is used to guide the matching process and helps to sharpen correct depth discontinuities and to avoid the false prediction of such discontinuities, especially in weakly textured areas. The performance of the proposed method is investigated on three different data sets, including studies on the influence of the two methodological components as well as on the generalization capability. The results demonstrate that the presented hybrid approach can help to mitigate common limitations of deep learning-based methods and improves the quality of the estimated depth maps.

KW - 3D reconstruction

KW - Depth estimation

KW - Hybrid technique

KW - Image matching

UR - http://www.scopus.com/inward/record.url?scp=85165968631&partnerID=8YFLogxK

U2 - 10.1007/s41064-023-00252-0

DO - 10.1007/s41064-023-00252-0

M3 - Article

AN - SCOPUS:85165968631

VL - 91

SP - 365

EP - 380

JO - PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science

JF - PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science

SN - 2512-2789

IS - 5

ER -

Von denselben Autoren