Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching

Waseem Iqbal; Jens André Paffenholz; Max Mehltretter

doi:10.1007/s41064-023-00252-0

Details

Originalsprache	Englisch
Seiten (von - bis)	365-380
Seitenumfang	16
Fachzeitschrift	PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science
Jahrgang	91
Ausgabenummer	5
Frühes Online-Datum	28 Juli 2023
Publikationsstatus	Veröffentlicht - Okt. 2023

Abstract

Dense depth information can be reconstructed from stereo images using conventional hand-crafted as well as deep learning-based approaches. While deep-learning methods often show superior results compared to hand-crafted ones, they commonly learn geometric principles underlying the matching task from scratch and neglect that these principles have already been intensively studied and were considered explicitly in various models with great success in the past. In consequence, a broad range of principles and associated features need to be learned, limiting the possibility to focus on important details to also succeed in challenging image regions, such as close to depth discontinuities, thin objects and in weakly textured areas. To overcome this limitation, in this work, a hybrid technique, i.e., a combination of conventional hand-crafted and deep learning-based methods, is presented, addressing the task of dense stereo matching. More precisely, the input RGB stereo images are supplemented by a fourth image channel containing feature information obtained with a method based on expert knowledge. In addition, the assumption that edges in an image and discontinuities in the corresponding depth map coincide is modeled explicitly, allowing to predict the probability of being located next to a depth discontinuity per pixel. This information is used to guide the matching process and helps to sharpen correct depth discontinuities and to avoid the false prediction of such discontinuities, especially in weakly textured areas. The performance of the proposed method is investigated on three different data sets, including studies on the influence of the two methodological components as well as on the generalization capability. The results demonstrate that the presented hybrid approach can help to mitigate common limitations of deep learning-based methods and improves the quality of the estimated depth maps.

ASJC Scopus Sachgebiete

Sozialwissenschaften (insg.)
Geografie, Planung und Entwicklung
Physik und Astronomie (insg.)
Instrumentierung
Erdkunde und Planetologie (insg.)
Erdkunde und Planetologie (sonstige)

Zitieren

Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching. / Iqbal, Waseem; Paffenholz, Jens André; Mehltretter, Max.
in: PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science, Jahrgang 91, Nr. 5, 10.2023, S. 365-380.

Publikation: Beitrag in Fachzeitschrift › Artikel › Forschung › Peer-Review

Iqbal, W, Paffenholz, JA & Mehltretter, M 2023, 'Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching', PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science, Jg. 91, Nr. 5, S. 365-380. https://doi.org/10.1007/s41064-023-00252-0

Iqbal, W., Paffenholz, J. A., & Mehltretter, M. (2023). Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching. PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science, 91(5), 365-380. https://doi.org/10.1007/s41064-023-00252-0

Iqbal W, Paffenholz JA, Mehltretter M. Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching. PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science. 2023 Okt;91(5):365-380. Epub 2023 Jul 28. doi: 10.1007/s41064-023-00252-0

Iqbal, Waseem ; Paffenholz, Jens André ; Mehltretter, Max. / Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching. in: PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science. 2023 ; Jahrgang 91, Nr. 5. S. 365-380.

Download

@article{6c62f433009f4586b2ba2fae3a6cbe5d,

title = "Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching",

abstract = "Dense depth information can be reconstructed from stereo images using conventional hand-crafted as well as deep learning-based approaches. While deep-learning methods often show superior results compared to hand-crafted ones, they commonly learn geometric principles underlying the matching task from scratch and neglect that these principles have already been intensively studied and were considered explicitly in various models with great success in the past. In consequence, a broad range of principles and associated features need to be learned, limiting the possibility to focus on important details to also succeed in challenging image regions, such as close to depth discontinuities, thin objects and in weakly textured areas. To overcome this limitation, in this work, a hybrid technique, i.e., a combination of conventional hand-crafted and deep learning-based methods, is presented, addressing the task of dense stereo matching. More precisely, the input RGB stereo images are supplemented by a fourth image channel containing feature information obtained with a method based on expert knowledge. In addition, the assumption that edges in an image and discontinuities in the corresponding depth map coincide is modeled explicitly, allowing to predict the probability of being located next to a depth discontinuity per pixel. This information is used to guide the matching process and helps to sharpen correct depth discontinuities and to avoid the false prediction of such discontinuities, especially in weakly textured areas. The performance of the proposed method is investigated on three different data sets, including studies on the influence of the two methodological components as well as on the generalization capability. The results demonstrate that the presented hybrid approach can help to mitigate common limitations of deep learning-based methods and improves the quality of the estimated depth maps.",

keywords = "3D reconstruction, Depth estimation, Hybrid technique, Image matching",

author = "Waseem Iqbal and Paffenholz, {Jens Andr{\'e}} and Max Mehltretter",

note = "Funding Information: Open Access funding enabled and organized by Projekt DEAL",

year = "2023",

month = oct,

doi = "10.1007/s41064-023-00252-0",

language = "English",

volume = "91",

pages = "365--380",

number = "5",

}

Download

TY - JOUR

T1 - Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching

AU - Iqbal, Waseem

AU - Paffenholz, Jens André

AU - Mehltretter, Max

N1 - Funding Information: Open Access funding enabled and organized by Projekt DEAL

PY - 2023/10

Y1 - 2023/10

N2 - Dense depth information can be reconstructed from stereo images using conventional hand-crafted as well as deep learning-based approaches. While deep-learning methods often show superior results compared to hand-crafted ones, they commonly learn geometric principles underlying the matching task from scratch and neglect that these principles have already been intensively studied and were considered explicitly in various models with great success in the past. In consequence, a broad range of principles and associated features need to be learned, limiting the possibility to focus on important details to also succeed in challenging image regions, such as close to depth discontinuities, thin objects and in weakly textured areas. To overcome this limitation, in this work, a hybrid technique, i.e., a combination of conventional hand-crafted and deep learning-based methods, is presented, addressing the task of dense stereo matching. More precisely, the input RGB stereo images are supplemented by a fourth image channel containing feature information obtained with a method based on expert knowledge. In addition, the assumption that edges in an image and discontinuities in the corresponding depth map coincide is modeled explicitly, allowing to predict the probability of being located next to a depth discontinuity per pixel. This information is used to guide the matching process and helps to sharpen correct depth discontinuities and to avoid the false prediction of such discontinuities, especially in weakly textured areas. The performance of the proposed method is investigated on three different data sets, including studies on the influence of the two methodological components as well as on the generalization capability. The results demonstrate that the presented hybrid approach can help to mitigate common limitations of deep learning-based methods and improves the quality of the estimated depth maps.

AB - Dense depth information can be reconstructed from stereo images using conventional hand-crafted as well as deep learning-based approaches. While deep-learning methods often show superior results compared to hand-crafted ones, they commonly learn geometric principles underlying the matching task from scratch and neglect that these principles have already been intensively studied and were considered explicitly in various models with great success in the past. In consequence, a broad range of principles and associated features need to be learned, limiting the possibility to focus on important details to also succeed in challenging image regions, such as close to depth discontinuities, thin objects and in weakly textured areas. To overcome this limitation, in this work, a hybrid technique, i.e., a combination of conventional hand-crafted and deep learning-based methods, is presented, addressing the task of dense stereo matching. More precisely, the input RGB stereo images are supplemented by a fourth image channel containing feature information obtained with a method based on expert knowledge. In addition, the assumption that edges in an image and discontinuities in the corresponding depth map coincide is modeled explicitly, allowing to predict the probability of being located next to a depth discontinuity per pixel. This information is used to guide the matching process and helps to sharpen correct depth discontinuities and to avoid the false prediction of such discontinuities, especially in weakly textured areas. The performance of the proposed method is investigated on three different data sets, including studies on the influence of the two methodological components as well as on the generalization capability. The results demonstrate that the presented hybrid approach can help to mitigate common limitations of deep learning-based methods and improves the quality of the estimated depth maps.

KW - 3D reconstruction

KW - Depth estimation

KW - Hybrid technique

KW - Image matching

UR - http://www.scopus.com/inward/record.url?scp=85165968631&partnerID=8YFLogxK

U2 - 10.1007/s41064-023-00252-0

DO - 10.1007/s41064-023-00252-0

M3 - Article

AN - SCOPUS:85165968631

VL - 91

SP - 365

EP - 380

JO - PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science

JF - PFG - Journal of Photogrammetry, Remote Sensing and Geoinformation Science

SN - 2512-2789

IS - 5

ER -

Research@Leibniz University

Guiding Deep Learning with Expert Knowledge for Dense Stereo Matching

Autoren

Organisationseinheiten

Externe Organisationen

Details

Abstract

ASJC Scopus Sachgebiete

Zitieren

Von denselben Autoren

Cooperative Image Orientation with Dynamic Objects

Editorial for Special Issue: 75 Years IPI—an Overview of Current Research Activities in Photogrammetry and Remote Sensing

Fresh Concrete Properties from Stereoscopic Image Sequences

Monocular Pose and Shape Reconstruction of Vehicles in UAV imagery using a Multi-task CNN

Self-Supervised 3D Semantic Occupancy Prediction from Multi-View 2D Surround Images