Semantic segmentation of manmade landscape structures in digital terrain models

B. Kazimi; F. Thiemann; M. Sester

doi:10.5194/isprs-annals-IV-2-W7-87-2019

Details

Originalsprache	Englisch
Seiten (von - bis)	87-94
Seitenumfang	8
Fachzeitschrift	ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Jahrgang	4
Ausgabenummer	2/W7
Frühes Online-Datum	16 Sept. 2019
Publikationsstatus	Veröffentlicht - 2019
Veranstaltung	1st Photogrammetric Image Analysis and Munich Remote Sensing Symposium, PIA 2019+MRSS 2019 - Munich, Deutschland Dauer: 18 Sept. 2019 → 20 Sept. 2019

Abstract

We explore the use of semantic segmentation in Digital Terrain Models (DTMS) for detecting manmade landscape structures in archaeological sites. DTM data are stored and processed as large matrices of depth 1 as opposed to depth 3 in RGB images. The matrices usually contain continuous real-valued information upper bound of which is not fixed, such as distance or height from a reference surface. This is different from RGB images that contain integer values in a fixed range of 0 to 255. Additionally, RGB images are usually stored in smaller multidimensional matrices, and are more suitable as inputs for a neural network while the large DTMs are necessary to be split into smaller sub-matrices to be used by neural networks. Thus, while the spatial information of pixels in RGB images are important only locally within a single image, for DTM data, they are important locally, within a single sub-matrix processed for neural network, and also globally, in relation to the neighboring sub-matrices. To cope with the two differences, we apply min-max normalization to each input matrix fed to the neural network, and use a slightly modified version of DeepLabv3+ model for semantic segmentation. We show that with the architecture change, and the preprocessing, better results are achieved.

ASJC Scopus Sachgebiete

Erdkunde und Planetologie (insg.)
Erdkunde und Planetologie (sonstige)
Umweltwissenschaften (insg.)
Umweltwissenschaften (sonstige)
Physik und Astronomie (insg.)
Instrumentierung

Zitieren

Semantic segmentation of manmade landscape structures in digital terrain models. / Kazimi, B.; Thiemann, F.; Sester, M.
in: ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Jahrgang 4, Nr. 2/W7, 2019, S. 87-94.

Publikation: Beitrag in Fachzeitschrift › Konferenzaufsatz in Fachzeitschrift › Forschung › Peer-Review

Kazimi, B, Thiemann, F & Sester, M 2019, 'Semantic segmentation of manmade landscape structures in digital terrain models', ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Jg. 4, Nr. 2/W7, S. 87-94. https://doi.org/10.5194/isprs-annals-IV-2-W7-87-2019

Kazimi, B., Thiemann, F., & Sester, M. (2019). Semantic segmentation of manmade landscape structures in digital terrain models. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 4(2/W7), 87-94. https://doi.org/10.5194/isprs-annals-IV-2-W7-87-2019

Kazimi B, Thiemann F, Sester M. Semantic segmentation of manmade landscape structures in digital terrain models. ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences. 2019;4(2/W7):87-94. Epub 2019 Sep 16. doi: 10.5194/isprs-annals-IV-2-W7-87-2019

Kazimi, B. ; Thiemann, F. ; Sester, M. / Semantic segmentation of manmade landscape structures in digital terrain models. in: ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences. 2019 ; Jahrgang 4, Nr. 2/W7. S. 87-94.

Download

@article{119370591ea74e8689f1628880736e4c,

title = "Semantic segmentation of manmade landscape structures in digital terrain models",

abstract = "We explore the use of semantic segmentation in Digital Terrain Models (DTMS) for detecting manmade landscape structures in archaeological sites. DTM data are stored and processed as large matrices of depth 1 as opposed to depth 3 in RGB images. The matrices usually contain continuous real-valued information upper bound of which is not fixed, such as distance or height from a reference surface. This is different from RGB images that contain integer values in a fixed range of 0 to 255. Additionally, RGB images are usually stored in smaller multidimensional matrices, and are more suitable as inputs for a neural network while the large DTMs are necessary to be split into smaller sub-matrices to be used by neural networks. Thus, while the spatial information of pixels in RGB images are important only locally within a single image, for DTM data, they are important locally, within a single sub-matrix processed for neural network, and also globally, in relation to the neighboring sub-matrices. To cope with the two differences, we apply min-max normalization to each input matrix fed to the neural network, and use a slightly modified version of DeepLabv3+ model for semantic segmentation. We show that with the architecture change, and the preprocessing, better results are achieved.",

keywords = "Deep Learning, Digital Terrain Models, Laser Scanning, Object Detection, Semantic Segmentation",

author = "B. Kazimi and F. Thiemann and M. Sester",

year = "2019",

doi = "10.5194/isprs-annals-IV-2-W7-87-2019",

language = "English",

volume = "4",

pages = "87--94",

number = "2/W7",

note = "1st Photogrammetric Image Analysis and Munich Remote Sensing Symposium, PIA 2019+MRSS 2019 ; Conference date: 18-09-2019 Through 20-09-2019",

}

Download

TY - JOUR

T1 - Semantic segmentation of manmade landscape structures in digital terrain models

AU - Kazimi, B.

AU - Thiemann, F.

AU - Sester, M.

PY - 2019

Y1 - 2019

N2 - We explore the use of semantic segmentation in Digital Terrain Models (DTMS) for detecting manmade landscape structures in archaeological sites. DTM data are stored and processed as large matrices of depth 1 as opposed to depth 3 in RGB images. The matrices usually contain continuous real-valued information upper bound of which is not fixed, such as distance or height from a reference surface. This is different from RGB images that contain integer values in a fixed range of 0 to 255. Additionally, RGB images are usually stored in smaller multidimensional matrices, and are more suitable as inputs for a neural network while the large DTMs are necessary to be split into smaller sub-matrices to be used by neural networks. Thus, while the spatial information of pixels in RGB images are important only locally within a single image, for DTM data, they are important locally, within a single sub-matrix processed for neural network, and also globally, in relation to the neighboring sub-matrices. To cope with the two differences, we apply min-max normalization to each input matrix fed to the neural network, and use a slightly modified version of DeepLabv3+ model for semantic segmentation. We show that with the architecture change, and the preprocessing, better results are achieved.

AB - We explore the use of semantic segmentation in Digital Terrain Models (DTMS) for detecting manmade landscape structures in archaeological sites. DTM data are stored and processed as large matrices of depth 1 as opposed to depth 3 in RGB images. The matrices usually contain continuous real-valued information upper bound of which is not fixed, such as distance or height from a reference surface. This is different from RGB images that contain integer values in a fixed range of 0 to 255. Additionally, RGB images are usually stored in smaller multidimensional matrices, and are more suitable as inputs for a neural network while the large DTMs are necessary to be split into smaller sub-matrices to be used by neural networks. Thus, while the spatial information of pixels in RGB images are important only locally within a single image, for DTM data, they are important locally, within a single sub-matrix processed for neural network, and also globally, in relation to the neighboring sub-matrices. To cope with the two differences, we apply min-max normalization to each input matrix fed to the neural network, and use a slightly modified version of DeepLabv3+ model for semantic segmentation. We show that with the architecture change, and the preprocessing, better results are achieved.

KW - Deep Learning

KW - Digital Terrain Models

KW - Laser Scanning

KW - Object Detection

KW - Semantic Segmentation

UR - http://www.scopus.com/inward/record.url?scp=85084680018&partnerID=8YFLogxK

U2 - 10.5194/isprs-annals-IV-2-W7-87-2019

DO - 10.5194/isprs-annals-IV-2-W7-87-2019

M3 - Conference article

AN - SCOPUS:85084680018

VL - 4

SP - 87

EP - 94

JO - ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences

JF - ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences

SN - 2194-9042

IS - 2/W7

T2 - 1st Photogrammetric Image Analysis and Munich Remote Sensing Symposium, PIA 2019+MRSS 2019

Y2 - 18 September 2019 through 20 September 2019

ER -

Research@Leibniz University

Semantic segmentation of manmade landscape structures in digital terrain models

Autoren

Organisationseinheiten

Details

Abstract

ASJC Scopus Sachgebiete

Zitieren

Von denselben Autoren

3D Uncertain Implicit Surface Mapping Using GMM and GP

Gap completion in point cloud scene occluded by vehicles using SGC-Net

The Challenge of Data Analytics with Climate-neutral Urban Mobility: (Vision Paper)

LAformer: Trajectory Prediction for Autonomous Driving with Lane-Aware Scene Constraints

CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception