Semantic Segmentation of Aerial Images Using Binary Space Partitioning

Daniel Gritzner; Jörn Ostermann

doi:10.1007/978-3-030-87626-5_10

Details

Original language	English
Title of host publication	KI 2021
Subtitle of host publication	Advances in Artificial Intelligence - 44th German Conference on AI, Proceedings
Editors	Stefan Edelkamp, Ralf Möller, Elmar Rueckert
Pages	116-134
Number of pages	19
ISBN (electronic)	978-3-030-87626-5
Publication status	Published - 2021

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	12873 LNAI
ISSN (Print)	0302-9743
ISSN (electronic)	1611-3349

Abstract

The semantic segmentation of aerial images enables many useful applications such as tracking city growth, tracking deforestation, or automatically creating and updating maps. However, gathering enough training data to train a proper model for the automated analysis of aerial images is usually too labor-intensive and thus too expensive in most cases. Therefore, domain adaptation techniques are often necessary to be able to adapt existing models or to transfer knowledge from existing datasets to new unlabeled aerial images. Modern adaptation approaches make use of complex architectures involving many model components, losses and loss weights. These approaches are hard to apply in practice since their hyperparameters are hard to optimize for a given adaptation problem. This complexity is the result of trying to separate domain-invariant elements, e.g., structures and shapes, from domain-specific elements, e.g., textures. In this paper, we present a novel model for semantic segmentation, which not only achieves state-of-the-art performance on aerial images, but also inherently learns separate feature representations for shapes and textures. Our goal is to provide a model which can serve as the basis for future domain adaptation approaches which are simpler but still effective. Through end-to-end training our deep learning model learns to map aerial images to feature representations which can be decoded into binary space partitioning trees, a resolution-independent representation of the semantic segmentation, which can then be rendered into a pixelwise semantic segmentation in a differentiable way.

Keywords

Aerial images, Computer vision, Deep learning, Semantic segmentation, Spatial partitioning

ASJC Scopus subject areas

Mathematics(all)
Theoretical Computer Science
Computer Science(all)
General Computer Science

Cite this

Semantic Segmentation of Aerial Images Using Binary Space Partitioning. / Gritzner, Daniel; Ostermann, Jörn.
KI 2021: Advances in Artificial Intelligence - 44th German Conference on AI, Proceedings. ed. / Stefan Edelkamp; Ralf Möller; Elmar Rueckert. 2021. p. 116-134 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12873 LNAI).

Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review

Gritzner, D & Ostermann, J 2021, Semantic Segmentation of Aerial Images Using Binary Space Partitioning. in S Edelkamp, R Möller & E Rueckert (eds), KI 2021: Advances in Artificial Intelligence - 44th German Conference on AI, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12873 LNAI, pp. 116-134. https://doi.org/10.1007/978-3-030-87626-5_10

Gritzner, D., & Ostermann, J. (2021). Semantic Segmentation of Aerial Images Using Binary Space Partitioning. In S. Edelkamp, R. Möller, & E. Rueckert (Eds.), KI 2021: Advances in Artificial Intelligence - 44th German Conference on AI, Proceedings (pp. 116-134). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12873 LNAI). https://doi.org/10.1007/978-3-030-87626-5_10

Gritzner D, Ostermann J. Semantic Segmentation of Aerial Images Using Binary Space Partitioning. In Edelkamp S, Möller R, Rueckert E, editors, KI 2021: Advances in Artificial Intelligence - 44th German Conference on AI, Proceedings. 2021. p. 116-134. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). Epub 2021 Sept 30. doi: 10.1007/978-3-030-87626-5_10

Gritzner, Daniel ; Ostermann, Jörn. / Semantic Segmentation of Aerial Images Using Binary Space Partitioning. KI 2021: Advances in Artificial Intelligence - 44th German Conference on AI, Proceedings. editor / Stefan Edelkamp ; Ralf Möller ; Elmar Rueckert. 2021. pp. 116-134 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

Download

@inproceedings{65b2cc2b92b0407cab4c310c8071779c,

title = "Semantic Segmentation of Aerial Images Using Binary Space Partitioning",

abstract = "The semantic segmentation of aerial images enables many useful applications such as tracking city growth, tracking deforestation, or automatically creating and updating maps. However, gathering enough training data to train a proper model for the automated analysis of aerial images is usually too labor-intensive and thus too expensive in most cases. Therefore, domain adaptation techniques are often necessary to be able to adapt existing models or to transfer knowledge from existing datasets to new unlabeled aerial images. Modern adaptation approaches make use of complex architectures involving many model components, losses and loss weights. These approaches are hard to apply in practice since their hyperparameters are hard to optimize for a given adaptation problem. This complexity is the result of trying to separate domain-invariant elements, e.g., structures and shapes, from domain-specific elements, e.g., textures. In this paper, we present a novel model for semantic segmentation, which not only achieves state-of-the-art performance on aerial images, but also inherently learns separate feature representations for shapes and textures. Our goal is to provide a model which can serve as the basis for future domain adaptation approaches which are simpler but still effective. Through end-to-end training our deep learning model learns to map aerial images to feature representations which can be decoded into binary space partitioning trees, a resolution-independent representation of the semantic segmentation, which can then be rendered into a pixelwise semantic segmentation in a differentiable way.",

keywords = "Aerial images, Computer vision, Deep learning, Semantic segmentation, Spatial partitioning",

author = "Daniel Gritzner and J{\"o}rn Ostermann",

year = "2021",

doi = "10.1007/978-3-030-87626-5_10",

language = "English",

isbn = "978-3-030-87625-8",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

pages = "116--134",

editor = "Stefan Edelkamp and Ralf M{\"o}ller and Elmar Rueckert",

booktitle = "KI 2021",

}

Download

TY - GEN

T1 - Semantic Segmentation of Aerial Images Using Binary Space Partitioning

AU - Gritzner, Daniel

AU - Ostermann, Jörn

PY - 2021

Y1 - 2021

N2 - The semantic segmentation of aerial images enables many useful applications such as tracking city growth, tracking deforestation, or automatically creating and updating maps. However, gathering enough training data to train a proper model for the automated analysis of aerial images is usually too labor-intensive and thus too expensive in most cases. Therefore, domain adaptation techniques are often necessary to be able to adapt existing models or to transfer knowledge from existing datasets to new unlabeled aerial images. Modern adaptation approaches make use of complex architectures involving many model components, losses and loss weights. These approaches are hard to apply in practice since their hyperparameters are hard to optimize for a given adaptation problem. This complexity is the result of trying to separate domain-invariant elements, e.g., structures and shapes, from domain-specific elements, e.g., textures. In this paper, we present a novel model for semantic segmentation, which not only achieves state-of-the-art performance on aerial images, but also inherently learns separate feature representations for shapes and textures. Our goal is to provide a model which can serve as the basis for future domain adaptation approaches which are simpler but still effective. Through end-to-end training our deep learning model learns to map aerial images to feature representations which can be decoded into binary space partitioning trees, a resolution-independent representation of the semantic segmentation, which can then be rendered into a pixelwise semantic segmentation in a differentiable way.

AB - The semantic segmentation of aerial images enables many useful applications such as tracking city growth, tracking deforestation, or automatically creating and updating maps. However, gathering enough training data to train a proper model for the automated analysis of aerial images is usually too labor-intensive and thus too expensive in most cases. Therefore, domain adaptation techniques are often necessary to be able to adapt existing models or to transfer knowledge from existing datasets to new unlabeled aerial images. Modern adaptation approaches make use of complex architectures involving many model components, losses and loss weights. These approaches are hard to apply in practice since their hyperparameters are hard to optimize for a given adaptation problem. This complexity is the result of trying to separate domain-invariant elements, e.g., structures and shapes, from domain-specific elements, e.g., textures. In this paper, we present a novel model for semantic segmentation, which not only achieves state-of-the-art performance on aerial images, but also inherently learns separate feature representations for shapes and textures. Our goal is to provide a model which can serve as the basis for future domain adaptation approaches which are simpler but still effective. Through end-to-end training our deep learning model learns to map aerial images to feature representations which can be decoded into binary space partitioning trees, a resolution-independent representation of the semantic segmentation, which can then be rendered into a pixelwise semantic segmentation in a differentiable way.

KW - Aerial images

KW - Computer vision

KW - Deep learning

KW - Semantic segmentation

KW - Spatial partitioning

UR - http://www.scopus.com/inward/record.url?scp=85116898477&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-87626-5_10

DO - 10.1007/978-3-030-87626-5_10

M3 - Conference contribution

SN - 978-3-030-87625-8

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 116

EP - 134

BT - KI 2021

A2 - Edelkamp, Stefan

A2 - Möller, Ralf

A2 - Rueckert, Elmar

ER -

Research@Leibniz University

Semantic Segmentation of Aerial Images Using Binary Space Partitioning

Authors

Research Organisations

Details

Publication series

Abstract

Keywords

ASJC Scopus subject areas

Cite this

By the same author(s)

MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression

Acoustic Emission Detection in Noisy Environments using Linear Prediction

Genie: the first open-source ISO/IEC encoder for genomic data

On the Rate-Distortion-Complexity Trade-Offs of Neural Video Coding

Self-supervised domain adaptation for machinery remaining useful life prediction

MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression

Acoustic Emission Detection in Noisy Environments using Linear Prediction

Genie: the first open-source ISO/IEC encoder for genomic data

On the Rate-Distortion-Complexity Trade-Offs of Neural Video Coding

Self-supervised domain adaptation for machinery remaining useful life prediction

MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression