Semantic Segmentation of Aerial Images Using Binary Space Partitioning

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

Research Organisations

View graph of relations

Details

Original languageEnglish
Title of host publicationKI 2021
Subtitle of host publicationAdvances in Artificial Intelligence - 44th German Conference on AI, Proceedings
EditorsStefan Edelkamp, Ralf Möller, Elmar Rueckert
Pages116-134
Number of pages19
ISBN (electronic)978-3-030-87626-5
Publication statusPublished - 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12873 LNAI
ISSN (Print)0302-9743
ISSN (electronic)1611-3349

Abstract

The semantic segmentation of aerial images enables many useful applications such as tracking city growth, tracking deforestation, or automatically creating and updating maps. However, gathering enough training data to train a proper model for the automated analysis of aerial images is usually too labor-intensive and thus too expensive in most cases. Therefore, domain adaptation techniques are often necessary to be able to adapt existing models or to transfer knowledge from existing datasets to new unlabeled aerial images. Modern adaptation approaches make use of complex architectures involving many model components, losses and loss weights. These approaches are hard to apply in practice since their hyperparameters are hard to optimize for a given adaptation problem. This complexity is the result of trying to separate domain-invariant elements, e.g., structures and shapes, from domain-specific elements, e.g., textures. In this paper, we present a novel model for semantic segmentation, which not only achieves state-of-the-art performance on aerial images, but also inherently learns separate feature representations for shapes and textures. Our goal is to provide a model which can serve as the basis for future domain adaptation approaches which are simpler but still effective. Through end-to-end training our deep learning model learns to map aerial images to feature representations which can be decoded into binary space partitioning trees, a resolution-independent representation of the semantic segmentation, which can then be rendered into a pixelwise semantic segmentation in a differentiable way.

Keywords

    Aerial images, Computer vision, Deep learning, Semantic segmentation, Spatial partitioning

ASJC Scopus subject areas

Cite this

Semantic Segmentation of Aerial Images Using Binary Space Partitioning. / Gritzner, Daniel; Ostermann, Jörn.
KI 2021: Advances in Artificial Intelligence - 44th German Conference on AI, Proceedings. ed. / Stefan Edelkamp; Ralf Möller; Elmar Rueckert. 2021. p. 116-134 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12873 LNAI).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Gritzner, D & Ostermann, J 2021, Semantic Segmentation of Aerial Images Using Binary Space Partitioning. in S Edelkamp, R Möller & E Rueckert (eds), KI 2021: Advances in Artificial Intelligence - 44th German Conference on AI, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 12873 LNAI, pp. 116-134. https://doi.org/10.1007/978-3-030-87626-5_10
Gritzner, D., & Ostermann, J. (2021). Semantic Segmentation of Aerial Images Using Binary Space Partitioning. In S. Edelkamp, R. Möller, & E. Rueckert (Eds.), KI 2021: Advances in Artificial Intelligence - 44th German Conference on AI, Proceedings (pp. 116-134). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 12873 LNAI). https://doi.org/10.1007/978-3-030-87626-5_10
Gritzner D, Ostermann J. Semantic Segmentation of Aerial Images Using Binary Space Partitioning. In Edelkamp S, Möller R, Rueckert E, editors, KI 2021: Advances in Artificial Intelligence - 44th German Conference on AI, Proceedings. 2021. p. 116-134. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). Epub 2021 Sept 30. doi: 10.1007/978-3-030-87626-5_10
Gritzner, Daniel ; Ostermann, Jörn. / Semantic Segmentation of Aerial Images Using Binary Space Partitioning. KI 2021: Advances in Artificial Intelligence - 44th German Conference on AI, Proceedings. editor / Stefan Edelkamp ; Ralf Möller ; Elmar Rueckert. 2021. pp. 116-134 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).
Download
@inproceedings{65b2cc2b92b0407cab4c310c8071779c,
title = "Semantic Segmentation of Aerial Images Using Binary Space Partitioning",
abstract = "The semantic segmentation of aerial images enables many useful applications such as tracking city growth, tracking deforestation, or automatically creating and updating maps. However, gathering enough training data to train a proper model for the automated analysis of aerial images is usually too labor-intensive and thus too expensive in most cases. Therefore, domain adaptation techniques are often necessary to be able to adapt existing models or to transfer knowledge from existing datasets to new unlabeled aerial images. Modern adaptation approaches make use of complex architectures involving many model components, losses and loss weights. These approaches are hard to apply in practice since their hyperparameters are hard to optimize for a given adaptation problem. This complexity is the result of trying to separate domain-invariant elements, e.g., structures and shapes, from domain-specific elements, e.g., textures. In this paper, we present a novel model for semantic segmentation, which not only achieves state-of-the-art performance on aerial images, but also inherently learns separate feature representations for shapes and textures. Our goal is to provide a model which can serve as the basis for future domain adaptation approaches which are simpler but still effective. Through end-to-end training our deep learning model learns to map aerial images to feature representations which can be decoded into binary space partitioning trees, a resolution-independent representation of the semantic segmentation, which can then be rendered into a pixelwise semantic segmentation in a differentiable way.",
keywords = "Aerial images, Computer vision, Deep learning, Semantic segmentation, Spatial partitioning",
author = "Daniel Gritzner and J{\"o}rn Ostermann",
year = "2021",
doi = "10.1007/978-3-030-87626-5_10",
language = "English",
isbn = "978-3-030-87625-8",
series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",
pages = "116--134",
editor = "Stefan Edelkamp and Ralf M{\"o}ller and Elmar Rueckert",
booktitle = "KI 2021",

}

Download

TY - GEN

T1 - Semantic Segmentation of Aerial Images Using Binary Space Partitioning

AU - Gritzner, Daniel

AU - Ostermann, Jörn

PY - 2021

Y1 - 2021

N2 - The semantic segmentation of aerial images enables many useful applications such as tracking city growth, tracking deforestation, or automatically creating and updating maps. However, gathering enough training data to train a proper model for the automated analysis of aerial images is usually too labor-intensive and thus too expensive in most cases. Therefore, domain adaptation techniques are often necessary to be able to adapt existing models or to transfer knowledge from existing datasets to new unlabeled aerial images. Modern adaptation approaches make use of complex architectures involving many model components, losses and loss weights. These approaches are hard to apply in practice since their hyperparameters are hard to optimize for a given adaptation problem. This complexity is the result of trying to separate domain-invariant elements, e.g., structures and shapes, from domain-specific elements, e.g., textures. In this paper, we present a novel model for semantic segmentation, which not only achieves state-of-the-art performance on aerial images, but also inherently learns separate feature representations for shapes and textures. Our goal is to provide a model which can serve as the basis for future domain adaptation approaches which are simpler but still effective. Through end-to-end training our deep learning model learns to map aerial images to feature representations which can be decoded into binary space partitioning trees, a resolution-independent representation of the semantic segmentation, which can then be rendered into a pixelwise semantic segmentation in a differentiable way.

AB - The semantic segmentation of aerial images enables many useful applications such as tracking city growth, tracking deforestation, or automatically creating and updating maps. However, gathering enough training data to train a proper model for the automated analysis of aerial images is usually too labor-intensive and thus too expensive in most cases. Therefore, domain adaptation techniques are often necessary to be able to adapt existing models or to transfer knowledge from existing datasets to new unlabeled aerial images. Modern adaptation approaches make use of complex architectures involving many model components, losses and loss weights. These approaches are hard to apply in practice since their hyperparameters are hard to optimize for a given adaptation problem. This complexity is the result of trying to separate domain-invariant elements, e.g., structures and shapes, from domain-specific elements, e.g., textures. In this paper, we present a novel model for semantic segmentation, which not only achieves state-of-the-art performance on aerial images, but also inherently learns separate feature representations for shapes and textures. Our goal is to provide a model which can serve as the basis for future domain adaptation approaches which are simpler but still effective. Through end-to-end training our deep learning model learns to map aerial images to feature representations which can be decoded into binary space partitioning trees, a resolution-independent representation of the semantic segmentation, which can then be rendered into a pixelwise semantic segmentation in a differentiable way.

KW - Aerial images

KW - Computer vision

KW - Deep learning

KW - Semantic segmentation

KW - Spatial partitioning

UR - http://www.scopus.com/inward/record.url?scp=85116898477&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-87626-5_10

DO - 10.1007/978-3-030-87626-5_10

M3 - Conference contribution

SN - 978-3-030-87625-8

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 116

EP - 134

BT - KI 2021

A2 - Edelkamp, Stefan

A2 - Möller, Ralf

A2 - Rueckert, Elmar

ER -

By the same author(s)