Improving deep learning based semantic segmentation with multi view outlier correction

Research output: Contribution to journalConference articleResearchpeer review

Authors

  • Torben Peters
  • Claus Brenner
  • M. Song
View graph of relations

Details

Original languageEnglish
Pages (from-to)711-716
Number of pages6
JournalInternational Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives
Volume43
Issue numberB2
Publication statusPublished - 12 Aug 2020
Event2020 24th ISPRS Congress - Technical Commission II - Nice, Virtual, France
Duration: 31 Aug 20202 Sept 2020

Abstract

The goal of this paper is to use transfer learning for semi supervised semantic segmentation in 2D images: given a pretrained deep convolutional network (DCNN), our aim is to adapt it to a new camera-sensor system by enforcing predictions to be consistent for the same object in space. This is enabled by projecting 3D object points into multi-view 2D images. Since every 3D object point is usually mapped to a number of 2D images, each of which undergoes a pixelwise classification using the pretrained DCNN, we obtain a number of predictions (labels) for the same object point. This makes it possible to detect and correct outlier predictions. Ultimately, we retrain the DCNN on the corrected dataset in order to adapt the network to the new input data. We demonstrate the effectiveness of our approach on a mobile mapping dataset containing over 10'000 images and more than 1 billion 3D points. Moreover, we manually annotated a subset of the mobile mapping images and show that we were able to rise the mean intersection over union (mIoU) by approximately 10% with Deeplabv3+, using our approach.

Keywords

    Deep Learning, MMS, Multi-view, Point Cloud, Transfer Learning

ASJC Scopus subject areas

Cite this

Improving deep learning based semantic segmentation with multi view outlier correction. / Peters, Torben; Brenner, Claus; Song, M.
In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, Vol. 43, No. B2, 12.08.2020, p. 711-716.

Research output: Contribution to journalConference articleResearchpeer review

Peters, T, Brenner, C & Song, M 2020, 'Improving deep learning based semantic segmentation with multi view outlier correction', International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, vol. 43, no. B2, pp. 711-716. https://doi.org/10.5194/isprs-archives-XLIII-B2-2020-711-2020
Peters, T., Brenner, C., & Song, M. (2020). Improving deep learning based semantic segmentation with multi view outlier correction. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, 43(B2), 711-716. https://doi.org/10.5194/isprs-archives-XLIII-B2-2020-711-2020
Peters T, Brenner C, Song M. Improving deep learning based semantic segmentation with multi view outlier correction. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives. 2020 Aug 12;43(B2):711-716. doi: 10.5194/isprs-archives-XLIII-B2-2020-711-2020
Peters, Torben ; Brenner, Claus ; Song, M. / Improving deep learning based semantic segmentation with multi view outlier correction. In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives. 2020 ; Vol. 43, No. B2. pp. 711-716.
Download
@article{c36e536ec33b4956b81765181c766d55,
title = "Improving deep learning based semantic segmentation with multi view outlier correction",
abstract = "The goal of this paper is to use transfer learning for semi supervised semantic segmentation in 2D images: given a pretrained deep convolutional network (DCNN), our aim is to adapt it to a new camera-sensor system by enforcing predictions to be consistent for the same object in space. This is enabled by projecting 3D object points into multi-view 2D images. Since every 3D object point is usually mapped to a number of 2D images, each of which undergoes a pixelwise classification using the pretrained DCNN, we obtain a number of predictions (labels) for the same object point. This makes it possible to detect and correct outlier predictions. Ultimately, we retrain the DCNN on the corrected dataset in order to adapt the network to the new input data. We demonstrate the effectiveness of our approach on a mobile mapping dataset containing over 10'000 images and more than 1 billion 3D points. Moreover, we manually annotated a subset of the mobile mapping images and show that we were able to rise the mean intersection over union (mIoU) by approximately 10% with Deeplabv3+, using our approach.",
keywords = "Deep Learning, MMS, Multi-view, Point Cloud, Transfer Learning",
author = "Torben Peters and Claus Brenner and M. Song",
note = "Funding information: This work was funded by the German Research Foundation (DFG) as a part of the Research Training Group GRK2159, {\textquoteleft}Integrity and collaboration in dynamic sensor networks (i.c.sens){\textquoteright}.; 2020 24th ISPRS Congress - Technical Commission II ; Conference date: 31-08-2020 Through 02-09-2020",
year = "2020",
month = aug,
day = "12",
doi = "10.5194/isprs-archives-XLIII-B2-2020-711-2020",
language = "English",
volume = "43",
pages = "711--716",
number = "B2",

}

Download

TY - JOUR

T1 - Improving deep learning based semantic segmentation with multi view outlier correction

AU - Peters, Torben

AU - Brenner, Claus

AU - Song, M.

N1 - Funding information: This work was funded by the German Research Foundation (DFG) as a part of the Research Training Group GRK2159, ‘Integrity and collaboration in dynamic sensor networks (i.c.sens)’.

PY - 2020/8/12

Y1 - 2020/8/12

N2 - The goal of this paper is to use transfer learning for semi supervised semantic segmentation in 2D images: given a pretrained deep convolutional network (DCNN), our aim is to adapt it to a new camera-sensor system by enforcing predictions to be consistent for the same object in space. This is enabled by projecting 3D object points into multi-view 2D images. Since every 3D object point is usually mapped to a number of 2D images, each of which undergoes a pixelwise classification using the pretrained DCNN, we obtain a number of predictions (labels) for the same object point. This makes it possible to detect and correct outlier predictions. Ultimately, we retrain the DCNN on the corrected dataset in order to adapt the network to the new input data. We demonstrate the effectiveness of our approach on a mobile mapping dataset containing over 10'000 images and more than 1 billion 3D points. Moreover, we manually annotated a subset of the mobile mapping images and show that we were able to rise the mean intersection over union (mIoU) by approximately 10% with Deeplabv3+, using our approach.

AB - The goal of this paper is to use transfer learning for semi supervised semantic segmentation in 2D images: given a pretrained deep convolutional network (DCNN), our aim is to adapt it to a new camera-sensor system by enforcing predictions to be consistent for the same object in space. This is enabled by projecting 3D object points into multi-view 2D images. Since every 3D object point is usually mapped to a number of 2D images, each of which undergoes a pixelwise classification using the pretrained DCNN, we obtain a number of predictions (labels) for the same object point. This makes it possible to detect and correct outlier predictions. Ultimately, we retrain the DCNN on the corrected dataset in order to adapt the network to the new input data. We demonstrate the effectiveness of our approach on a mobile mapping dataset containing over 10'000 images and more than 1 billion 3D points. Moreover, we manually annotated a subset of the mobile mapping images and show that we were able to rise the mean intersection over union (mIoU) by approximately 10% with Deeplabv3+, using our approach.

KW - Deep Learning

KW - MMS

KW - Multi-view

KW - Point Cloud

KW - Transfer Learning

UR - http://www.scopus.com/inward/record.url?scp=85091067250&partnerID=8YFLogxK

U2 - 10.5194/isprs-archives-XLIII-B2-2020-711-2020

DO - 10.5194/isprs-archives-XLIII-B2-2020-711-2020

M3 - Conference article

AN - SCOPUS:85091067250

VL - 43

SP - 711

EP - 716

JO - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives

JF - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives

SN - 1682-1750

IS - B2

T2 - 2020 24th ISPRS Congress - Technical Commission II

Y2 - 31 August 2020 through 2 September 2020

ER -