Automatically generated training data for land cover classification with cnns using sentinel-2 images

Research output: Contribution to journalConference articleResearchpeer review

Authors

  • M. Voelsen
  • J. Bostelmann
  • A. Maas
  • F. Rottensteiner
  • C. Heipke

External Research Organisations

  • State Office for Geoinformation and Surveying of Lower Saxony
View graph of relations

Details

Original languageEnglish
Pages (from-to)767-774
Number of pages8
JournalInternational Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives
Volume43
Issue numberB3
Publication statusPublished - 21 Aug 2020
Event2020 24th ISPRS Congress - Technical Commission III - Nice, Virtual, France
Duration: 31 Aug 20202 Sept 2020

Abstract

Pixel-wise classification of remote sensing imagery is highly interesting for tasks like land cover classification or change detection. The acquisition of large training data sets for these tasks is challenging, but necessary to obtain good results with deep learning algorithms such as convolutional neural networks (CNN). In this paper we present a method for the automatic generation of a large amount of training data by combining satellite imagery with reference data from an available geospatial database. Due to this combination of different data sources the resulting training data contain a certain amount of incorrect labels. We evaluate the influence of this so called label noise regarding the time difference between acquisition of the two data sources, the amount of training data and the class structure. We combine Sentinel-2 images with reference data from a geospatial database provided by the German Land Survey Office of Lower Saxony (LGLN). With different training sets we train a fully convolutional neural network (FCN) and classify four land cover classes (code Building, Agriculture, Forest, Water/code). Our results show that the errors in the training samples do not have a large influence on the resulting classifiers. This is probably due to the fact that the noise is randomly distributed and thus, neighbours of incorrect samples are predominantly correct. As expected, a larger amount of training data improves the results, especially for the less well represented classes. Other influences are different illuminations conditions and seasonal effects during data acquisition. To better adapt the classifier to these different conditions they should also be included in the training data.

Keywords

    CNN, Deep Learning, Land Cover, Remote Sensing, Semantic Segmentation, Sentinel-2

ASJC Scopus subject areas

Sustainable Development Goals

Cite this

Automatically generated training data for land cover classification with cnns using sentinel-2 images. / Voelsen, M.; Bostelmann, J.; Maas, A. et al.
In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, Vol. 43, No. B3, 21.08.2020, p. 767-774.

Research output: Contribution to journalConference articleResearchpeer review

Voelsen, M, Bostelmann, J, Maas, A, Rottensteiner, F & Heipke, C 2020, 'Automatically generated training data for land cover classification with cnns using sentinel-2 images', International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, vol. 43, no. B3, pp. 767-774. https://doi.org/10.5194/isprs-archives-XLIII-B3-2020-767-2020
Voelsen, M., Bostelmann, J., Maas, A., Rottensteiner, F., & Heipke, C. (2020). Automatically generated training data for land cover classification with cnns using sentinel-2 images. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, 43(B3), 767-774. https://doi.org/10.5194/isprs-archives-XLIII-B3-2020-767-2020
Voelsen M, Bostelmann J, Maas A, Rottensteiner F, Heipke C. Automatically generated training data for land cover classification with cnns using sentinel-2 images. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives. 2020 Aug 21;43(B3):767-774. doi: 10.5194/isprs-archives-XLIII-B3-2020-767-2020
Voelsen, M. ; Bostelmann, J. ; Maas, A. et al. / Automatically generated training data for land cover classification with cnns using sentinel-2 images. In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives. 2020 ; Vol. 43, No. B3. pp. 767-774.
Download
@article{895bd51d6d214dbd831b10900eb861c1,
title = "Automatically generated training data for land cover classification with cnns using sentinel-2 images",
abstract = "Pixel-wise classification of remote sensing imagery is highly interesting for tasks like land cover classification or change detection. The acquisition of large training data sets for these tasks is challenging, but necessary to obtain good results with deep learning algorithms such as convolutional neural networks (CNN). In this paper we present a method for the automatic generation of a large amount of training data by combining satellite imagery with reference data from an available geospatial database. Due to this combination of different data sources the resulting training data contain a certain amount of incorrect labels. We evaluate the influence of this so called label noise regarding the time difference between acquisition of the two data sources, the amount of training data and the class structure. We combine Sentinel-2 images with reference data from a geospatial database provided by the German Land Survey Office of Lower Saxony (LGLN). With different training sets we train a fully convolutional neural network (FCN) and classify four land cover classes (code Building, Agriculture, Forest, Water/code). Our results show that the errors in the training samples do not have a large influence on the resulting classifiers. This is probably due to the fact that the noise is randomly distributed and thus, neighbours of incorrect samples are predominantly correct. As expected, a larger amount of training data improves the results, especially for the less well represented classes. Other influences are different illuminations conditions and seasonal effects during data acquisition. To better adapt the classifier to these different conditions they should also be included in the training data.",
keywords = "CNN, Deep Learning, Land Cover, Remote Sensing, Semantic Segmentation, Sentinel-2",
author = "M. Voelsen and J. Bostelmann and A. Maas and F. Rottensteiner and C. Heipke",
year = "2020",
month = aug,
day = "21",
doi = "10.5194/isprs-archives-XLIII-B3-2020-767-2020",
language = "English",
volume = "43",
pages = "767--774",
number = "B3",
note = "2020 24th ISPRS Congress - Technical Commission III ; Conference date: 31-08-2020 Through 02-09-2020",

}

Download

TY - JOUR

T1 - Automatically generated training data for land cover classification with cnns using sentinel-2 images

AU - Voelsen, M.

AU - Bostelmann, J.

AU - Maas, A.

AU - Rottensteiner, F.

AU - Heipke, C.

PY - 2020/8/21

Y1 - 2020/8/21

N2 - Pixel-wise classification of remote sensing imagery is highly interesting for tasks like land cover classification or change detection. The acquisition of large training data sets for these tasks is challenging, but necessary to obtain good results with deep learning algorithms such as convolutional neural networks (CNN). In this paper we present a method for the automatic generation of a large amount of training data by combining satellite imagery with reference data from an available geospatial database. Due to this combination of different data sources the resulting training data contain a certain amount of incorrect labels. We evaluate the influence of this so called label noise regarding the time difference between acquisition of the two data sources, the amount of training data and the class structure. We combine Sentinel-2 images with reference data from a geospatial database provided by the German Land Survey Office of Lower Saxony (LGLN). With different training sets we train a fully convolutional neural network (FCN) and classify four land cover classes (code Building, Agriculture, Forest, Water/code). Our results show that the errors in the training samples do not have a large influence on the resulting classifiers. This is probably due to the fact that the noise is randomly distributed and thus, neighbours of incorrect samples are predominantly correct. As expected, a larger amount of training data improves the results, especially for the less well represented classes. Other influences are different illuminations conditions and seasonal effects during data acquisition. To better adapt the classifier to these different conditions they should also be included in the training data.

AB - Pixel-wise classification of remote sensing imagery is highly interesting for tasks like land cover classification or change detection. The acquisition of large training data sets for these tasks is challenging, but necessary to obtain good results with deep learning algorithms such as convolutional neural networks (CNN). In this paper we present a method for the automatic generation of a large amount of training data by combining satellite imagery with reference data from an available geospatial database. Due to this combination of different data sources the resulting training data contain a certain amount of incorrect labels. We evaluate the influence of this so called label noise regarding the time difference between acquisition of the two data sources, the amount of training data and the class structure. We combine Sentinel-2 images with reference data from a geospatial database provided by the German Land Survey Office of Lower Saxony (LGLN). With different training sets we train a fully convolutional neural network (FCN) and classify four land cover classes (code Building, Agriculture, Forest, Water/code). Our results show that the errors in the training samples do not have a large influence on the resulting classifiers. This is probably due to the fact that the noise is randomly distributed and thus, neighbours of incorrect samples are predominantly correct. As expected, a larger amount of training data improves the results, especially for the less well represented classes. Other influences are different illuminations conditions and seasonal effects during data acquisition. To better adapt the classifier to these different conditions they should also be included in the training data.

KW - CNN

KW - Deep Learning

KW - Land Cover

KW - Remote Sensing

KW - Semantic Segmentation

KW - Sentinel-2

UR - http://www.scopus.com/inward/record.url?scp=85091153172&partnerID=8YFLogxK

U2 - 10.5194/isprs-archives-XLIII-B3-2020-767-2020

DO - 10.5194/isprs-archives-XLIII-B3-2020-767-2020

M3 - Conference article

AN - SCOPUS:85091153172

VL - 43

SP - 767

EP - 774

JO - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives

JF - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives

SN - 1682-1750

IS - B3

T2 - 2020 24th ISPRS Congress - Technical Commission III

Y2 - 31 August 2020 through 2 September 2020

ER -