Automatically generated training data for land cover classification with cnns using sentinel-2 images

M. Voelsen; J. Bostelmann; A. Maas; F. Rottensteiner; C. Heipke

doi:10.5194/isprs-archives-XLIII-B3-2020-767-2020

Details

Original language	English
Pages (from-to)	767-774
Number of pages	8
Journal	International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives
Volume	43
Issue number	B3
Publication status	Published - 21 Aug 2020
Event	2020 24th ISPRS Congress - Technical Commission III - Nice, Virtual, France Duration: 31 Aug 2020 → 2 Sept 2020

Abstract

Pixel-wise classification of remote sensing imagery is highly interesting for tasks like land cover classification or change detection. The acquisition of large training data sets for these tasks is challenging, but necessary to obtain good results with deep learning algorithms such as convolutional neural networks (CNN). In this paper we present a method for the automatic generation of a large amount of training data by combining satellite imagery with reference data from an available geospatial database. Due to this combination of different data sources the resulting training data contain a certain amount of incorrect labels. We evaluate the influence of this so called label noise regarding the time difference between acquisition of the two data sources, the amount of training data and the class structure. We combine Sentinel-2 images with reference data from a geospatial database provided by the German Land Survey Office of Lower Saxony (LGLN). With different training sets we train a fully convolutional neural network (FCN) and classify four land cover classes (code Building, Agriculture, Forest, Water/code). Our results show that the errors in the training samples do not have a large influence on the resulting classifiers. This is probably due to the fact that the noise is randomly distributed and thus, neighbours of incorrect samples are predominantly correct. As expected, a larger amount of training data improves the results, especially for the less well represented classes. Other influences are different illuminations conditions and seasonal effects during data acquisition. To better adapt the classifier to these different conditions they should also be included in the training data.

Keywords

CNN, Deep Learning, Land Cover, Remote Sensing, Semantic Segmentation, Sentinel-2

ASJC Scopus subject areas

Computer Science(all)
Information Systems
Social Sciences(all)
Geography, Planning and Development

Sustainable Development Goals

SDG 15 - Life on Land

Cite this

Automatically generated training data for land cover classification with cnns using sentinel-2 images. / Voelsen, M.; Bostelmann, J.; Maas, A. et al.
In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, Vol. 43, No. B3, 21.08.2020, p. 767-774.

Research output: Contribution to journal › Conference article › Research › peer review

Voelsen, M, Bostelmann, J, Maas, A, Rottensteiner, F & Heipke, C 2020, 'Automatically generated training data for land cover classification with cnns using sentinel-2 images', International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, vol. 43, no. B3, pp. 767-774. https://doi.org/10.5194/isprs-archives-XLIII-B3-2020-767-2020

Voelsen, M., Bostelmann, J., Maas, A., Rottensteiner, F., & Heipke, C. (2020). Automatically generated training data for land cover classification with cnns using sentinel-2 images. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, 43(B3), 767-774. https://doi.org/10.5194/isprs-archives-XLIII-B3-2020-767-2020

Voelsen M, Bostelmann J, Maas A, Rottensteiner F, Heipke C. Automatically generated training data for land cover classification with cnns using sentinel-2 images. International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives. 2020 Aug 21;43(B3):767-774. doi: 10.5194/isprs-archives-XLIII-B3-2020-767-2020

Voelsen, M. ; Bostelmann, J. ; Maas, A. et al. / Automatically generated training data for land cover classification with cnns using sentinel-2 images. In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives. 2020 ; Vol. 43, No. B3. pp. 767-774.

Download

@article{895bd51d6d214dbd831b10900eb861c1,

title = "Automatically generated training data for land cover classification with cnns using sentinel-2 images",

abstract = "Pixel-wise classification of remote sensing imagery is highly interesting for tasks like land cover classification or change detection. The acquisition of large training data sets for these tasks is challenging, but necessary to obtain good results with deep learning algorithms such as convolutional neural networks (CNN). In this paper we present a method for the automatic generation of a large amount of training data by combining satellite imagery with reference data from an available geospatial database. Due to this combination of different data sources the resulting training data contain a certain amount of incorrect labels. We evaluate the influence of this so called label noise regarding the time difference between acquisition of the two data sources, the amount of training data and the class structure. We combine Sentinel-2 images with reference data from a geospatial database provided by the German Land Survey Office of Lower Saxony (LGLN). With different training sets we train a fully convolutional neural network (FCN) and classify four land cover classes (code Building, Agriculture, Forest, Water/code). Our results show that the errors in the training samples do not have a large influence on the resulting classifiers. This is probably due to the fact that the noise is randomly distributed and thus, neighbours of incorrect samples are predominantly correct. As expected, a larger amount of training data improves the results, especially for the less well represented classes. Other influences are different illuminations conditions and seasonal effects during data acquisition. To better adapt the classifier to these different conditions they should also be included in the training data.",

keywords = "CNN, Deep Learning, Land Cover, Remote Sensing, Semantic Segmentation, Sentinel-2",

author = "M. Voelsen and J. Bostelmann and A. Maas and F. Rottensteiner and C. Heipke",

year = "2020",

month = aug,

day = "21",

doi = "10.5194/isprs-archives-XLIII-B3-2020-767-2020",

language = "English",

volume = "43",

pages = "767--774",

number = "B3",

note = "2020 24th ISPRS Congress - Technical Commission III ; Conference date: 31-08-2020 Through 02-09-2020",

}

Download

TY - JOUR

T1 - Automatically generated training data for land cover classification with cnns using sentinel-2 images

AU - Voelsen, M.

AU - Bostelmann, J.

AU - Maas, A.

AU - Rottensteiner, F.

AU - Heipke, C.

PY - 2020/8/21

Y1 - 2020/8/21

N2 - Pixel-wise classification of remote sensing imagery is highly interesting for tasks like land cover classification or change detection. The acquisition of large training data sets for these tasks is challenging, but necessary to obtain good results with deep learning algorithms such as convolutional neural networks (CNN). In this paper we present a method for the automatic generation of a large amount of training data by combining satellite imagery with reference data from an available geospatial database. Due to this combination of different data sources the resulting training data contain a certain amount of incorrect labels. We evaluate the influence of this so called label noise regarding the time difference between acquisition of the two data sources, the amount of training data and the class structure. We combine Sentinel-2 images with reference data from a geospatial database provided by the German Land Survey Office of Lower Saxony (LGLN). With different training sets we train a fully convolutional neural network (FCN) and classify four land cover classes (code Building, Agriculture, Forest, Water/code). Our results show that the errors in the training samples do not have a large influence on the resulting classifiers. This is probably due to the fact that the noise is randomly distributed and thus, neighbours of incorrect samples are predominantly correct. As expected, a larger amount of training data improves the results, especially for the less well represented classes. Other influences are different illuminations conditions and seasonal effects during data acquisition. To better adapt the classifier to these different conditions they should also be included in the training data.

AB - Pixel-wise classification of remote sensing imagery is highly interesting for tasks like land cover classification or change detection. The acquisition of large training data sets for these tasks is challenging, but necessary to obtain good results with deep learning algorithms such as convolutional neural networks (CNN). In this paper we present a method for the automatic generation of a large amount of training data by combining satellite imagery with reference data from an available geospatial database. Due to this combination of different data sources the resulting training data contain a certain amount of incorrect labels. We evaluate the influence of this so called label noise regarding the time difference between acquisition of the two data sources, the amount of training data and the class structure. We combine Sentinel-2 images with reference data from a geospatial database provided by the German Land Survey Office of Lower Saxony (LGLN). With different training sets we train a fully convolutional neural network (FCN) and classify four land cover classes (code Building, Agriculture, Forest, Water/code). Our results show that the errors in the training samples do not have a large influence on the resulting classifiers. This is probably due to the fact that the noise is randomly distributed and thus, neighbours of incorrect samples are predominantly correct. As expected, a larger amount of training data improves the results, especially for the less well represented classes. Other influences are different illuminations conditions and seasonal effects during data acquisition. To better adapt the classifier to these different conditions they should also be included in the training data.

KW - CNN

KW - Deep Learning

KW - Land Cover

KW - Remote Sensing

KW - Semantic Segmentation

KW - Sentinel-2

UR - http://www.scopus.com/inward/record.url?scp=85091153172&partnerID=8YFLogxK

U2 - 10.5194/isprs-archives-XLIII-B3-2020-767-2020

DO - 10.5194/isprs-archives-XLIII-B3-2020-767-2020

M3 - Conference article

AN - SCOPUS:85091153172

VL - 43

SP - 767

EP - 774

JO - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives

JF - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives

SN - 1682-1750

IS - B3

T2 - 2020 24th ISPRS Congress - Technical Commission III

Y2 - 31 August 2020 through 2 September 2020

ER -

Research@Leibniz University

Automatically generated training data for land cover classification with cnns using sentinel-2 images

Authors

Research Organisations

External Research Organisations