Details
Original language | English |
---|---|
Pages (from-to) | 845-852 |
Number of pages | 8 |
Journal | International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives |
Volume | 42 |
Issue number | 2 |
Publication status | Published - 30 May 2018 |
Event | 2018 ISPRS TC II Mid-term Symposium "Towards Photogrammetry 2020", Riva del Garda, Italy. Duration: 4 Jun 2018 → 7 Jun 2018 |
Abstract
In this paper we deal with the problem of measuring the similarity between training and test datasets in the context of transfer learning (TL) for image classification. TL tries to transfer knowledge from a source domain, where labelled training samples are abundant but the data may follow a different distribution, to a target domain, where labelled training samples are scarce or even unavailable, assuming that the domains are related. Thus, the requirements with respect to the availability of labelled training samples in the target domain are reduced. In particular, if no labelled target data are available, it is inherently difficult to find a robust measure of relatedness between the source and target domains. This is of crucial importance for the performance of TL, because knowledge transfer between unrelated data may lead to negative transfer, i.e. to a decrease in classification performance after transfer. We address the problem of measuring the relatedness between source and target datasets and investigate three different strategies to predict and, consequently, avoid negative transfer. The first strategy is based on circular validation. The second strategy relies on the Maximum Mean Discrepancy (MMD) similarity metric, whereas the third is an extension of MMD that incorporates knowledge about the class labels in the source domain. Our method is evaluated using two different benchmark datasets. The experiments highlight the strengths and weaknesses of the investigated methods. We also show that it is possible to reduce the amount of negative transfer using these strategies for a TL method and to generate a consistent performance improvement over the whole dataset.
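As a point of reference for the MMD criterion mentioned in the abstract, the sketch below computes a biased empirical estimate of the squared Maximum Mean Discrepancy between source- and target-domain feature samples with an RBF kernel. It is a minimal illustration only, not the implementation used in the paper; the function names, the kernel bandwidth `gamma`, and the toy data are assumptions introduced here.

```python
import numpy as np

def rbf_kernel(a, b, gamma=1.0):
    """Gaussian (RBF) kernel matrix between the rows of a and b."""
    sq_dists = (
        np.sum(a ** 2, axis=1)[:, None]
        + np.sum(b ** 2, axis=1)[None, :]
        - 2.0 * a @ b.T
    )
    return np.exp(-gamma * sq_dists)

def mmd2(source, target, gamma=1.0):
    """Biased empirical estimate of the squared MMD:
    MMD^2 = E[k(s, s')] + E[k(t, t')] - 2 E[k(s, t)]."""
    k_ss = rbf_kernel(source, source, gamma)
    k_tt = rbf_kernel(target, target, gamma)
    k_st = rbf_kernel(source, target, gamma)
    return k_ss.mean() + k_tt.mean() - 2.0 * k_st.mean()

# Toy usage with hypothetical feature matrices (n_samples x n_features).
rng = np.random.default_rng(0)
source_features = rng.normal(0.0, 1.0, size=(200, 5))
target_features = rng.normal(0.5, 1.0, size=(200, 5))
print(mmd2(source_features, target_features))  # larger values suggest less related domains
```

In this reading, a small MMD between source and target features indicates related domains, which is the kind of signal the second and third strategies described in the abstract use to decide whether transfer is likely to be safe.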
Keywords
- Domain adaptation
- Negative transfer
- Remote sensing
- Transfer learning
ASJC Scopus subject areas
- Computer Science (all)
- Information Systems
- Social Sciences (all)
- Geography, Planning and Development
Cite this
Paul, A., Vogt, K., Rottensteiner, F., Ostermann, J. & Heipke, C. (2018). A comparison of two strategies for avoiding negative transfer in domain adaptation based on logistic regression. In: International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives, Vol. 42, No. 2, 30.05.2018, p. 845-852.
Research output: Contribution to journal › Conference article › Research › peer review
TY - JOUR
T1 - A comparison of two strategies for avoiding negative transfer in domain adaptation based on logistic regression
AU - Paul, A.
AU - Vogt, K.
AU - Rottensteiner, F.
AU - Ostermann, J.
AU - Heipke, C.
N1 - Funding information: This work was supported by the German Science Foundation (DFG) under grant HE 1822/30-1. The Vaihingen and Potsdam data were provided by the German Society for Photogrammetry, Remote Sensing and Geoinformation (DGPF) (Cramer, 2010): http://www.ifp.uni-stuttgart.de/dgpf/DKEP-Allg.html.
PY - 2018/5/30
Y1 - 2018/5/30
N2 - In this paper we deal with the problem of measuring the similarity between training and test datasets in the context of transfer learning (TL) for image classification. TL tries to transfer knowledge from a source domain, where labelled training samples are abundant but the data may follow a different distribution, to a target domain, where labelled training samples are scarce or even unavailable, assuming that the domains are related. Thus, the requirements with respect to the availability of labelled training samples in the target domain are reduced. In particular, if no labelled target data are available, it is inherently difficult to find a robust measure of relatedness between the source and target domains. This is of crucial importance for the performance of TL, because knowledge transfer between unrelated data may lead to negative transfer, i.e. to a decrease in classification performance after transfer. We address the problem of measuring the relatedness between source and target datasets and investigate three different strategies to predict and, consequently, avoid negative transfer. The first strategy is based on circular validation. The second strategy relies on the Maximum Mean Discrepancy (MMD) similarity metric, whereas the third is an extension of MMD that incorporates knowledge about the class labels in the source domain. Our method is evaluated using two different benchmark datasets. The experiments highlight the strengths and weaknesses of the investigated methods. We also show that it is possible to reduce the amount of negative transfer using these strategies for a TL method and to generate a consistent performance improvement over the whole dataset.
AB - In this paper we deal with the problem of measuring the similarity between training and test datasets in the context of transfer learning (TL) for image classification. TL tries to transfer knowledge from a source domain, where labelled training samples are abundant but the data may follow a different distribution, to a target domain, where labelled training samples are scarce or even unavailable, assuming that the domains are related. Thus, the requirements with respect to the availability of labelled training samples in the target domain are reduced. In particular, if no labelled target data are available, it is inherently difficult to find a robust measure of relatedness between the source and target domains. This is of crucial importance for the performance of TL, because knowledge transfer between unrelated data may lead to negative transfer, i.e. to a decrease in classification performance after transfer. We address the problem of measuring the relatedness between source and target datasets and investigate three different strategies to predict and, consequently, avoid negative transfer. The first strategy is based on circular validation. The second strategy relies on the Maximum Mean Discrepancy (MMD) similarity metric, whereas the third is an extension of MMD that incorporates knowledge about the class labels in the source domain. Our method is evaluated using two different benchmark datasets. The experiments highlight the strengths and weaknesses of the investigated methods. We also show that it is possible to reduce the amount of negative transfer using these strategies for a TL method and to generate a consistent performance improvement over the whole dataset.
KW - Domain adaptation
KW - Negative transfer
KW - Remote sensing
KW - Transfer learning
UR - http://www.scopus.com/inward/record.url?scp=85048360278&partnerID=8YFLogxK
U2 - 10.5194/isprs-archives-XLII-2-845-2018
DO - 10.5194/isprs-archives-XLII-2-845-2018
M3 - Conference article
AN - SCOPUS:85048360278
VL - 42
SP - 845
EP - 852
JO - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives
JF - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences - ISPRS Archives
SN - 1682-1750
IS - 2
T2 - 2018 ISPRS TC II Mid-term Symposium "Towards Photogrammetry 2020"
Y2 - 4 June 2018 through 7 June 2018
ER -