Open set task augmentation facilitates generalization of deep neural networks trained on small data sets

Wadhah Zai El Amri; Felix Reinhart; Wolfram Schenck

doi:10.1007/s00521-021-06753-6

Details

Originalsprache	Englisch
Seiten (von - bis)	6067-6083
Seitenumfang	17
Fachzeitschrift	Neural Computing and Applications
Jahrgang	34
Ausgabenummer	8
Frühes Online-Datum	9 Dez. 2021
Publikationsstatus	Veröffentlicht - Apr. 2022
Extern publiziert	Ja

Abstract

Many application scenarios for image recognition require learning of deep networks from small sample sizes in the order of a few hundred samples per class. Then, avoiding overfitting is critical. Common techniques to address overfitting are transfer learning, reduction of model complexity and artificial enrichment of the available data by, e.g., data augmentation. A key idea proposed in this paper is to incorporate additional samples into the training that do not belong to the classes of the target task. This can be accomplished by formulating the original classification task as an open set classification task. While the original closed set classification task is not altered at inference time, the recast as open set classification task enables the inclusion of additional data during training. Hence, the original closed set classification task is augmented with an open set task during training. We therefore call the proposed approach open set task augmentation. In order to integrate additional task-unrelated samples into the training, we employ the entropic open set loss originally proposed for open set classification tasks and also show that similar results can be obtained with a modified sum of squared errors loss function. Learning with the proposed approach benefits from the integration of additional “unknown” samples, which are often available, e.g., from open data sets, and can then be easily integrated into the learning process. We show that this open set task augmentation can improve model performance even when these additional samples are rather few or far from the domain of the target task. The proposed approach is demonstrated on two exemplary scenarios based on subsets of the ImageNet and Food-101 data sets as well as with several network architectures and two loss functions. We further shed light on the impact of the entropic open set loss on the internal representations formed by the networks. Open set task augmentation is particularly valuable when no additional data from the target classes are available—a scenario often faced in practice.

ASJC Scopus Sachgebiete

Informatik (insg.)
Software
Informatik (insg.)
Artificial intelligence

Zitieren

Open set task augmentation facilitates generalization of deep neural networks trained on small data sets. / Zai El Amri, Wadhah; Reinhart, Felix; Schenck, Wolfram.
in: Neural Computing and Applications, Jahrgang 34, Nr. 8, 04.2022, S. 6067-6083.

Publikation: Beitrag in Fachzeitschrift › Artikel › Forschung › Peer-Review

Zai El Amri, W, Reinhart, F & Schenck, W 2022, 'Open set task augmentation facilitates generalization of deep neural networks trained on small data sets', Neural Computing and Applications, Jg. 34, Nr. 8, S. 6067-6083. https://doi.org/10.1007/s00521-021-06753-6

Zai El Amri, W., Reinhart, F., & Schenck, W. (2022). Open set task augmentation facilitates generalization of deep neural networks trained on small data sets. Neural Computing and Applications, 34(8), 6067-6083. https://doi.org/10.1007/s00521-021-06753-6

Zai El Amri W, Reinhart F, Schenck W. Open set task augmentation facilitates generalization of deep neural networks trained on small data sets. Neural Computing and Applications. 2022 Apr;34(8):6067-6083. Epub 2021 Dez 9. doi: 10.1007/s00521-021-06753-6

Zai El Amri, Wadhah ; Reinhart, Felix ; Schenck, Wolfram. / Open set task augmentation facilitates generalization of deep neural networks trained on small data sets. in: Neural Computing and Applications. 2022 ; Jahrgang 34, Nr. 8. S. 6067-6083.

Download

@article{f95f80b87658439b8461be9667f13493,

title = "Open set task augmentation facilitates generalization of deep neural networks trained on small data sets",

abstract = "Many application scenarios for image recognition require learning of deep networks from small sample sizes in the order of a few hundred samples per class. Then, avoiding overfitting is critical. Common techniques to address overfitting are transfer learning, reduction of model complexity and artificial enrichment of the available data by, e.g., data augmentation. A key idea proposed in this paper is to incorporate additional samples into the training that do not belong to the classes of the target task. This can be accomplished by formulating the original classification task as an open set classification task. While the original closed set classification task is not altered at inference time, the recast as open set classification task enables the inclusion of additional data during training. Hence, the original closed set classification task is augmented with an open set task during training. We therefore call the proposed approach open set task augmentation. In order to integrate additional task-unrelated samples into the training, we employ the entropic open set loss originally proposed for open set classification tasks and also show that similar results can be obtained with a modified sum of squared errors loss function. Learning with the proposed approach benefits from the integration of additional “unknown” samples, which are often available, e.g., from open data sets, and can then be easily integrated into the learning process. We show that this open set task augmentation can improve model performance even when these additional samples are rather few or far from the domain of the target task. The proposed approach is demonstrated on two exemplary scenarios based on subsets of the ImageNet and Food-101 data sets as well as with several network architectures and two loss functions. We further shed light on the impact of the entropic open set loss on the internal representations formed by the networks. Open set task augmentation is particularly valuable when no additional data from the target classes are available—a scenario often faced in practice.",

keywords = "Convolutional neural networks, Image recognition, Open set classification, Transfer learning",

author = "{Zai El Amri}, Wadhah and Felix Reinhart and Wolfram Schenck",

note = "Publisher Copyright: {\textcopyright} 2021, The Author(s).",

year = "2022",

month = apr,

doi = "10.1007/s00521-021-06753-6",

language = "English",

volume = "34",

pages = "6067--6083",

journal = "Neural Computing and Applications",

issn = "0941-0643",

publisher = "Springer London",

number = "8",

}

Download

TY - JOUR

T1 - Open set task augmentation facilitates generalization of deep neural networks trained on small data sets

AU - Zai El Amri, Wadhah

AU - Reinhart, Felix

AU - Schenck, Wolfram

PY - 2022/4

Y1 - 2022/4

N2 - Many application scenarios for image recognition require learning of deep networks from small sample sizes in the order of a few hundred samples per class. Then, avoiding overfitting is critical. Common techniques to address overfitting are transfer learning, reduction of model complexity and artificial enrichment of the available data by, e.g., data augmentation. A key idea proposed in this paper is to incorporate additional samples into the training that do not belong to the classes of the target task. This can be accomplished by formulating the original classification task as an open set classification task. While the original closed set classification task is not altered at inference time, the recast as open set classification task enables the inclusion of additional data during training. Hence, the original closed set classification task is augmented with an open set task during training. We therefore call the proposed approach open set task augmentation. In order to integrate additional task-unrelated samples into the training, we employ the entropic open set loss originally proposed for open set classification tasks and also show that similar results can be obtained with a modified sum of squared errors loss function. Learning with the proposed approach benefits from the integration of additional “unknown” samples, which are often available, e.g., from open data sets, and can then be easily integrated into the learning process. We show that this open set task augmentation can improve model performance even when these additional samples are rather few or far from the domain of the target task. The proposed approach is demonstrated on two exemplary scenarios based on subsets of the ImageNet and Food-101 data sets as well as with several network architectures and two loss functions. We further shed light on the impact of the entropic open set loss on the internal representations formed by the networks. Open set task augmentation is particularly valuable when no additional data from the target classes are available—a scenario often faced in practice.

AB - Many application scenarios for image recognition require learning of deep networks from small sample sizes in the order of a few hundred samples per class. Then, avoiding overfitting is critical. Common techniques to address overfitting are transfer learning, reduction of model complexity and artificial enrichment of the available data by, e.g., data augmentation. A key idea proposed in this paper is to incorporate additional samples into the training that do not belong to the classes of the target task. This can be accomplished by formulating the original classification task as an open set classification task. While the original closed set classification task is not altered at inference time, the recast as open set classification task enables the inclusion of additional data during training. Hence, the original closed set classification task is augmented with an open set task during training. We therefore call the proposed approach open set task augmentation. In order to integrate additional task-unrelated samples into the training, we employ the entropic open set loss originally proposed for open set classification tasks and also show that similar results can be obtained with a modified sum of squared errors loss function. Learning with the proposed approach benefits from the integration of additional “unknown” samples, which are often available, e.g., from open data sets, and can then be easily integrated into the learning process. We show that this open set task augmentation can improve model performance even when these additional samples are rather few or far from the domain of the target task. The proposed approach is demonstrated on two exemplary scenarios based on subsets of the ImageNet and Food-101 data sets as well as with several network architectures and two loss functions. We further shed light on the impact of the entropic open set loss on the internal representations formed by the networks. Open set task augmentation is particularly valuable when no additional data from the target classes are available—a scenario often faced in practice.

KW - Convolutional neural networks

KW - Image recognition

KW - Open set classification

KW - Transfer learning

UR - http://www.scopus.com/inward/record.url?scp=85120918189&partnerID=8YFLogxK

U2 - 10.1007/s00521-021-06753-6

DO - 10.1007/s00521-021-06753-6

M3 - Article

VL - 34

SP - 6067

EP - 6083

JO - Neural Computing and Applications

JF - Neural Computing and Applications

SN - 0941-0643

IS - 8

ER -

Research@Leibniz University

Open set task augmentation facilitates generalization of deep neural networks trained on small data sets

Autorschaft

Externe Organisationen

Details

Abstract

ASJC Scopus Sachgebiete

Zitieren

Von denselben Autoren

Optimizing BioTac Simulation for Realistic Tactile Perception

Hierarchical Decentralized Deep Reinforcement Learning Architecture for a Simulated Four-Legged Agent

Transfer Learning with Jukebox for Music Source Separation