A flexible class of dependence-aware multi-label loss functions

Eyke Hüllermeier; Marcel Wever; Eneldo Loza Mencia; Johannes Fürnkranz; Michael Rapp

doi:10.1007/s10994-021-06107-2

Details

Original language	English
Pages (from-to)	713-737
Number of pages	25
Journal	Machine learning
Volume	111
Issue number	2
Publication status	Published - Feb 2022
Externally published	Yes

Abstract

The idea to exploit label dependencies for better prediction is at the core of methods for multi-label classification (MLC), and performance improvements are normally explained in this way. Surprisingly, however, there is no established methodology that allows to analyze the dependence-awareness of MLC algorithms. With that goal in mind, we introduce a class of loss functions that are able to capture the important aspect of label dependence. To this end, we leverage the mathematical framework of non-additive measures and integrals. Roughly speaking, a non-additive measure allows for modeling the importance of correct predictions of label subsets (instead of single labels), and thereby their impact on the overall evaluation, in a flexible way. The well-known Hamming and subset 0/1 losses are rather extreme special cases of this function class, which give full importance to single label sets or the entire label set, respectively. We present concrete instantiations of this class, which appear to be especially appealing from a modeling perspective. The assessment of multi-label classifiers in terms of these losses is illustrated in an empirical study, clearly showing their aptness at capturing label dependencies. Finally, while not being the main goal of this study, we also show some preliminary results on the minimization of this parametrized family of losses.

Keywords

Analysis, Label dependence, Loss function, Multi-label classification, Non-additive measures

ASJC Scopus subject areas

Computer Science(all)
Software
Computer Science(all)
Artificial Intelligence

Cite this

A flexible class of dependence-aware multi-label loss functions. / Hüllermeier, Eyke; Wever, Marcel; Loza Mencia, Eneldo et al.
In: Machine learning, Vol. 111, No. 2, 02.2022, p. 713-737.

Research output: Contribution to journal › Article › Research › peer review

Hüllermeier, E, Wever, M, Loza Mencia, E, Fürnkranz, J & Rapp, M 2022, 'A flexible class of dependence-aware multi-label loss functions', Machine learning, vol. 111, no. 2, pp. 713-737. https://doi.org/10.1007/s10994-021-06107-2

Hüllermeier, E., Wever, M., Loza Mencia, E., Fürnkranz, J., & Rapp, M. (2022). A flexible class of dependence-aware multi-label loss functions. Machine learning, 111(2), 713-737. https://doi.org/10.1007/s10994-021-06107-2

Hüllermeier E, Wever M, Loza Mencia E, Fürnkranz J, Rapp M. A flexible class of dependence-aware multi-label loss functions. Machine learning. 2022 Feb;111(2):713-737. doi: 10.1007/s10994-021-06107-2

Hüllermeier, Eyke ; Wever, Marcel ; Loza Mencia, Eneldo et al. / A flexible class of dependence-aware multi-label loss functions. In: Machine learning. 2022 ; Vol. 111, No. 2. pp. 713-737.

Download

@article{1ffc80d7ee3243e0bf554778c51fb083,

title = "A flexible class of dependence-aware multi-label loss functions",

abstract = "The idea to exploit label dependencies for better prediction is at the core of methods for multi-label classification (MLC), and performance improvements are normally explained in this way. Surprisingly, however, there is no established methodology that allows to analyze the dependence-awareness of MLC algorithms. With that goal in mind, we introduce a class of loss functions that are able to capture the important aspect of label dependence. To this end, we leverage the mathematical framework of non-additive measures and integrals. Roughly speaking, a non-additive measure allows for modeling the importance of correct predictions of label subsets (instead of single labels), and thereby their impact on the overall evaluation, in a flexible way. The well-known Hamming and subset 0/1 losses are rather extreme special cases of this function class, which give full importance to single label sets or the entire label set, respectively. We present concrete instantiations of this class, which appear to be especially appealing from a modeling perspective. The assessment of multi-label classifiers in terms of these losses is illustrated in an empirical study, clearly showing their aptness at capturing label dependencies. Finally, while not being the main goal of this study, we also show some preliminary results on the minimization of this parametrized family of losses.",

keywords = "Analysis, Label dependence, Loss function, Multi-label classification, Non-additive measures",

author = "Eyke H{\"u}llermeier and Marcel Wever and {Loza Mencia}, Eneldo and Johannes F{\"u}rnkranz and Michael Rapp",

note = "Publisher Copyright: {\textcopyright} 2022, The Author(s).",

year = "2022",

month = feb,

doi = "10.1007/s10994-021-06107-2",

language = "English",

volume = "111",

pages = "713--737",

journal = "Machine learning",

issn = "0885-6125",

publisher = "Springer Netherlands",

number = "2",

}

Download

TY - JOUR

T1 - A flexible class of dependence-aware multi-label loss functions

AU - Hüllermeier, Eyke

AU - Wever, Marcel

AU - Loza Mencia, Eneldo

AU - Fürnkranz, Johannes

AU - Rapp, Michael

PY - 2022/2

Y1 - 2022/2

N2 - The idea to exploit label dependencies for better prediction is at the core of methods for multi-label classification (MLC), and performance improvements are normally explained in this way. Surprisingly, however, there is no established methodology that allows to analyze the dependence-awareness of MLC algorithms. With that goal in mind, we introduce a class of loss functions that are able to capture the important aspect of label dependence. To this end, we leverage the mathematical framework of non-additive measures and integrals. Roughly speaking, a non-additive measure allows for modeling the importance of correct predictions of label subsets (instead of single labels), and thereby their impact on the overall evaluation, in a flexible way. The well-known Hamming and subset 0/1 losses are rather extreme special cases of this function class, which give full importance to single label sets or the entire label set, respectively. We present concrete instantiations of this class, which appear to be especially appealing from a modeling perspective. The assessment of multi-label classifiers in terms of these losses is illustrated in an empirical study, clearly showing their aptness at capturing label dependencies. Finally, while not being the main goal of this study, we also show some preliminary results on the minimization of this parametrized family of losses.

AB - The idea to exploit label dependencies for better prediction is at the core of methods for multi-label classification (MLC), and performance improvements are normally explained in this way. Surprisingly, however, there is no established methodology that allows to analyze the dependence-awareness of MLC algorithms. With that goal in mind, we introduce a class of loss functions that are able to capture the important aspect of label dependence. To this end, we leverage the mathematical framework of non-additive measures and integrals. Roughly speaking, a non-additive measure allows for modeling the importance of correct predictions of label subsets (instead of single labels), and thereby their impact on the overall evaluation, in a flexible way. The well-known Hamming and subset 0/1 losses are rather extreme special cases of this function class, which give full importance to single label sets or the entire label set, respectively. We present concrete instantiations of this class, which appear to be especially appealing from a modeling perspective. The assessment of multi-label classifiers in terms of these losses is illustrated in an empirical study, clearly showing their aptness at capturing label dependencies. Finally, while not being the main goal of this study, we also show some preliminary results on the minimization of this parametrized family of losses.

KW - Analysis

KW - Label dependence

KW - Loss function

KW - Multi-label classification

KW - Non-additive measures

UR - http://www.scopus.com/inward/record.url?scp=85123113719&partnerID=8YFLogxK

U2 - 10.1007/s10994-021-06107-2

DO - 10.1007/s10994-021-06107-2

M3 - Article

AN - SCOPUS:85123113719

VL - 111

SP - 713

EP - 737

JO - Machine learning

JF - Machine learning

SN - 0885-6125

IS - 2

ER -

Research@Leibniz University

A flexible class of dependence-aware multi-label loss functions

Authors

External Research Organisations

Details

Abstract

Keywords

ASJC Scopus subject areas

Cite this

By the same author(s)

Hyperparameter optimization of two-branch neural networks in multi-target prediction

Best Arm Identification with Retroactively Increased Sampling Budget for More Resource-Efficient HPO

On the Importance of Initialization in Active Learning

Annotation uncertainty in the context of grammatical change

Configuration and Evaluation