Distribution-Aware Multi-Label FixMatch for Semi-Supervised Learning on CheXpert.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

View graph of relations

Details

Original languageEnglish
Title of host publicationProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops
Pages2295-2304
Number of pages10
Publication statusE-pub ahead of print - 1 Jun 2024

Abstract

<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<title>Semi-supervised Learning for Multi-label Classification</title>
<meta charset="UTF-8" />
</head>
<body>
<p>
Semi-supervised learning (SSL) has achieved remarkable success for multiclass classification in recent years,
yielding a promising solution for medical image classification where labeled data is scarce but unlabeled images are accessible.
In the context of multi-label problems, however, SSL is still under-explored.
In this work, we adapt FixMatch to the multi-label scenario, specifically focusing on CheXpert,
a multi-label chest X-ray classification dataset which is imbalanced and only partially labeled.
</p>
<p>
Leveraging distribution alignment, our proposed method, ML-FixMatch+DA, achieves solid performance gains in SSL tasks
(AUC: +2.6%) and in a missing label scenario (AUC: +1.9%).
In contrast to previous work, we achieve a performance gain on CheXpert using FixMatch.
We show that in contrast to multiclass FixMatch, where distribution alignment is optional,
it is essential for multi-label FixMatch to handle class imbalance and generate reliable (positive and negative) pseudo-labels.
</p>
<p>
Our pseudo-label selection is based on a single threshold for all classes and handles imbalance with no prior knowledge on label distributions.
Our adaptation keeps the simplicity of the original multiclass FixMatch with no added hyperparameters (even for imbalanced data)
and demonstrates the feasibility of simple SSL for multi-label problems, filling a crucial gap in the literature.
</p>
</body>
</html>

Keywords

    semi supervised learning, CheXpert, multi-label classification

Cite this

Distribution-Aware Multi-Label FixMatch for Semi-Supervised Learning on CheXpert. / Ihler, Sontje; Kuhnke, Felix; Kuhlgatz, Timo et al.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops. 2024. p. 2295-2304.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Ihler, S, Kuhnke, F, Kuhlgatz, T & Seel, T 2024, Distribution-Aware Multi-Label FixMatch for Semi-Supervised Learning on CheXpert. in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops. pp. 2295-2304.
Ihler, S., Kuhnke, F., Kuhlgatz, T., & Seel, T. (2024). Distribution-Aware Multi-Label FixMatch for Semi-Supervised Learning on CheXpert. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops (pp. 2295-2304) Advance online publication.
Ihler S, Kuhnke F, Kuhlgatz T, Seel T. Distribution-Aware Multi-Label FixMatch for Semi-Supervised Learning on CheXpert. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops. 2024. p. 2295-2304 Epub 2024 Jun 1.
Ihler, Sontje ; Kuhnke, Felix ; Kuhlgatz, Timo et al. / Distribution-Aware Multi-Label FixMatch for Semi-Supervised Learning on CheXpert. Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops. 2024. pp. 2295-2304
Download
@inproceedings{a80b16e0a99d48d6891b50ebdabcd4be,
title = "Distribution-Aware Multi-Label FixMatch for Semi-Supervised Learning on CheXpert.",
abstract = "<!DOCTYPE html>Semi-supervised Learning for Multi-label ClassificationSemi-supervised learning (SSL) has achieved remarkable success for multiclass classification in recent years,yielding a promising solution for medical image classification where labeled data is scarce but unlabeled images are accessible.In the context of multi-label problems, however, SSL is still under-explored.In this work, we adapt FixMatch to the multi-label scenario, specifically focusing on CheXpert,a multi-label chest X-ray classification dataset which is imbalanced and only partially labeled.Leveraging distribution alignment, our proposed method, ML-FixMatch+DA, achieves solid performance gains in SSL tasks(AUC: +2.6%) and in a missing label scenario (AUC: +1.9%).In contrast to previous work, we achieve a performance gain on CheXpert using FixMatch.We show that in contrast to multiclass FixMatch, where distribution alignment is optional,it is essential for multi-label FixMatch to handle class imbalance and generate reliable (positive and negative) pseudo-labels.Our pseudo-label selection is based on a single threshold for all classes and handles imbalance with no prior knowledge on label distributions.Our adaptation keeps the simplicity of the original multiclass FixMatch with no added hyperparameters (even for imbalanced data)and demonstrates the feasibility of simple SSL for multi-label problems, filling a crucial gap in the literature.",
keywords = "semi supervised learning, CheXpert, multi-label classification",
author = "Sontje Ihler and Felix Kuhnke and Timo Kuhlgatz and Thomas Seel",
year = "2024",
month = jun,
day = "1",
language = "English",
pages = "2295--2304",
booktitle = "Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops",

}

Download

TY - GEN

T1 - Distribution-Aware Multi-Label FixMatch for Semi-Supervised Learning on CheXpert.

AU - Ihler, Sontje

AU - Kuhnke, Felix

AU - Kuhlgatz, Timo

AU - Seel, Thomas

PY - 2024/6/1

Y1 - 2024/6/1

N2 - <!DOCTYPE html>Semi-supervised Learning for Multi-label ClassificationSemi-supervised learning (SSL) has achieved remarkable success for multiclass classification in recent years,yielding a promising solution for medical image classification where labeled data is scarce but unlabeled images are accessible.In the context of multi-label problems, however, SSL is still under-explored.In this work, we adapt FixMatch to the multi-label scenario, specifically focusing on CheXpert,a multi-label chest X-ray classification dataset which is imbalanced and only partially labeled.Leveraging distribution alignment, our proposed method, ML-FixMatch+DA, achieves solid performance gains in SSL tasks(AUC: +2.6%) and in a missing label scenario (AUC: +1.9%).In contrast to previous work, we achieve a performance gain on CheXpert using FixMatch.We show that in contrast to multiclass FixMatch, where distribution alignment is optional,it is essential for multi-label FixMatch to handle class imbalance and generate reliable (positive and negative) pseudo-labels.Our pseudo-label selection is based on a single threshold for all classes and handles imbalance with no prior knowledge on label distributions.Our adaptation keeps the simplicity of the original multiclass FixMatch with no added hyperparameters (even for imbalanced data)and demonstrates the feasibility of simple SSL for multi-label problems, filling a crucial gap in the literature.

AB - <!DOCTYPE html>Semi-supervised Learning for Multi-label ClassificationSemi-supervised learning (SSL) has achieved remarkable success for multiclass classification in recent years,yielding a promising solution for medical image classification where labeled data is scarce but unlabeled images are accessible.In the context of multi-label problems, however, SSL is still under-explored.In this work, we adapt FixMatch to the multi-label scenario, specifically focusing on CheXpert,a multi-label chest X-ray classification dataset which is imbalanced and only partially labeled.Leveraging distribution alignment, our proposed method, ML-FixMatch+DA, achieves solid performance gains in SSL tasks(AUC: +2.6%) and in a missing label scenario (AUC: +1.9%).In contrast to previous work, we achieve a performance gain on CheXpert using FixMatch.We show that in contrast to multiclass FixMatch, where distribution alignment is optional,it is essential for multi-label FixMatch to handle class imbalance and generate reliable (positive and negative) pseudo-labels.Our pseudo-label selection is based on a single threshold for all classes and handles imbalance with no prior knowledge on label distributions.Our adaptation keeps the simplicity of the original multiclass FixMatch with no added hyperparameters (even for imbalanced data)and demonstrates the feasibility of simple SSL for multi-label problems, filling a crucial gap in the literature.

KW - semi supervised learning

KW - CheXpert

KW - multi-label classification

M3 - Conference contribution

SP - 2295

EP - 2304

BT - Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops

ER -

By the same author(s)