Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case

Gurtaj Singh; Vincenzo Violi; Marco Fisichella

doi:10.3390/bdcc7010018

Details

Originalsprache	Englisch
Aufsatznummer	18
Fachzeitschrift	Big Data and Cognitive Computing
Jahrgang	7
Ausgabenummer	1
Frühes Online-Datum	18 Jan. 2023
Publikationsstatus	Veröffentlicht - März 2023

Abstract

Healthcare data are distributed and confidential, making it difficult to use centralized automatic diagnostic techniques. For example, different hospitals hold the electronic health records (EHRs) of different patient populations; however, transferring this data between hospitals is difficult due to the sensitive nature of the information. This presents a significant obstacle to the development of efficient and generalizable analytical methods that require a large amount of diverse Big Data. Federated learning allows multiple institutions to work together to develop a machine learning algorithm without sharing their data. We conducted a systematic study to analyze the current state of FL in the healthcare industry and explore both the limitations of this technology and its potential. Organizations share the parameters of their models with each other. This allows them to reap the benefits of a model developed with a richer data set while protecting the confidentiality of their data. Standard methods for large-scale machine learning, distributed optimization, and privacy-friendly data analytics need to be fundamentally rethought to address the new problems posed by training on diverse networks that may contain large amounts of data. In this article, we discuss the particular qualities and difficulties of federated learning, provide a comprehensive overview of current approaches, and outline several directions for future work that are relevant to a variety of research communities. These issues are important to many different research communities.

ASJC Scopus Sachgebiete

Betriebswirtschaft, Management und Rechnungswesen (insg.)
Management-Informationssysteme
Informatik (insg.)
Information systems
Informatik (insg.)
Angewandte Informatik
Informatik (insg.)
Artificial intelligence

Zitieren

Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case. / Singh, Gurtaj; Violi, Vincenzo; Fisichella, Marco.
in: Big Data and Cognitive Computing, Jahrgang 7, Nr. 1, 18, 03.2023.

Publikation: Beitrag in Fachzeitschrift › Artikel › Forschung › Peer-Review

Singh, G, Violi, V & Fisichella, M 2023, 'Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case', Big Data and Cognitive Computing, Jg. 7, Nr. 1, 18. https://doi.org/10.3390/bdcc7010018

Singh, G., Violi, V., & Fisichella, M. (2023). Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case. Big Data and Cognitive Computing, 7(1), Artikel 18. https://doi.org/10.3390/bdcc7010018

Singh G, Violi V, Fisichella M. Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case. Big Data and Cognitive Computing. 2023 Mär;7(1):18. Epub 2023 Jan 18. doi: 10.3390/bdcc7010018

Singh, Gurtaj ; Violi, Vincenzo ; Fisichella, Marco. / Federated Learning to Safeguard Patients Data : A Medical Image Retrieval Case. in: Big Data and Cognitive Computing. 2023 ; Jahrgang 7, Nr. 1.

Download

@article{5a185a60ed5947abbfd3d965159eb907,

title = "Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case",

abstract = "Healthcare data are distributed and confidential, making it difficult to use centralized automatic diagnostic techniques. For example, different hospitals hold the electronic health records (EHRs) of different patient populations; however, transferring this data between hospitals is difficult due to the sensitive nature of the information. This presents a significant obstacle to the development of efficient and generalizable analytical methods that require a large amount of diverse Big Data. Federated learning allows multiple institutions to work together to develop a machine learning algorithm without sharing their data. We conducted a systematic study to analyze the current state of FL in the healthcare industry and explore both the limitations of this technology and its potential. Organizations share the parameters of their models with each other. This allows them to reap the benefits of a model developed with a richer data set while protecting the confidentiality of their data. Standard methods for large-scale machine learning, distributed optimization, and privacy-friendly data analytics need to be fundamentally rethought to address the new problems posed by training on diverse networks that may contain large amounts of data. In this article, we discuss the particular qualities and difficulties of federated learning, provide a comprehensive overview of current approaches, and outline several directions for future work that are relevant to a variety of research communities. These issues are important to many different research communities.",

keywords = "federated learning, health, privacy preserving",

author = "Gurtaj Singh and Vincenzo Violi and Marco Fisichella",

note = "Funding Information: This research was funded by L3S Research Center of Leibniz University of Hannover, Germany.",

year = "2023",

month = mar,

doi = "10.3390/bdcc7010018",

language = "English",

volume = "7",

number = "1",

}

Download

TY - JOUR

T1 - Federated Learning to Safeguard Patients Data

T2 - A Medical Image Retrieval Case

AU - Singh, Gurtaj

AU - Violi, Vincenzo

AU - Fisichella, Marco

N1 - Funding Information: This research was funded by L3S Research Center of Leibniz University of Hannover, Germany.

PY - 2023/3

Y1 - 2023/3

N2 - Healthcare data are distributed and confidential, making it difficult to use centralized automatic diagnostic techniques. For example, different hospitals hold the electronic health records (EHRs) of different patient populations; however, transferring this data between hospitals is difficult due to the sensitive nature of the information. This presents a significant obstacle to the development of efficient and generalizable analytical methods that require a large amount of diverse Big Data. Federated learning allows multiple institutions to work together to develop a machine learning algorithm without sharing their data. We conducted a systematic study to analyze the current state of FL in the healthcare industry and explore both the limitations of this technology and its potential. Organizations share the parameters of their models with each other. This allows them to reap the benefits of a model developed with a richer data set while protecting the confidentiality of their data. Standard methods for large-scale machine learning, distributed optimization, and privacy-friendly data analytics need to be fundamentally rethought to address the new problems posed by training on diverse networks that may contain large amounts of data. In this article, we discuss the particular qualities and difficulties of federated learning, provide a comprehensive overview of current approaches, and outline several directions for future work that are relevant to a variety of research communities. These issues are important to many different research communities.

AB - Healthcare data are distributed and confidential, making it difficult to use centralized automatic diagnostic techniques. For example, different hospitals hold the electronic health records (EHRs) of different patient populations; however, transferring this data between hospitals is difficult due to the sensitive nature of the information. This presents a significant obstacle to the development of efficient and generalizable analytical methods that require a large amount of diverse Big Data. Federated learning allows multiple institutions to work together to develop a machine learning algorithm without sharing their data. We conducted a systematic study to analyze the current state of FL in the healthcare industry and explore both the limitations of this technology and its potential. Organizations share the parameters of their models with each other. This allows them to reap the benefits of a model developed with a richer data set while protecting the confidentiality of their data. Standard methods for large-scale machine learning, distributed optimization, and privacy-friendly data analytics need to be fundamentally rethought to address the new problems posed by training on diverse networks that may contain large amounts of data. In this article, we discuss the particular qualities and difficulties of federated learning, provide a comprehensive overview of current approaches, and outline several directions for future work that are relevant to a variety of research communities. These issues are important to many different research communities.

KW - federated learning

KW - health

KW - privacy preserving

UR - http://www.scopus.com/inward/record.url?scp=85151096933&partnerID=8YFLogxK

U2 - 10.3390/bdcc7010018

DO - 10.3390/bdcc7010018

M3 - Article

AN - SCOPUS:85151096933

VL - 7

JO - Big Data and Cognitive Computing

JF - Big Data and Cognitive Computing

IS - 1

M1 - 18

ER -

Research@Leibniz University

Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case

Autoren

Organisationseinheiten

Externe Organisationen

Details

Abstract

ASJC Scopus Sachgebiete

Zitieren

Von denselben Autoren

Harnessing Empathy and Ethics for Relevance Detection and Information Categorization in Climate and COVID-19 Tweets

A Trustworthy Approach to Classify and Analyze Epidemic-Related Information From Microblogs

LaMMOn: language model combined graph neural network for multi-target multi-camera tracking in online scenarios

Open benchmark for filtering techniques in entity resolution

Does a language model “understand” high school math? A survey of deep learning based word problem solvers