Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case

Gurtaj Singh; Vincenzo Violi; Marco Fisichella

doi:10.3390/bdcc7010018

Details

Original language	English
Article number	18
Journal	Big Data and Cognitive Computing
Volume	7
Issue number	1
Early online date	18 Jan 2023
Publication status	Published - Mar 2023

Abstract

Healthcare data are distributed and confidential, making it difficult to use centralized automatic diagnostic techniques. For example, different hospitals hold the electronic health records (EHRs) of different patient populations; however, transferring this data between hospitals is difficult due to the sensitive nature of the information. This presents a significant obstacle to the development of efficient and generalizable analytical methods that require a large amount of diverse Big Data. Federated learning allows multiple institutions to work together to develop a machine learning algorithm without sharing their data. We conducted a systematic study to analyze the current state of FL in the healthcare industry and explore both the limitations of this technology and its potential. Organizations share the parameters of their models with each other. This allows them to reap the benefits of a model developed with a richer data set while protecting the confidentiality of their data. Standard methods for large-scale machine learning, distributed optimization, and privacy-friendly data analytics need to be fundamentally rethought to address the new problems posed by training on diverse networks that may contain large amounts of data. In this article, we discuss the particular qualities and difficulties of federated learning, provide a comprehensive overview of current approaches, and outline several directions for future work that are relevant to a variety of research communities. These issues are important to many different research communities.

Keywords

federated learning, health, privacy preserving

ASJC Scopus subject areas

Business, Management and Accounting(all)
Management Information Systems
Computer Science(all)
Information Systems
Computer Science(all)
Computer Science Applications
Computer Science(all)
Artificial Intelligence

Cite this

Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case. / Singh, Gurtaj; Violi, Vincenzo; Fisichella, Marco.
In: Big Data and Cognitive Computing, Vol. 7, No. 1, 18, 03.2023.

Research output: Contribution to journal › Article › Research › peer review

Singh, G, Violi, V & Fisichella, M 2023, 'Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case', Big Data and Cognitive Computing, vol. 7, no. 1, 18. https://doi.org/10.3390/bdcc7010018

Singh, G., Violi, V., & Fisichella, M. (2023). Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case. Big Data and Cognitive Computing, 7(1), Article 18. https://doi.org/10.3390/bdcc7010018

Singh G, Violi V, Fisichella M. Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case. Big Data and Cognitive Computing. 2023 Mar;7(1):18. Epub 2023 Jan 18. doi: 10.3390/bdcc7010018

Singh, Gurtaj ; Violi, Vincenzo ; Fisichella, Marco. / Federated Learning to Safeguard Patients Data : A Medical Image Retrieval Case. In: Big Data and Cognitive Computing. 2023 ; Vol. 7, No. 1.

Download

@article{5a185a60ed5947abbfd3d965159eb907,

title = "Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case",

abstract = "Healthcare data are distributed and confidential, making it difficult to use centralized automatic diagnostic techniques. For example, different hospitals hold the electronic health records (EHRs) of different patient populations; however, transferring this data between hospitals is difficult due to the sensitive nature of the information. This presents a significant obstacle to the development of efficient and generalizable analytical methods that require a large amount of diverse Big Data. Federated learning allows multiple institutions to work together to develop a machine learning algorithm without sharing their data. We conducted a systematic study to analyze the current state of FL in the healthcare industry and explore both the limitations of this technology and its potential. Organizations share the parameters of their models with each other. This allows them to reap the benefits of a model developed with a richer data set while protecting the confidentiality of their data. Standard methods for large-scale machine learning, distributed optimization, and privacy-friendly data analytics need to be fundamentally rethought to address the new problems posed by training on diverse networks that may contain large amounts of data. In this article, we discuss the particular qualities and difficulties of federated learning, provide a comprehensive overview of current approaches, and outline several directions for future work that are relevant to a variety of research communities. These issues are important to many different research communities.",

keywords = "federated learning, health, privacy preserving",

author = "Gurtaj Singh and Vincenzo Violi and Marco Fisichella",

note = "Funding Information: This research was funded by L3S Research Center of Leibniz University of Hannover, Germany.",

year = "2023",

month = mar,

doi = "10.3390/bdcc7010018",

language = "English",

volume = "7",

number = "1",

}

Download

TY - JOUR

T1 - Federated Learning to Safeguard Patients Data

T2 - A Medical Image Retrieval Case

AU - Singh, Gurtaj

AU - Violi, Vincenzo

AU - Fisichella, Marco

N1 - Funding Information: This research was funded by L3S Research Center of Leibniz University of Hannover, Germany.

PY - 2023/3

Y1 - 2023/3

N2 - Healthcare data are distributed and confidential, making it difficult to use centralized automatic diagnostic techniques. For example, different hospitals hold the electronic health records (EHRs) of different patient populations; however, transferring this data between hospitals is difficult due to the sensitive nature of the information. This presents a significant obstacle to the development of efficient and generalizable analytical methods that require a large amount of diverse Big Data. Federated learning allows multiple institutions to work together to develop a machine learning algorithm without sharing their data. We conducted a systematic study to analyze the current state of FL in the healthcare industry and explore both the limitations of this technology and its potential. Organizations share the parameters of their models with each other. This allows them to reap the benefits of a model developed with a richer data set while protecting the confidentiality of their data. Standard methods for large-scale machine learning, distributed optimization, and privacy-friendly data analytics need to be fundamentally rethought to address the new problems posed by training on diverse networks that may contain large amounts of data. In this article, we discuss the particular qualities and difficulties of federated learning, provide a comprehensive overview of current approaches, and outline several directions for future work that are relevant to a variety of research communities. These issues are important to many different research communities.

AB - Healthcare data are distributed and confidential, making it difficult to use centralized automatic diagnostic techniques. For example, different hospitals hold the electronic health records (EHRs) of different patient populations; however, transferring this data between hospitals is difficult due to the sensitive nature of the information. This presents a significant obstacle to the development of efficient and generalizable analytical methods that require a large amount of diverse Big Data. Federated learning allows multiple institutions to work together to develop a machine learning algorithm without sharing their data. We conducted a systematic study to analyze the current state of FL in the healthcare industry and explore both the limitations of this technology and its potential. Organizations share the parameters of their models with each other. This allows them to reap the benefits of a model developed with a richer data set while protecting the confidentiality of their data. Standard methods for large-scale machine learning, distributed optimization, and privacy-friendly data analytics need to be fundamentally rethought to address the new problems posed by training on diverse networks that may contain large amounts of data. In this article, we discuss the particular qualities and difficulties of federated learning, provide a comprehensive overview of current approaches, and outline several directions for future work that are relevant to a variety of research communities. These issues are important to many different research communities.

KW - federated learning

KW - health

KW - privacy preserving

UR - http://www.scopus.com/inward/record.url?scp=85151096933&partnerID=8YFLogxK

U2 - 10.3390/bdcc7010018

DO - 10.3390/bdcc7010018

M3 - Article

AN - SCOPUS:85151096933

VL - 7

JO - Big Data and Cognitive Computing

JF - Big Data and Cognitive Computing

IS - 1

M1 - 18

ER -

Research@Leibniz University

Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case

Authors

Research Organisations

External Research Organisations

Details

Abstract

Keywords

ASJC Scopus subject areas

Cite this

By the same author(s)

LaMMOn: language model combined graph neural network for multi-target multi-camera tracking in online scenarios

Open benchmark for filtering techniques in entity resolution

Does a language model “understand” high school math? A survey of deep learning based word problem solvers

Harnessing Empathy and Ethics for Relevance Detection and Information Categorization in Climate and COVID-19 Tweets

A Trustworthy Approach to Classify and Analyze Epidemic-Related Information From Microblogs

LaMMOn: language model combined graph neural network for multi-target multi-camera tracking in online scenarios

Open benchmark for filtering techniques in entity resolution

Does a language model “understand” high school math? A survey of deep learning based word problem solvers

Harnessing Empathy and Ethics for Relevance Detection and Information Categorization in Climate and COVID-19 Tweets

A Trustworthy Approach to Classify and Analyze Epidemic-Related Information From Microblogs

LaMMOn: language model combined graph neural network for multi-target multi-camera tracking in online scenarios