Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case

Research output: Contribution to journalArticleResearchpeer review

Authors

Research Organisations

External Research Organisations

  • University “Mediterranea” of Reggio Calabria
  • University of Pisa
View graph of relations

Details

Original languageEnglish
Article number18
JournalBig Data and Cognitive Computing
Volume7
Issue number1
Early online date18 Jan 2023
Publication statusPublished - Mar 2023

Abstract

Healthcare data are distributed and confidential, making it difficult to use centralized automatic diagnostic techniques. For example, different hospitals hold the electronic health records (EHRs) of different patient populations; however, transferring this data between hospitals is difficult due to the sensitive nature of the information. This presents a significant obstacle to the development of efficient and generalizable analytical methods that require a large amount of diverse Big Data. Federated learning allows multiple institutions to work together to develop a machine learning algorithm without sharing their data. We conducted a systematic study to analyze the current state of FL in the healthcare industry and explore both the limitations of this technology and its potential. Organizations share the parameters of their models with each other. This allows them to reap the benefits of a model developed with a richer data set while protecting the confidentiality of their data. Standard methods for large-scale machine learning, distributed optimization, and privacy-friendly data analytics need to be fundamentally rethought to address the new problems posed by training on diverse networks that may contain large amounts of data. In this article, we discuss the particular qualities and difficulties of federated learning, provide a comprehensive overview of current approaches, and outline several directions for future work that are relevant to a variety of research communities. These issues are important to many different research communities.

Keywords

    federated learning, health, privacy preserving

ASJC Scopus subject areas

Cite this

Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case. / Singh, Gurtaj; Violi, Vincenzo; Fisichella, Marco.
In: Big Data and Cognitive Computing, Vol. 7, No. 1, 18, 03.2023.

Research output: Contribution to journalArticleResearchpeer review

Singh G, Violi V, Fisichella M. Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case. Big Data and Cognitive Computing. 2023 Mar;7(1):18. Epub 2023 Jan 18. doi: 10.3390/bdcc7010018
Singh, Gurtaj ; Violi, Vincenzo ; Fisichella, Marco. / Federated Learning to Safeguard Patients Data : A Medical Image Retrieval Case. In: Big Data and Cognitive Computing. 2023 ; Vol. 7, No. 1.
Download
@article{5a185a60ed5947abbfd3d965159eb907,
title = "Federated Learning to Safeguard Patients Data: A Medical Image Retrieval Case",
abstract = "Healthcare data are distributed and confidential, making it difficult to use centralized automatic diagnostic techniques. For example, different hospitals hold the electronic health records (EHRs) of different patient populations; however, transferring this data between hospitals is difficult due to the sensitive nature of the information. This presents a significant obstacle to the development of efficient and generalizable analytical methods that require a large amount of diverse Big Data. Federated learning allows multiple institutions to work together to develop a machine learning algorithm without sharing their data. We conducted a systematic study to analyze the current state of FL in the healthcare industry and explore both the limitations of this technology and its potential. Organizations share the parameters of their models with each other. This allows them to reap the benefits of a model developed with a richer data set while protecting the confidentiality of their data. Standard methods for large-scale machine learning, distributed optimization, and privacy-friendly data analytics need to be fundamentally rethought to address the new problems posed by training on diverse networks that may contain large amounts of data. In this article, we discuss the particular qualities and difficulties of federated learning, provide a comprehensive overview of current approaches, and outline several directions for future work that are relevant to a variety of research communities. These issues are important to many different research communities.",
keywords = "federated learning, health, privacy preserving",
author = "Gurtaj Singh and Vincenzo Violi and Marco Fisichella",
note = "Funding Information: This research was funded by L3S Research Center of Leibniz University of Hannover, Germany.",
year = "2023",
month = mar,
doi = "10.3390/bdcc7010018",
language = "English",
volume = "7",
number = "1",

}

Download

TY - JOUR

T1 - Federated Learning to Safeguard Patients Data

T2 - A Medical Image Retrieval Case

AU - Singh, Gurtaj

AU - Violi, Vincenzo

AU - Fisichella, Marco

N1 - Funding Information: This research was funded by L3S Research Center of Leibniz University of Hannover, Germany.

PY - 2023/3

Y1 - 2023/3

N2 - Healthcare data are distributed and confidential, making it difficult to use centralized automatic diagnostic techniques. For example, different hospitals hold the electronic health records (EHRs) of different patient populations; however, transferring this data between hospitals is difficult due to the sensitive nature of the information. This presents a significant obstacle to the development of efficient and generalizable analytical methods that require a large amount of diverse Big Data. Federated learning allows multiple institutions to work together to develop a machine learning algorithm without sharing their data. We conducted a systematic study to analyze the current state of FL in the healthcare industry and explore both the limitations of this technology and its potential. Organizations share the parameters of their models with each other. This allows them to reap the benefits of a model developed with a richer data set while protecting the confidentiality of their data. Standard methods for large-scale machine learning, distributed optimization, and privacy-friendly data analytics need to be fundamentally rethought to address the new problems posed by training on diverse networks that may contain large amounts of data. In this article, we discuss the particular qualities and difficulties of federated learning, provide a comprehensive overview of current approaches, and outline several directions for future work that are relevant to a variety of research communities. These issues are important to many different research communities.

AB - Healthcare data are distributed and confidential, making it difficult to use centralized automatic diagnostic techniques. For example, different hospitals hold the electronic health records (EHRs) of different patient populations; however, transferring this data between hospitals is difficult due to the sensitive nature of the information. This presents a significant obstacle to the development of efficient and generalizable analytical methods that require a large amount of diverse Big Data. Federated learning allows multiple institutions to work together to develop a machine learning algorithm without sharing their data. We conducted a systematic study to analyze the current state of FL in the healthcare industry and explore both the limitations of this technology and its potential. Organizations share the parameters of their models with each other. This allows them to reap the benefits of a model developed with a richer data set while protecting the confidentiality of their data. Standard methods for large-scale machine learning, distributed optimization, and privacy-friendly data analytics need to be fundamentally rethought to address the new problems posed by training on diverse networks that may contain large amounts of data. In this article, we discuss the particular qualities and difficulties of federated learning, provide a comprehensive overview of current approaches, and outline several directions for future work that are relevant to a variety of research communities. These issues are important to many different research communities.

KW - federated learning

KW - health

KW - privacy preserving

UR - http://www.scopus.com/inward/record.url?scp=85151096933&partnerID=8YFLogxK

U2 - 10.3390/bdcc7010018

DO - 10.3390/bdcc7010018

M3 - Article

AN - SCOPUS:85151096933

VL - 7

JO - Big Data and Cognitive Computing

JF - Big Data and Cognitive Computing

IS - 1

M1 - 18

ER -

By the same author(s)