Genie: the first open-source ISO/IEC encoder for genomic data

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Autoren

  • Fabian Müntefering
  • Yeremia Gunawan Adhisantoso
  • Shubham Chandak
  • Jörn Ostermann
  • Mikel Hernaez
  • Jan Voges

Externe Organisationen

  • Stanford University
  • University of Navarra
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Aufsatznummer553
Seitenumfang10
FachzeitschriftCommunications Biology
Jahrgang7
Ausgabenummer1
PublikationsstatusVeröffentlicht - 9 Mai 2024

Abstract

For the last two decades, the amount of genomic data produced by scientific and medical applications has been growing at a rapid pace. To enable software solutions that analyze, process, and transmit these data in an efficient and interoperable way, ISO and IEC released the first version of the compression standard MPEG-G in 2019. However, non-proprietary implementations of the standard are not openly available so far, limiting fair scientific assessment of the standard and, therefore, hindering its broad adoption. In this paper, we present Genie, to the best of our knowledge the first open-source encoder that compresses genomic data according to the MPEG-G standard. We demonstrate that Genie reaches state-of-the-art compression ratios while offering interoperability with any other standard-compliant decoder independent from its manufacturer. Finally, the ISO/IEC ecosystem ensures the long-term sustainability and decodability of the compressed data through the ISO/IEC-supported reference decoder.

Zitieren

Genie: the first open-source ISO/IEC encoder for genomic data. / Müntefering, Fabian; Adhisantoso, Yeremia Gunawan; Chandak, Shubham et al.
in: Communications Biology, Jahrgang 7, Nr. 1, 553, 09.05.2024.

Publikation: Beitrag in FachzeitschriftArtikelForschungPeer-Review

Müntefering, F, Adhisantoso, YG, Chandak, S, Ostermann, J, Hernaez, M & Voges, J 2024, 'Genie: the first open-source ISO/IEC encoder for genomic data', Communications Biology, Jg. 7, Nr. 1, 553. https://doi.org/10.1038/s42003-024-06249-8
Müntefering, F., Adhisantoso, Y. G., Chandak, S., Ostermann, J., Hernaez, M., & Voges, J. (2024). Genie: the first open-source ISO/IEC encoder for genomic data. Communications Biology, 7(1), Artikel 553. https://doi.org/10.1038/s42003-024-06249-8
Müntefering F, Adhisantoso YG, Chandak S, Ostermann J, Hernaez M, Voges J. Genie: the first open-source ISO/IEC encoder for genomic data. Communications Biology. 2024 Mai 9;7(1):553. doi: 10.1038/s42003-024-06249-8
Müntefering, Fabian ; Adhisantoso, Yeremia Gunawan ; Chandak, Shubham et al. / Genie : the first open-source ISO/IEC encoder for genomic data. in: Communications Biology. 2024 ; Jahrgang 7, Nr. 1.
Download
@article{55fde8f189854efeaf36589d4eac8f3a,
title = "Genie: the first open-source ISO/IEC encoder for genomic data",
abstract = "For the last two decades, the amount of genomic data produced by scientific and medical applications has been growing at a rapid pace. To enable software solutions that analyze, process, and transmit these data in an efficient and interoperable way, ISO and IEC released the first version of the compression standard MPEG-G in 2019. However, non-proprietary implementations of the standard are not openly available so far, limiting fair scientific assessment of the standard and, therefore, hindering its broad adoption. In this paper, we present Genie, to the best of our knowledge the first open-source encoder that compresses genomic data according to the MPEG-G standard. We demonstrate that Genie reaches state-of-the-art compression ratios while offering interoperability with any other standard-compliant decoder independent from its manufacturer. Finally, the ISO/IEC ecosystem ensures the long-term sustainability and decodability of the compressed data through the ISO/IEC-supported reference decoder.",
author = "Fabian M{\"u}ntefering and Adhisantoso, {Yeremia Gunawan} and Shubham Chandak and J{\"o}rn Ostermann and Mikel Hernaez and Jan Voges",
note = "Publisher Copyright: {\textcopyright} The Author(s) 2024.",
year = "2024",
month = may,
day = "9",
doi = "10.1038/s42003-024-06249-8",
language = "English",
volume = "7",
number = "1",

}

Download

TY - JOUR

T1 - Genie

T2 - the first open-source ISO/IEC encoder for genomic data

AU - Müntefering, Fabian

AU - Adhisantoso, Yeremia Gunawan

AU - Chandak, Shubham

AU - Ostermann, Jörn

AU - Hernaez, Mikel

AU - Voges, Jan

N1 - Publisher Copyright: © The Author(s) 2024.

PY - 2024/5/9

Y1 - 2024/5/9

N2 - For the last two decades, the amount of genomic data produced by scientific and medical applications has been growing at a rapid pace. To enable software solutions that analyze, process, and transmit these data in an efficient and interoperable way, ISO and IEC released the first version of the compression standard MPEG-G in 2019. However, non-proprietary implementations of the standard are not openly available so far, limiting fair scientific assessment of the standard and, therefore, hindering its broad adoption. In this paper, we present Genie, to the best of our knowledge the first open-source encoder that compresses genomic data according to the MPEG-G standard. We demonstrate that Genie reaches state-of-the-art compression ratios while offering interoperability with any other standard-compliant decoder independent from its manufacturer. Finally, the ISO/IEC ecosystem ensures the long-term sustainability and decodability of the compressed data through the ISO/IEC-supported reference decoder.

AB - For the last two decades, the amount of genomic data produced by scientific and medical applications has been growing at a rapid pace. To enable software solutions that analyze, process, and transmit these data in an efficient and interoperable way, ISO and IEC released the first version of the compression standard MPEG-G in 2019. However, non-proprietary implementations of the standard are not openly available so far, limiting fair scientific assessment of the standard and, therefore, hindering its broad adoption. In this paper, we present Genie, to the best of our knowledge the first open-source encoder that compresses genomic data according to the MPEG-G standard. We demonstrate that Genie reaches state-of-the-art compression ratios while offering interoperability with any other standard-compliant decoder independent from its manufacturer. Finally, the ISO/IEC ecosystem ensures the long-term sustainability and decodability of the compressed data through the ISO/IEC-supported reference decoder.

UR - http://www.scopus.com/inward/record.url?scp=85192568613&partnerID=8YFLogxK

U2 - 10.1038/s42003-024-06249-8

DO - 10.1038/s42003-024-06249-8

M3 - Article

C2 - 38724695

AN - SCOPUS:85192568613

VL - 7

JO - Communications Biology

JF - Communications Biology

IS - 1

M1 - 553

ER -