Genie: the first open-source ISO/IEC encoder for genomic data

Research output: Contribution to journal › Article › Research › peer review

Authors

  • Fabian Müntefering
  • Yeremia Gunawan Adhisantoso
  • Shubham Chandak
  • Jörn Ostermann
  • Mikel Hernaez
  • Jan Voges

Research Organisations

External Research Organisations

  • Stanford University
  • University of Navarra

Details

Original language: English
Article number: 553
Number of pages: 10
Journal: Communications Biology
Volume: 7
Issue number: 1
Publication status: Published - 9 May 2024

Abstract

For the last two decades, the amount of genomic data produced by scientific and medical applications has been growing at a rapid pace. To enable software solutions that analyze, process, and transmit these data in an efficient and interoperable way, ISO and IEC released the first version of the compression standard MPEG-G in 2019. However, non-proprietary implementations of the standard are not openly available so far, limiting fair scientific assessment of the standard and, therefore, hindering its broad adoption. In this paper, we present Genie, to the best of our knowledge the first open-source encoder that compresses genomic data according to the MPEG-G standard. We demonstrate that Genie reaches state-of-the-art compression ratios while offering interoperability with any other standard-compliant decoder independent from its manufacturer. Finally, the ISO/IEC ecosystem ensures the long-term sustainability and decodability of the compressed data through the ISO/IEC-supported reference decoder.

Cite this

Genie: the first open-source ISO/IEC encoder for genomic data. / Müntefering, Fabian; Adhisantoso, Yeremia Gunawan; Chandak, Shubham et al.
In: Communications Biology, Vol. 7, No. 1, 553, 09.05.2024.

Müntefering, F, Adhisantoso, YG, Chandak, S, Ostermann, J, Hernaez, M & Voges, J 2024, 'Genie: the first open-source ISO/IEC encoder for genomic data', Communications Biology, vol. 7, no. 1, 553. https://doi.org/10.1038/s42003-024-06249-8
Müntefering, F., Adhisantoso, Y. G., Chandak, S., Ostermann, J., Hernaez, M., & Voges, J. (2024). Genie: the first open-source ISO/IEC encoder for genomic data. Communications Biology, 7(1), Article 553. https://doi.org/10.1038/s42003-024-06249-8
Müntefering F, Adhisantoso YG, Chandak S, Ostermann J, Hernaez M, Voges J. Genie: the first open-source ISO/IEC encoder for genomic data. Communications Biology. 2024 May 9;7(1):553. doi: 10.1038/s42003-024-06249-8
Müntefering, Fabian ; Adhisantoso, Yeremia Gunawan ; Chandak, Shubham et al. / Genie : the first open-source ISO/IEC encoder for genomic data. In: Communications Biology. 2024 ; Vol. 7, No. 1.
@article{55fde8f189854efeaf36589d4eac8f3a,
title = "Genie: the first open-source ISO/IEC encoder for genomic data",
abstract = "For the last two decades, the amount of genomic data produced by scientific and medical applications has been growing at a rapid pace. To enable software solutions that analyze, process, and transmit these data in an efficient and interoperable way, ISO and IEC released the first version of the compression standard MPEG-G in 2019. However, non-proprietary implementations of the standard are not openly available so far, limiting fair scientific assessment of the standard and, therefore, hindering its broad adoption. In this paper, we present Genie, to the best of our knowledge the first open-source encoder that compresses genomic data according to the MPEG-G standard. We demonstrate that Genie reaches state-of-the-art compression ratios while offering interoperability with any other standard-compliant decoder independent from its manufacturer. Finally, the ISO/IEC ecosystem ensures the long-term sustainability and decodability of the compressed data through the ISO/IEC-supported reference decoder.",
author = "Fabian M{\"u}ntefering and Adhisantoso, {Yeremia Gunawan} and Shubham Chandak and J{\"o}rn Ostermann and Mikel Hernaez and Jan Voges",
note = "Publisher Copyright: {\textcopyright} The Author(s) 2024.",
year = "2024",
month = may,
day = "9",
doi = "10.1038/s42003-024-06249-8",
language = "English",
volume = "7",
number = "1",
journal = "Communications Biology",
}

TY - JOUR

T1 - Genie

T2 - the first open-source ISO/IEC encoder for genomic data

AU - Müntefering, Fabian

AU - Adhisantoso, Yeremia Gunawan

AU - Chandak, Shubham

AU - Ostermann, Jörn

AU - Hernaez, Mikel

AU - Voges, Jan

N1 - Publisher Copyright: © The Author(s) 2024.

PY - 2024/5/9

Y1 - 2024/5/9

N2 - For the last two decades, the amount of genomic data produced by scientific and medical applications has been growing at a rapid pace. To enable software solutions that analyze, process, and transmit these data in an efficient and interoperable way, ISO and IEC released the first version of the compression standard MPEG-G in 2019. However, non-proprietary implementations of the standard are not openly available so far, limiting fair scientific assessment of the standard and, therefore, hindering its broad adoption. In this paper, we present Genie, to the best of our knowledge the first open-source encoder that compresses genomic data according to the MPEG-G standard. We demonstrate that Genie reaches state-of-the-art compression ratios while offering interoperability with any other standard-compliant decoder independent from its manufacturer. Finally, the ISO/IEC ecosystem ensures the long-term sustainability and decodability of the compressed data through the ISO/IEC-supported reference decoder.

UR - http://www.scopus.com/inward/record.url?scp=85192568613&partnerID=8YFLogxK

U2 - 10.1038/s42003-024-06249-8

DO - 10.1038/s42003-024-06249-8

M3 - Article

C2 - 38724695

AN - SCOPUS:85192568613

VL - 7

JO - Communications Biology

JF - Communications Biology

IS - 1

M1 - 553

ER -