Details
Originalsprache | Englisch |
---|---|
Aufsatznummer | 553 |
Seitenumfang | 10 |
Fachzeitschrift | Communications Biology |
Jahrgang | 7 |
Ausgabenummer | 1 |
Publikationsstatus | Veröffentlicht - 9 Mai 2024 |
Abstract
For the last two decades, the amount of genomic data produced by scientific and medical applications has been growing at a rapid pace. To enable software solutions that analyze, process, and transmit these data in an efficient and interoperable way, ISO and IEC released the first version of the compression standard MPEG-G in 2019. However, non-proprietary implementations of the standard are not openly available so far, limiting fair scientific assessment of the standard and, therefore, hindering its broad adoption. In this paper, we present Genie, to the best of our knowledge the first open-source encoder that compresses genomic data according to the MPEG-G standard. We demonstrate that Genie reaches state-of-the-art compression ratios while offering interoperability with any other standard-compliant decoder independent from its manufacturer. Finally, the ISO/IEC ecosystem ensures the long-term sustainability and decodability of the compressed data through the ISO/IEC-supported reference decoder.
ASJC Scopus Sachgebiete
- Medizin (insg.)
- Medizin (sonstige)
- Biochemie, Genetik und Molekularbiologie (insg.)
- Allgemeine Biochemie, Genetik und Molekularbiologie
- Agrar- und Biowissenschaften (insg.)
- Allgemeine Agrar- und Biowissenschaften
Zitieren
- Standard
- Harvard
- Apa
- Vancouver
- BibTex
- RIS
in: Communications Biology, Jahrgang 7, Nr. 1, 553, 09.05.2024.
Publikation: Beitrag in Fachzeitschrift › Artikel › Forschung › Peer-Review
}
TY - JOUR
T1 - Genie
T2 - the first open-source ISO/IEC encoder for genomic data
AU - Müntefering, Fabian
AU - Adhisantoso, Yeremia Gunawan
AU - Chandak, Shubham
AU - Ostermann, Jörn
AU - Hernaez, Mikel
AU - Voges, Jan
N1 - Publisher Copyright: © The Author(s) 2024.
PY - 2024/5/9
Y1 - 2024/5/9
N2 - For the last two decades, the amount of genomic data produced by scientific and medical applications has been growing at a rapid pace. To enable software solutions that analyze, process, and transmit these data in an efficient and interoperable way, ISO and IEC released the first version of the compression standard MPEG-G in 2019. However, non-proprietary implementations of the standard are not openly available so far, limiting fair scientific assessment of the standard and, therefore, hindering its broad adoption. In this paper, we present Genie, to the best of our knowledge the first open-source encoder that compresses genomic data according to the MPEG-G standard. We demonstrate that Genie reaches state-of-the-art compression ratios while offering interoperability with any other standard-compliant decoder independent from its manufacturer. Finally, the ISO/IEC ecosystem ensures the long-term sustainability and decodability of the compressed data through the ISO/IEC-supported reference decoder.
AB - For the last two decades, the amount of genomic data produced by scientific and medical applications has been growing at a rapid pace. To enable software solutions that analyze, process, and transmit these data in an efficient and interoperable way, ISO and IEC released the first version of the compression standard MPEG-G in 2019. However, non-proprietary implementations of the standard are not openly available so far, limiting fair scientific assessment of the standard and, therefore, hindering its broad adoption. In this paper, we present Genie, to the best of our knowledge the first open-source encoder that compresses genomic data according to the MPEG-G standard. We demonstrate that Genie reaches state-of-the-art compression ratios while offering interoperability with any other standard-compliant decoder independent from its manufacturer. Finally, the ISO/IEC ecosystem ensures the long-term sustainability and decodability of the compressed data through the ISO/IEC-supported reference decoder.
UR - http://www.scopus.com/inward/record.url?scp=85192568613&partnerID=8YFLogxK
U2 - 10.1038/s42003-024-06249-8
DO - 10.1038/s42003-024-06249-8
M3 - Article
C2 - 38724695
AN - SCOPUS:85192568613
VL - 7
JO - Communications Biology
JF - Communications Biology
IS - 1
M1 - 553
ER -