Details
Original language | English |
---|---|
Title of host publication | Proceedings - DCC 2022 |
Subtitle of host publication | 2022 Data Compression Conference |
Editors | Ali Bilgin, Michael W. Marcellin, Joan Serra-Sagrista, James A. Storer |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 399-408 |
Number of pages | 10 |
ISBN (electronic) | 9781665478939 |
ISBN (print) | 978-1-6654-7894-6 |
Publication status | Published - 2022 |
Event | 2022 Data Compression Conference, DCC 2022 - Snowbird, United States Duration: 22 Mar 2022 → 25 Mar 2022 |
Publication series
Name | Data Compression Conference Proceedings |
---|---|
Volume | 2022-March |
ISSN (Print) | 1068-0314 |
Abstract
The study of three-dimensional folding of chromosomes is important to understand genomics processes. This is done through techniques, such as Hi-C, that analyze the spatial organization of chromosomes in a cell. The data coming from the study is a 2-dimensional quantitative maps with genomic coordinate systems. We present a novel approach called Contact Matrix Compressor(CMC) for the efficient compression of Hi-C data. By exploiting the properties of the data, such as diagonally dominant and symmetrical, CMC achieves a much higher compression. CMC outperforms the existing method Cooler, and also the generic compression methods LZMA as well as BZip2.
ASJC Scopus subject areas
- Computer Science(all)
- Computer Networks and Communications
Cite this
- Standard
- Harvard
- Apa
- Vancouver
- BibTeX
- RIS
Proceedings - DCC 2022: 2022 Data Compression Conference. ed. / Ali Bilgin; Michael W. Marcellin; Joan Serra-Sagrista; James A. Storer. Institute of Electrical and Electronics Engineers Inc., 2022. p. 399-408 (Data Compression Conference Proceedings; Vol. 2022-March).
Research output: Chapter in book/report/conference proceeding › Conference contribution › Research › peer review
}
TY - GEN
T1 - Contact Matrix Compressor
AU - Adhisantoso, Yeremia Gunawan
AU - Ostermann, Jorn
PY - 2022
Y1 - 2022
N2 - The study of three-dimensional folding of chromosomes is important to understand genomics processes. This is done through techniques, such as Hi-C, that analyze the spatial organization of chromosomes in a cell. The data coming from the study is a 2-dimensional quantitative maps with genomic coordinate systems. We present a novel approach called Contact Matrix Compressor(CMC) for the efficient compression of Hi-C data. By exploiting the properties of the data, such as diagonally dominant and symmetrical, CMC achieves a much higher compression. CMC outperforms the existing method Cooler, and also the generic compression methods LZMA as well as BZip2.
AB - The study of three-dimensional folding of chromosomes is important to understand genomics processes. This is done through techniques, such as Hi-C, that analyze the spatial organization of chromosomes in a cell. The data coming from the study is a 2-dimensional quantitative maps with genomic coordinate systems. We present a novel approach called Contact Matrix Compressor(CMC) for the efficient compression of Hi-C data. By exploiting the properties of the data, such as diagonally dominant and symmetrical, CMC achieves a much higher compression. CMC outperforms the existing method Cooler, and also the generic compression methods LZMA as well as BZip2.
UR - http://www.scopus.com/inward/record.url?scp=85134401511&partnerID=8YFLogxK
U2 - 10.1109/DCC52660.2022.00048
DO - 10.1109/DCC52660.2022.00048
M3 - Conference contribution
AN - SCOPUS:85134401511
SN - 978-1-6654-7894-6
T3 - Data Compression Conference Proceedings
SP - 399
EP - 408
BT - Proceedings - DCC 2022
A2 - Bilgin, Ali
A2 - Marcellin, Michael W.
A2 - Serra-Sagrista, Joan
A2 - Storer, James A.
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2022 Data Compression Conference, DCC 2022
Y2 - 22 March 2022 through 25 March 2022
ER -