Differential gene expression with lossy compression of quality scores in RNA-seq data

Research output: Chapter in book/report/conference proceedingConference abstractResearchpeer review

Authors

  • Ana A. Hernandez-Lopez
  • Jan Voges
  • Claudio Alberti
  • Marco Mattavelli
  • Jörn Ostermann

Research Organisations

External Research Organisations

  • École polytechnique fédérale de Lausanne (EPFL)
View graph of relations

Details

Original languageEnglish
Title of host publicationProceedings - DCC 2017
Subtitle of host publication2017 Data Compression Conference
EditorsAli Bilgin, Joan Serra-Sagrista, Michael W. Marcellin, James A. Storer
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages444
Number of pages1
ISBN (electronic)9781509067213
Publication statusPublished - May 2017
Event2017 Data Compression Conference, DCC 2017 - Snowbird, United States
Duration: 4 Apr 20177 Apr 2017

Publication series

NameData Compression Conference Proceedings
VolumePart F127767
ISSN (Print)1068-0314

Abstract

High-Throughput sequencing of RNA molecules has enabled the quantitative analysis of the expression of genes at the expense of storage space and processing power. To help alleviate these problems, lossy compression methods of the quality scores associated to RNA sequence data have recently been proposed, and the evaluation of their impact on downstream analysis is gaining attention. This work presents a first assessment of the impact of lossily compressed quality scores in RNA sequence data on the performance of some of the most recent tools used for differential gene expression.

Keywords

    differential gene expression, Lossy compression, RNA-seq

ASJC Scopus subject areas

Cite this

Differential gene expression with lossy compression of quality scores in RNA-seq data. / Hernandez-Lopez, Ana A.; Voges, Jan; Alberti, Claudio et al.
Proceedings - DCC 2017: 2017 Data Compression Conference. ed. / Ali Bilgin; Joan Serra-Sagrista; Michael W. Marcellin; James A. Storer. Institute of Electrical and Electronics Engineers Inc., 2017. p. 444 7923727 (Data Compression Conference Proceedings; Vol. Part F127767).

Research output: Chapter in book/report/conference proceedingConference abstractResearchpeer review

Hernandez-Lopez, AA, Voges, J, Alberti, C, Mattavelli, M & Ostermann, J 2017, Differential gene expression with lossy compression of quality scores in RNA-seq data. in A Bilgin, J Serra-Sagrista, MW Marcellin & JA Storer (eds), Proceedings - DCC 2017: 2017 Data Compression Conference., 7923727, Data Compression Conference Proceedings, vol. Part F127767, Institute of Electrical and Electronics Engineers Inc., pp. 444, 2017 Data Compression Conference, DCC 2017, Snowbird, United States, 4 Apr 2017. https://doi.org/10.1109/dcc.2017.75
Hernandez-Lopez, A. A., Voges, J., Alberti, C., Mattavelli, M., & Ostermann, J. (2017). Differential gene expression with lossy compression of quality scores in RNA-seq data. In A. Bilgin, J. Serra-Sagrista, M. W. Marcellin, & J. A. Storer (Eds.), Proceedings - DCC 2017: 2017 Data Compression Conference (pp. 444). Article 7923727 (Data Compression Conference Proceedings; Vol. Part F127767). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/dcc.2017.75
Hernandez-Lopez AA, Voges J, Alberti C, Mattavelli M, Ostermann J. Differential gene expression with lossy compression of quality scores in RNA-seq data. In Bilgin A, Serra-Sagrista J, Marcellin MW, Storer JA, editors, Proceedings - DCC 2017: 2017 Data Compression Conference. Institute of Electrical and Electronics Engineers Inc. 2017. p. 444. 7923727. (Data Compression Conference Proceedings). doi: 10.1109/dcc.2017.75
Hernandez-Lopez, Ana A. ; Voges, Jan ; Alberti, Claudio et al. / Differential gene expression with lossy compression of quality scores in RNA-seq data. Proceedings - DCC 2017: 2017 Data Compression Conference. editor / Ali Bilgin ; Joan Serra-Sagrista ; Michael W. Marcellin ; James A. Storer. Institute of Electrical and Electronics Engineers Inc., 2017. pp. 444 (Data Compression Conference Proceedings).
Download
@inbook{a10c6f6f91e24545bf3c0d2bceb4c726,
title = "Differential gene expression with lossy compression of quality scores in RNA-seq data",
abstract = "High-Throughput sequencing of RNA molecules has enabled the quantitative analysis of the expression of genes at the expense of storage space and processing power. To help alleviate these problems, lossy compression methods of the quality scores associated to RNA sequence data have recently been proposed, and the evaluation of their impact on downstream analysis is gaining attention. This work presents a first assessment of the impact of lossily compressed quality scores in RNA sequence data on the performance of some of the most recent tools used for differential gene expression.",
keywords = "differential gene expression, Lossy compression, RNA-seq",
author = "Hernandez-Lopez, {Ana A.} and Jan Voges and Claudio Alberti and Marco Mattavelli and J{\"o}rn Ostermann",
year = "2017",
month = may,
doi = "10.1109/dcc.2017.75",
language = "English",
series = "Data Compression Conference Proceedings",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "444",
editor = "Ali Bilgin and Joan Serra-Sagrista and Marcellin, {Michael W.} and Storer, {James A.}",
booktitle = "Proceedings - DCC 2017",
address = "United States",
note = "2017 Data Compression Conference, DCC 2017 ; Conference date: 04-04-2017 Through 07-04-2017",

}

Download

TY - CHAP

T1 - Differential gene expression with lossy compression of quality scores in RNA-seq data

AU - Hernandez-Lopez, Ana A.

AU - Voges, Jan

AU - Alberti, Claudio

AU - Mattavelli, Marco

AU - Ostermann, Jörn

PY - 2017/5

Y1 - 2017/5

N2 - High-Throughput sequencing of RNA molecules has enabled the quantitative analysis of the expression of genes at the expense of storage space and processing power. To help alleviate these problems, lossy compression methods of the quality scores associated to RNA sequence data have recently been proposed, and the evaluation of their impact on downstream analysis is gaining attention. This work presents a first assessment of the impact of lossily compressed quality scores in RNA sequence data on the performance of some of the most recent tools used for differential gene expression.

AB - High-Throughput sequencing of RNA molecules has enabled the quantitative analysis of the expression of genes at the expense of storage space and processing power. To help alleviate these problems, lossy compression methods of the quality scores associated to RNA sequence data have recently been proposed, and the evaluation of their impact on downstream analysis is gaining attention. This work presents a first assessment of the impact of lossily compressed quality scores in RNA sequence data on the performance of some of the most recent tools used for differential gene expression.

KW - differential gene expression

KW - Lossy compression

KW - RNA-seq

UR - http://www.scopus.com/inward/record.url?scp=85019968467&partnerID=8YFLogxK

U2 - 10.1109/dcc.2017.75

DO - 10.1109/dcc.2017.75

M3 - Conference abstract

AN - SCOPUS:85019968467

T3 - Data Compression Conference Proceedings

SP - 444

BT - Proceedings - DCC 2017

A2 - Bilgin, Ali

A2 - Serra-Sagrista, Joan

A2 - Marcellin, Michael W.

A2 - Storer, James A.

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2017 Data Compression Conference, DCC 2017

Y2 - 4 April 2017 through 7 April 2017

ER -

By the same author(s)