Deep learning-based intra prediction mode decision for HEVC

Thorsten Laude; Jörn Ostermann

doi:10.1109/pcs.2016.7906399

Details

Original language	English
Title of host publication	2016 Picture Coding Symposium
Subtitle	PCS 2016
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (electronic)	9781509059669
Publication status	Published - Apr 2017
Event	2016 Picture Coding Symposium, PCS 2016 - Nuremberg, Germany. Duration: 4 Dec 2016 → 7 Dec 2016

Publication series

Name	2016 Picture Coding Symposium, PCS 2016

Abstract

The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.
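
The contrast the abstract draws between the two encoder controls can be illustrated with a minimal Python sketch. It is not the authors' implementation: the exhaustive loop is a simplification of a real encoder's search, the Lagrangian cost J = D + λR is the textbook form of RD optimization, and every name below (`rd_mode_decision`, `classifier_mode_decision`, the `toy_*` helpers) is hypothetical. The toy classifier is a stand-in for a trained CNN and only mimics its interface: original block samples in, one score per intra mode out.

```python
from typing import Callable, List, Sequence

NUM_INTRA_MODES = 35  # HEVC intra modes: planar (0), DC (1), angular (2..34)

Block = Sequence[int]  # flattened original sample values of one block


def rd_mode_decision(block: Block,
                     distortion: Callable[[Block, int], float],
                     rate: Callable[[Block, int], float],
                     lam: float) -> int:
    """Exhaustive control: test every mode, keep the minimizer of J = D + lam * R."""
    return min(range(NUM_INTRA_MODES),
               key=lambda m: distortion(block, m) + lam * rate(block, m))


def classifier_mode_decision(block: Block,
                             classifier: Callable[[Block], List[float]]) -> int:
    """Learned control: one pass over the original samples, then argmax."""
    scores = classifier(block)
    return max(range(len(scores)), key=lambda m: scores[m])


# --- Toy stand-ins (hypothetical, for illustration only) ---
calls = {"rd": 0, "cnn": 0}  # counts how often each control does its costly step


def toy_best_mode(block: Block) -> int:
    # Arbitrary deterministic "ground truth" so both controls can agree.
    return sum(block) % NUM_INTRA_MODES


def toy_distortion(block: Block, mode: int) -> float:
    calls["rd"] += 1  # one trial encoding per candidate mode
    return float(abs(mode - toy_best_mode(block)))


def toy_rate(block: Block, mode: int) -> float:
    return 1.0  # constant rate keeps the toy example simple


def toy_classifier(block: Block) -> List[float]:
    calls["cnn"] += 1  # one forward pass per block
    best = toy_best_mode(block)
    return [-float(abs(m - best)) for m in range(NUM_INTRA_MODES)]


block = [12, 40, 7, 93]
rd_mode = rd_mode_decision(block, toy_distortion, toy_rate, lam=0.5)
cnn_mode = classifier_mode_decision(block, toy_classifier)
print(rd_mode, cnn_mode, calls)
```

In this toy setup both controls select the same mode, but the RD loop evaluates all 35 candidates while the classifier is queried once per block. Because the classifier sees only original samples, it does not have to wait for neighbouring blocks to be reconstructed, which is the property the abstract links to the absence of additional systematic latency.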

ASJC Scopus subject areas

General Engineering
Media Technology
General Computer Science
Signal Processing

Cite

Deep learning-based intra prediction mode decision for HEVC. / Laude, Thorsten; Ostermann, Jörn.
2016 Picture Coding Symposium: PCS 2016. Institute of Electrical and Electronics Engineers Inc., 2017. 7906399 (2016 Picture Coding Symposium, PCS 2016).

Publication: Chapter in book/report/conference proceeding › Conference contribution › Research › Peer-reviewed

Laude, T & Ostermann, J 2017, Deep learning-based intra prediction mode decision for HEVC. in 2016 Picture Coding Symposium: PCS 2016., 7906399, 2016 Picture Coding Symposium, PCS 2016, Institute of Electrical and Electronics Engineers Inc., 2016 Picture Coding Symposium, PCS 2016, Nuremberg, Germany, 4 Dec 2016. https://doi.org/10.1109/pcs.2016.7906399

Laude, T., & Ostermann, J. (2017). Deep learning-based intra prediction mode decision for HEVC. In 2016 Picture Coding Symposium: PCS 2016 Article 7906399 (2016 Picture Coding Symposium, PCS 2016). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/pcs.2016.7906399

Laude T, Ostermann J. Deep learning-based intra prediction mode decision for HEVC. in 2016 Picture Coding Symposium: PCS 2016. Institute of Electrical and Electronics Engineers Inc. 2017. 7906399. (2016 Picture Coding Symposium, PCS 2016). doi: 10.1109/pcs.2016.7906399

Laude, Thorsten ; Ostermann, Jörn. / Deep learning-based intra prediction mode decision for HEVC. 2016 Picture Coding Symposium: PCS 2016. Institute of Electrical and Electronics Engineers Inc., 2017. (2016 Picture Coding Symposium, PCS 2016).

@inproceedings{5a9261662afa4a07a64f5abfd5c5d548,

title = "Deep learning-based intra prediction mode decision for HEVC",

abstract = "The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.",

author = "Thorsten Laude and J{\"o}rn Ostermann",

year = "2017",

month = apr,

doi = "10.1109/pcs.2016.7906399",

language = "English",

series = "2016 Picture Coding Symposium, PCS 2016",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

booktitle = "2016 Picture Coding Symposium",

address = "United States",

note = "2016 Picture Coding Symposium, PCS 2016 ; Conference date: 04-12-2016 Through 07-12-2016",

}

TY - GEN

T1 - Deep learning-based intra prediction mode decision for HEVC

AU - Laude, Thorsten

AU - Ostermann, Jörn

PY - 2017/4

Y1 - 2017/4

N2 - The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.

AB - The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.

UR - http://www.scopus.com/inward/record.url?scp=85019423425&partnerID=8YFLogxK

U2 - 10.1109/pcs.2016.7906399

DO - 10.1109/pcs.2016.7906399

M3 - Conference contribution

AN - SCOPUS:85019423425

T3 - 2016 Picture Coding Symposium, PCS 2016

BT - 2016 Picture Coding Symposium

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2016 Picture Coding Symposium, PCS 2016

Y2 - 4 December 2016 through 7 December 2016

ER -

By the same authors

MaskCRT: Masked Conditional Residual Transformer for Learned Video Compression

Acoustic Emission Detection in Noisy Environments using Linear Prediction

Genie: the first open-source ISO/IEC encoder for genomic data

On the Rate-Distortion-Complexity Trade-Offs of Neural Video Coding

Self-supervised domain adaptation for machinery remaining useful life prediction