Deep learning-based intra prediction mode decision for HEVC

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Autorschaft

Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des Sammelwerks2016 Picture Coding Symposium
UntertitelPCS 2016
Herausgeber (Verlag)Institute of Electrical and Electronics Engineers Inc.
ISBN (elektronisch)9781509059669
PublikationsstatusVeröffentlicht - Apr. 2017
Veranstaltung2016 Picture Coding Symposium, PCS 2016 - Nuremberg, Deutschland
Dauer: 4 Dez. 20167 Dez. 2016

Publikationsreihe

Name2016 Picture Coding Symposium, PCS 2016

Abstract

The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.

ASJC Scopus Sachgebiete

Zitieren

Deep learning-based intra prediction mode decision for HEVC. / Laude, Thorsten; Ostermann, Jörn.
2016 Picture Coding Symposium: PCS 2016. Institute of Electrical and Electronics Engineers Inc., 2017. 7906399 (2016 Picture Coding Symposium, PCS 2016).

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Laude, T & Ostermann, J 2017, Deep learning-based intra prediction mode decision for HEVC. in 2016 Picture Coding Symposium: PCS 2016., 7906399, 2016 Picture Coding Symposium, PCS 2016, Institute of Electrical and Electronics Engineers Inc., 2016 Picture Coding Symposium, PCS 2016, Nuremberg, Deutschland, 4 Dez. 2016. https://doi.org/10.1109/pcs.2016.7906399
Laude, T., & Ostermann, J. (2017). Deep learning-based intra prediction mode decision for HEVC. In 2016 Picture Coding Symposium: PCS 2016 Artikel 7906399 (2016 Picture Coding Symposium, PCS 2016). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/pcs.2016.7906399
Laude T, Ostermann J. Deep learning-based intra prediction mode decision for HEVC. in 2016 Picture Coding Symposium: PCS 2016. Institute of Electrical and Electronics Engineers Inc. 2017. 7906399. (2016 Picture Coding Symposium, PCS 2016). doi: 10.1109/pcs.2016.7906399
Laude, Thorsten ; Ostermann, Jörn. / Deep learning-based intra prediction mode decision for HEVC. 2016 Picture Coding Symposium: PCS 2016. Institute of Electrical and Electronics Engineers Inc., 2017. (2016 Picture Coding Symposium, PCS 2016).
Download
@inproceedings{5a9261662afa4a07a64f5abfd5c5d548,
title = "Deep learning-based intra prediction mode decision for HEVC",
abstract = "The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.",
author = "Thorsten Laude and J{\"o}rn Ostermann",
year = "2017",
month = apr,
doi = "10.1109/pcs.2016.7906399",
language = "English",
series = "2016 Picture Coding Symposium, PCS 2016",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
booktitle = "2016 Picture Coding Symposium",
address = "United States",
note = "2016 Picture Coding Symposium, PCS 2016 ; Conference date: 04-12-2016 Through 07-12-2016",

}

Download

TY - GEN

T1 - Deep learning-based intra prediction mode decision for HEVC

AU - Laude, Thorsten

AU - Ostermann, Jörn

PY - 2017/4

Y1 - 2017/4

N2 - The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.

AB - The High Efficiency Video Coding standard and its screen content coding extension provide superior coding efficiency compared to predecessor standards. However, this coding efficiency is achieved at the expense of very complex encoders. One major complexity driver is the comprehensive rate distortion (RD) optimization. In this paper, we present a deep learning-based encoder control which replaces the conventional RD optimization for the intra prediction mode with deep convolutional neural network (CNN) classifiers. Thereby, we save the RD optimization complexity. Our classifiers operate independently of any encoder decisions and reconstructed sample values. Thus, no additional systematic latency is introduced. Furthermore, the loss in coding efficiency is negligible with an average value of 0.52% over HM-16.6+SCM-5.2.

UR - http://www.scopus.com/inward/record.url?scp=85019423425&partnerID=8YFLogxK

U2 - 10.1109/pcs.2016.7906399

DO - 10.1109/pcs.2016.7906399

M3 - Conference contribution

AN - SCOPUS:85019423425

T3 - 2016 Picture Coding Symposium, PCS 2016

BT - 2016 Picture Coding Symposium

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2016 Picture Coding Symposium, PCS 2016

Y2 - 4 December 2016 through 7 December 2016

ER -

Von denselben Autoren