Enhanced Machine Learning-based Inter Coding for VVC.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

Research Organisations

View graph of relations

Details

Original languageEnglish
Title of host publicationICAIIC
Pages21-25
Number of pages5
ISBN (electronic)978-1-7281-7638-3
Publication statusPublished - 2021

Abstract

In this paper, we propose an enhanced machine learning-based inter coding algorithm for VVC. Conceptually, the reference pictures from the decoded picture butter are processed using a recurrent neural network to generate an artificial reference picture at the time instance of the currently coded picture. The network is trained using a SATD cost function to minimize the bit rate cost for the prediction error rather than the pixel-wise difference. By this we achieved average weighted BD-rate gains of 0.94%. The coding time increased about 5% for the encoder and 300% for the decoder due to the use of a neural network.

Keywords

    inter coding, machine learning, recurrent neural networks, video coding, VVC

ASJC Scopus subject areas

Cite this

Enhanced Machine Learning-based Inter Coding for VVC. / Benjak, Martin; Meuel, Holger; Laude, Thorsten et al.
ICAIIC. 2021. p. 21-25 9415184.

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Benjak, M., Meuel, H., Laude, T., & Ostermann, J. (2021). Enhanced Machine Learning-based Inter Coding for VVC. In ICAIIC (pp. 21-25). Article 9415184 https://doi.org/10.1109/ICAIIC51459.2021.9415184
Benjak M, Meuel H, Laude T, Ostermann J. Enhanced Machine Learning-based Inter Coding for VVC. In ICAIIC. 2021. p. 21-25. 9415184 doi: 10.1109/ICAIIC51459.2021.9415184
Benjak, Martin ; Meuel, Holger ; Laude, Thorsten et al. / Enhanced Machine Learning-based Inter Coding for VVC. ICAIIC. 2021. pp. 21-25
Download
@inproceedings{556979d8c4d84934b90ca09ca7f21e5c,
title = "Enhanced Machine Learning-based Inter Coding for VVC.",
abstract = "In this paper, we propose an enhanced machine learning-based inter coding algorithm for VVC. Conceptually, the reference pictures from the decoded picture butter are processed using a recurrent neural network to generate an artificial reference picture at the time instance of the currently coded picture. The network is trained using a SATD cost function to minimize the bit rate cost for the prediction error rather than the pixel-wise difference. By this we achieved average weighted BD-rate gains of 0.94%. The coding time increased about 5% for the encoder and 300% for the decoder due to the use of a neural network.",
keywords = "inter coding, machine learning, recurrent neural networks, video coding, VVC",
author = "Martin Benjak and Holger Meuel and Thorsten Laude and J{\"o}rn Ostermann",
year = "2021",
doi = "10.1109/ICAIIC51459.2021.9415184",
language = "English",
isbn = "978-1-7281-7639-0",
pages = "21--25",
booktitle = "ICAIIC",

}

Download

TY - GEN

T1 - Enhanced Machine Learning-based Inter Coding for VVC.

AU - Benjak, Martin

AU - Meuel, Holger

AU - Laude, Thorsten

AU - Ostermann, Jörn

PY - 2021

Y1 - 2021

N2 - In this paper, we propose an enhanced machine learning-based inter coding algorithm for VVC. Conceptually, the reference pictures from the decoded picture butter are processed using a recurrent neural network to generate an artificial reference picture at the time instance of the currently coded picture. The network is trained using a SATD cost function to minimize the bit rate cost for the prediction error rather than the pixel-wise difference. By this we achieved average weighted BD-rate gains of 0.94%. The coding time increased about 5% for the encoder and 300% for the decoder due to the use of a neural network.

AB - In this paper, we propose an enhanced machine learning-based inter coding algorithm for VVC. Conceptually, the reference pictures from the decoded picture butter are processed using a recurrent neural network to generate an artificial reference picture at the time instance of the currently coded picture. The network is trained using a SATD cost function to minimize the bit rate cost for the prediction error rather than the pixel-wise difference. By this we achieved average weighted BD-rate gains of 0.94%. The coding time increased about 5% for the encoder and 300% for the decoder due to the use of a neural network.

KW - inter coding

KW - machine learning

KW - recurrent neural networks

KW - video coding

KW - VVC

UR - http://www.scopus.com/inward/record.url?scp=85105441407&partnerID=8YFLogxK

U2 - 10.1109/ICAIIC51459.2021.9415184

DO - 10.1109/ICAIIC51459.2021.9415184

M3 - Conference contribution

SN - 978-1-7281-7639-0

SP - 21

EP - 25

BT - ICAIIC

ER -

By the same author(s)