Mesh-based piecewise planar motion compensation and optical flow clustering for ROI coding

Holger Meuel; Marco Munderloh; Matthias Reso; Jörn Ostermann

doi:10.1017/ATSIP.2015.12

Details

Original language	English
Article number	1017
Journal	APSIPA Transactions on Signal and Information Processing
Volume	4
Issue number	e13
Publication status	Published - 2 Oct 2015

Abstract

For the transmission of aerial surveillance videos taken from unmanned aerial vehicles (UAVs), region of interest (ROI)-based coding systems are of growing interest in order to cope with the limited channel capacities available. We present a fully automatic detection and coding system which is capable of transmitting high-resolution aerial surveillance videos at very low bit rates. Our coding system is based on the transmission of ROI areas only. We assume two different kinds of ROIs: in order to limit the transmission bit rate while simultaneously retaining a high-quality view of the ground, we only transmit new emerging areas (ROI-NA) for each frame instead of the entire frame. At the decoder side, the surface of the earth is reconstructed from transmitted ROI-NA by means of global motion compensation (GMC). In order to retain the movement of moving objects not conforming with the motion of the ground (like moving cars and their previously occluded ground), we additionally consider regions containing such objects as interesting (ROI-MO). Finally, both ROIs are used as input to an externally controlled video encoder. While we use GMC for the reconstruction of the ground from ROI-NA, we use meshed-based motion compensation in order to generate the pelwise difference in the luminance channel (difference image) between the mesh-based motion compensated and the current input image to detect the ROI-MO. High spots of energy within this difference image are used as seeds to select corresponding superpixels from an independent (temporally consistent) superpixel segmentation of the input image in order to obtain accurate shape information of ROI-MO. For a false positive detection rate (regions falsely classified as containing local motion) of less than 2% we detect more than 97% true positives (correctly detected ROI-MOs) in challenging scenarios. Furthermore, we propose to use a modified high-efficiency video coding (HEVC) video encoder. Retaining full HDTV video resolution at 30 fps and subjectively high quality we achieve bit rates of about 0.6-0.9 Mbit/s, which is a bit rate saving of about 90% compared to an unmodified HEVC encoder.

Keywords

Low bit rate HDTV video coding, Mesh-based motion compensation, Moving object detection, Region of interest ROI coding, Superpixel segmentation

ASJC Scopus subject areas

Computer Science(all)
Signal Processing
Computer Science(all)
Information Systems

Cite this

Mesh-based piecewise planar motion compensation and optical flow clustering for ROI coding. / Meuel, Holger; Munderloh, Marco; Reso, Matthias et al.
In: APSIPA Transactions on Signal and Information Processing, Vol. 4, No. e13, 1017, 02.10.2015.

Research output: Contribution to journal › Article › Research › peer review

Meuel, H, Munderloh, M, Reso, M & Ostermann, J 2015, 'Mesh-based piecewise planar motion compensation and optical flow clustering for ROI coding', APSIPA Transactions on Signal and Information Processing, vol. 4, no. e13, 1017. https://doi.org/10.1017/ATSIP.2015.12

Meuel, H., Munderloh, M., Reso, M., & Ostermann, J. (2015). Mesh-based piecewise planar motion compensation and optical flow clustering for ROI coding. APSIPA Transactions on Signal and Information Processing, 4(e13), Article 1017. https://doi.org/10.1017/ATSIP.2015.12

Meuel H, Munderloh M, Reso M, Ostermann J. Mesh-based piecewise planar motion compensation and optical flow clustering for ROI coding. APSIPA Transactions on Signal and Information Processing. 2015 Oct 2;4(e13):1017. doi: 10.1017/ATSIP.2015.12

Meuel, Holger ; Munderloh, Marco ; Reso, Matthias et al. / Mesh-based piecewise planar motion compensation and optical flow clustering for ROI coding. In: APSIPA Transactions on Signal and Information Processing. 2015 ; Vol. 4, No. e13.

Download

@article{10ba539cd2af4786b5ab973db13e47ce,

title = "Mesh-based piecewise planar motion compensation and optical flow clustering for ROI coding",

abstract = "For the transmission of aerial surveillance videos taken from unmanned aerial vehicles (UAVs), region of interest (ROI)-based coding systems are of growing interest in order to cope with the limited channel capacities available. We present a fully automatic detection and coding system which is capable of transmitting high-resolution aerial surveillance videos at very low bit rates. Our coding system is based on the transmission of ROI areas only. We assume two different kinds of ROIs: in order to limit the transmission bit rate while simultaneously retaining a high-quality view of the ground, we only transmit new emerging areas (ROI-NA) for each frame instead of the entire frame. At the decoder side, the surface of the earth is reconstructed from transmitted ROI-NA by means of global motion compensation (GMC). In order to retain the movement of moving objects not conforming with the motion of the ground (like moving cars and their previously occluded ground), we additionally consider regions containing such objects as interesting (ROI-MO). Finally, both ROIs are used as input to an externally controlled video encoder. While we use GMC for the reconstruction of the ground from ROI-NA, we use meshed-based motion compensation in order to generate the pelwise difference in the luminance channel (difference image) between the mesh-based motion compensated and the current input image to detect the ROI-MO. High spots of energy within this difference image are used as seeds to select corresponding superpixels from an independent (temporally consistent) superpixel segmentation of the input image in order to obtain accurate shape information of ROI-MO. For a false positive detection rate (regions falsely classified as containing local motion) of less than 2% we detect more than 97% true positives (correctly detected ROI-MOs) in challenging scenarios. Furthermore, we propose to use a modified high-efficiency video coding (HEVC) video encoder. Retaining full HDTV video resolution at 30 fps and subjectively high quality we achieve bit rates of about 0.6-0.9 Mbit/s, which is a bit rate saving of about 90% compared to an unmodified HEVC encoder.",

keywords = "Low bit rate HDTV video coding, Mesh-based motion compensation, Moving object detection, Region of interest ROI coding, Superpixel segmentation",

author = "Holger Meuel and Marco Munderloh and Matthias Reso and J{\"o}rn Ostermann",

year = "2015",

month = oct,

day = "2",

doi = "10.1017/ATSIP.2015.12",

language = "English",

volume = "4",

number = "e13",

}

Download

TY - JOUR

T1 - Mesh-based piecewise planar motion compensation and optical flow clustering for ROI coding

AU - Meuel, Holger

AU - Munderloh, Marco

AU - Reso, Matthias

AU - Ostermann, Jörn

PY - 2015/10/2

Y1 - 2015/10/2

N2 - For the transmission of aerial surveillance videos taken from unmanned aerial vehicles (UAVs), region of interest (ROI)-based coding systems are of growing interest in order to cope with the limited channel capacities available. We present a fully automatic detection and coding system which is capable of transmitting high-resolution aerial surveillance videos at very low bit rates. Our coding system is based on the transmission of ROI areas only. We assume two different kinds of ROIs: in order to limit the transmission bit rate while simultaneously retaining a high-quality view of the ground, we only transmit new emerging areas (ROI-NA) for each frame instead of the entire frame. At the decoder side, the surface of the earth is reconstructed from transmitted ROI-NA by means of global motion compensation (GMC). In order to retain the movement of moving objects not conforming with the motion of the ground (like moving cars and their previously occluded ground), we additionally consider regions containing such objects as interesting (ROI-MO). Finally, both ROIs are used as input to an externally controlled video encoder. While we use GMC for the reconstruction of the ground from ROI-NA, we use meshed-based motion compensation in order to generate the pelwise difference in the luminance channel (difference image) between the mesh-based motion compensated and the current input image to detect the ROI-MO. High spots of energy within this difference image are used as seeds to select corresponding superpixels from an independent (temporally consistent) superpixel segmentation of the input image in order to obtain accurate shape information of ROI-MO. For a false positive detection rate (regions falsely classified as containing local motion) of less than 2% we detect more than 97% true positives (correctly detected ROI-MOs) in challenging scenarios. Furthermore, we propose to use a modified high-efficiency video coding (HEVC) video encoder. Retaining full HDTV video resolution at 30 fps and subjectively high quality we achieve bit rates of about 0.6-0.9 Mbit/s, which is a bit rate saving of about 90% compared to an unmodified HEVC encoder.

AB - For the transmission of aerial surveillance videos taken from unmanned aerial vehicles (UAVs), region of interest (ROI)-based coding systems are of growing interest in order to cope with the limited channel capacities available. We present a fully automatic detection and coding system which is capable of transmitting high-resolution aerial surveillance videos at very low bit rates. Our coding system is based on the transmission of ROI areas only. We assume two different kinds of ROIs: in order to limit the transmission bit rate while simultaneously retaining a high-quality view of the ground, we only transmit new emerging areas (ROI-NA) for each frame instead of the entire frame. At the decoder side, the surface of the earth is reconstructed from transmitted ROI-NA by means of global motion compensation (GMC). In order to retain the movement of moving objects not conforming with the motion of the ground (like moving cars and their previously occluded ground), we additionally consider regions containing such objects as interesting (ROI-MO). Finally, both ROIs are used as input to an externally controlled video encoder. While we use GMC for the reconstruction of the ground from ROI-NA, we use meshed-based motion compensation in order to generate the pelwise difference in the luminance channel (difference image) between the mesh-based motion compensated and the current input image to detect the ROI-MO. High spots of energy within this difference image are used as seeds to select corresponding superpixels from an independent (temporally consistent) superpixel segmentation of the input image in order to obtain accurate shape information of ROI-MO. For a false positive detection rate (regions falsely classified as containing local motion) of less than 2% we detect more than 97% true positives (correctly detected ROI-MOs) in challenging scenarios. Furthermore, we propose to use a modified high-efficiency video coding (HEVC) video encoder. Retaining full HDTV video resolution at 30 fps and subjectively high quality we achieve bit rates of about 0.6-0.9 Mbit/s, which is a bit rate saving of about 90% compared to an unmodified HEVC encoder.

KW - Low bit rate HDTV video coding

KW - Mesh-based motion compensation

KW - Moving object detection

KW - Region of interest ROI coding

KW - Superpixel segmentation

UR - http://www.scopus.com/inward/record.url?scp=84944317475&partnerID=8YFLogxK

U2 - 10.1017/ATSIP.2015.12

DO - 10.1017/ATSIP.2015.12

M3 - Article

AN - SCOPUS:84944317475

VL - 4

JO - APSIPA Transactions on Signal and Information Processing

JF - APSIPA Transactions on Signal and Information Processing

IS - e13

M1 - 1017

ER -

Research@Leibniz University

Mesh-based piecewise planar motion compensation and optical flow clustering for ROI coding

Authors

Research Organisations

Details

Abstract

Keywords

ASJC Scopus subject areas

Cite this

By the same author(s)

Self-supervised domain adaptation for machinery remaining useful life prediction

Acoustic Emission Detection in Noisy Environments using Linear Prediction

Genie: the first open-source ISO/IEC encoder for genomic data

Matched Filter for Acoustic Emission Monitoring in Noisy Environments: Application to Wire Break Detection

Blind extraction of guitar effects through blind system inversion and neural guitar effect modeling