Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications

Marcel Martin Nophut; Robert Johannes Hupke; Stephan Preihs; Jürgen Karl Peissig

Details

Original language	English
Title of host publication	Fortschritte der Akustik - DAGA 2018
Subtitle of host publication	44. Deutsche Jahrestagung für Akustik
Publication status	Published - 2018
Event	DAGA 2018: 44. JAHRESTAGUNG FÜR AKUSTIK - München, Germany Duration: 19 Mar 2018 → 22 Mar 2018

Abstract

In augmented reality audio applications a superposition of environmental sounds and supplementary audio content is used to create auditory enhancements for the listener in a broad range of use cases. In some use cases environmental sounds and supplementary content may be highly correlated, for example in audience services at live events, where a live playback through PA speakers is enhanced by augmented reality audio content, e.g. to create an individualized live mix. Without temporal alignment of those signals a superposition causes comb filtering effects or confusing echoes.This contribution proposes an efficient method that is able to robustly detect a temporal offset of correlated audio signals. It is based on a recursive cross-correlation estimation and a peak detection algorithm. The method focuses on indoor music and speech events with their typically occurring problems like room reflections, crosstalk, tonal components and a large number of correlation lags. The obtained temporal offset is used to delay the supplementary audio content in order to achieve a temporal alignment of the signals.

Cite this

Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. / Nophut, Marcel Martin; Hupke, Robert Johannes; Preihs, Stephan et al.
Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik. 2018.

Research output: Chapter in book/report/conference proceeding › Conference contribution › Research

Nophut, MM, Hupke, RJ, Preihs, S & Peissig, JK 2018, Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. in Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik. DAGA 2018, München, Germany, 19 Mar 2018. <https://www.dega-akustik.de/publikationen/online-proceedings>

Nophut, M. M., Hupke, R. J., Preihs, S., & Peissig, J. K. (2018). Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. In Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik https://www.dega-akustik.de/publikationen/online-proceedings

Nophut MM, Hupke RJ, Preihs S, Peissig JK. Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. In Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik. 2018

Nophut, Marcel Martin ; Hupke, Robert Johannes ; Preihs, Stephan et al. / Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik. 2018.

Download

@inproceedings{ea0bfec1792547ed80127fc2de463d73,

title = "Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications",

abstract = "In augmented reality audio applications a superposition of environmental sounds and supplementary audio content is used to create auditory enhancements for the listener in a broad range of use cases. In some use cases environmental sounds and supplementary content may be highly correlated, for example in audience services at live events, where a live playback through PA speakers is enhanced by augmented reality audio content, e.g. to create an individualized live mix. Without temporal alignment of those signals a superposition causes comb filtering effects or confusing echoes.This contribution proposes an efficient method that is able to robustly detect a temporal offset of correlated audio signals. It is based on a recursive cross-correlation estimation and a peak detection algorithm. The method focuses on indoor music and speech events with their typically occurring problems like room reflections, crosstalk, tonal components and a large number of correlation lags. The obtained temporal offset is used to delay the supplementary audio content in order to achieve a temporal alignment of the signals.",

author = "Nophut, {Marcel Martin} and Hupke, {Robert Johannes} and Stephan Preihs and Peissig, {J{\"u}rgen Karl}",

year = "2018",

language = "English",

booktitle = "Fortschritte der Akustik - DAGA 2018",

note = "DAGA 2018 : 44. JAHRESTAGUNG F{\"U}R AKUSTIK ; Conference date: 19-03-2018 Through 22-03-2018",

}

Download

TY - GEN

T1 - Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications

AU - Nophut, Marcel Martin

AU - Hupke, Robert Johannes

AU - Preihs, Stephan

AU - Peissig, Jürgen Karl

PY - 2018

Y1 - 2018

N2 - In augmented reality audio applications a superposition of environmental sounds and supplementary audio content is used to create auditory enhancements for the listener in a broad range of use cases. In some use cases environmental sounds and supplementary content may be highly correlated, for example in audience services at live events, where a live playback through PA speakers is enhanced by augmented reality audio content, e.g. to create an individualized live mix. Without temporal alignment of those signals a superposition causes comb filtering effects or confusing echoes.This contribution proposes an efficient method that is able to robustly detect a temporal offset of correlated audio signals. It is based on a recursive cross-correlation estimation and a peak detection algorithm. The method focuses on indoor music and speech events with their typically occurring problems like room reflections, crosstalk, tonal components and a large number of correlation lags. The obtained temporal offset is used to delay the supplementary audio content in order to achieve a temporal alignment of the signals.

AB - In augmented reality audio applications a superposition of environmental sounds and supplementary audio content is used to create auditory enhancements for the listener in a broad range of use cases. In some use cases environmental sounds and supplementary content may be highly correlated, for example in audience services at live events, where a live playback through PA speakers is enhanced by augmented reality audio content, e.g. to create an individualized live mix. Without temporal alignment of those signals a superposition causes comb filtering effects or confusing echoes.This contribution proposes an efficient method that is able to robustly detect a temporal offset of correlated audio signals. It is based on a recursive cross-correlation estimation and a peak detection algorithm. The method focuses on indoor music and speech events with their typically occurring problems like room reflections, crosstalk, tonal components and a large number of correlation lags. The obtained temporal offset is used to delay the supplementary audio content in order to achieve a temporal alignment of the signals.

M3 - Conference contribution

BT - Fortschritte der Akustik - DAGA 2018

T2 - DAGA 2018

Y2 - 19 March 2018 through 22 March 2018

ER -

Research@Leibniz University

Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications

Authors

Research Organisations

Details

Abstract

Cite this