Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschung

Autoren

  • Marcel Martin Nophut
  • Robert Johannes Hupke
  • Stephan Preihs
  • Jürgen Karl Peissig

Organisationseinheiten

Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des SammelwerksFortschritte der Akustik - DAGA 2018
Untertitel44. Deutsche Jahrestagung für Akustik
PublikationsstatusVeröffentlicht - 2018
VeranstaltungDAGA 2018: 44. JAHRESTAGUNG FÜR AKUSTIK - München, Deutschland
Dauer: 19 März 201822 März 2018

Abstract

In augmented reality audio applications a superposition of environmental sounds and supplementary audio content is used to create auditory enhancements for the listener in a broad range of use cases. In some use cases environmental sounds and supplementary content may be highly correlated, for example in audience services at live events, where a live playback through PA speakers is enhanced by augmented reality audio content, e.g. to create an individualized live mix. Without temporal alignment of those signals a superposition causes comb filtering effects or confusing echoes.This contribution proposes an efficient method that is able to robustly detect a temporal offset of correlated audio signals. It is based on a recursive cross-correlation estimation and a peak detection algorithm. The method focuses on indoor music and speech events with their typically occurring problems like room reflections, crosstalk, tonal components and a large number of correlation lags. The obtained temporal offset is used to delay the supplementary audio content in order to achieve a temporal alignment of the signals.

Zitieren

Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. / Nophut, Marcel Martin; Hupke, Robert Johannes; Preihs, Stephan et al.
Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik. 2018.

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschung

Nophut, MM, Hupke, RJ, Preihs, S & Peissig, JK 2018, Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. in Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik. DAGA 2018, München, Deutschland, 19 März 2018. <https://www.dega-akustik.de/publikationen/online-proceedings>
Nophut, M. M., Hupke, R. J., Preihs, S., & Peissig, J. K. (2018). Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. In Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik https://www.dega-akustik.de/publikationen/online-proceedings
Nophut MM, Hupke RJ, Preihs S, Peissig JK. Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. in Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik. 2018
Nophut, Marcel Martin ; Hupke, Robert Johannes ; Preihs, Stephan et al. / Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik. 2018.
Download
@inproceedings{ea0bfec1792547ed80127fc2de463d73,
title = "Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications",
abstract = "In augmented reality audio applications a superposition of environmental sounds and supplementary audio content is used to create auditory enhancements for the listener in a broad range of use cases. In some use cases environmental sounds and supplementary content may be highly correlated, for example in audience services at live events, where a live playback through PA speakers is enhanced by augmented reality audio content, e.g. to create an individualized live mix. Without temporal alignment of those signals a superposition causes comb filtering effects or confusing echoes.This contribution proposes an efficient method that is able to robustly detect a temporal offset of correlated audio signals. It is based on a recursive cross-correlation estimation and a peak detection algorithm. The method focuses on indoor music and speech events with their typically occurring problems like room reflections, crosstalk, tonal components and a large number of correlation lags. The obtained temporal offset is used to delay the supplementary audio content in order to achieve a temporal alignment of the signals.",
author = "Nophut, {Marcel Martin} and Hupke, {Robert Johannes} and Stephan Preihs and Peissig, {J{\"u}rgen Karl}",
year = "2018",
language = "English",
booktitle = "Fortschritte der Akustik - DAGA 2018",
note = "DAGA 2018 : 44. JAHRESTAGUNG F{\"U}R AKUSTIK ; Conference date: 19-03-2018 Through 22-03-2018",

}

Download

TY - GEN

T1 - Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications

AU - Nophut, Marcel Martin

AU - Hupke, Robert Johannes

AU - Preihs, Stephan

AU - Peissig, Jürgen Karl

PY - 2018

Y1 - 2018

N2 - In augmented reality audio applications a superposition of environmental sounds and supplementary audio content is used to create auditory enhancements for the listener in a broad range of use cases. In some use cases environmental sounds and supplementary content may be highly correlated, for example in audience services at live events, where a live playback through PA speakers is enhanced by augmented reality audio content, e.g. to create an individualized live mix. Without temporal alignment of those signals a superposition causes comb filtering effects or confusing echoes.This contribution proposes an efficient method that is able to robustly detect a temporal offset of correlated audio signals. It is based on a recursive cross-correlation estimation and a peak detection algorithm. The method focuses on indoor music and speech events with their typically occurring problems like room reflections, crosstalk, tonal components and a large number of correlation lags. The obtained temporal offset is used to delay the supplementary audio content in order to achieve a temporal alignment of the signals.

AB - In augmented reality audio applications a superposition of environmental sounds and supplementary audio content is used to create auditory enhancements for the listener in a broad range of use cases. In some use cases environmental sounds and supplementary content may be highly correlated, for example in audience services at live events, where a live playback through PA speakers is enhanced by augmented reality audio content, e.g. to create an individualized live mix. Without temporal alignment of those signals a superposition causes comb filtering effects or confusing echoes.This contribution proposes an efficient method that is able to robustly detect a temporal offset of correlated audio signals. It is based on a recursive cross-correlation estimation and a peak detection algorithm. The method focuses on indoor music and speech events with their typically occurring problems like room reflections, crosstalk, tonal components and a large number of correlation lags. The obtained temporal offset is used to delay the supplementary audio content in order to achieve a temporal alignment of the signals.

M3 - Conference contribution

BT - Fortschritte der Akustik - DAGA 2018

T2 - DAGA 2018

Y2 - 19 March 2018 through 22 March 2018

ER -