Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications

Research output: Chapter in book/report/conference proceedingConference contributionResearch

Authors

  • Marcel Martin Nophut
  • Robert Johannes Hupke
  • Stephan Preihs
  • Jürgen Karl Peissig
View graph of relations

Details

Original languageEnglish
Title of host publicationFortschritte der Akustik - DAGA 2018
Subtitle of host publication44. Deutsche Jahrestagung für Akustik
Publication statusPublished - 2018
EventDAGA 2018: 44. JAHRESTAGUNG FÜR AKUSTIK - München, Germany
Duration: 19 Mar 201822 Mar 2018

Abstract

In augmented reality audio applications a superposition of environmental sounds and supplementary audio content is used to create auditory enhancements for the listener in a broad range of use cases. In some use cases environmental sounds and supplementary content may be highly correlated, for example in audience services at live events, where a live playback through PA speakers is enhanced by augmented reality audio content, e.g. to create an individualized live mix. Without temporal alignment of those signals a superposition causes comb filtering effects or confusing echoes.This contribution proposes an efficient method that is able to robustly detect a temporal offset of correlated audio signals. It is based on a recursive cross-correlation estimation and a peak detection algorithm. The method focuses on indoor music and speech events with their typically occurring problems like room reflections, crosstalk, tonal components and a large number of correlation lags. The obtained temporal offset is used to delay the supplementary audio content in order to achieve a temporal alignment of the signals.

Cite this

Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. / Nophut, Marcel Martin; Hupke, Robert Johannes; Preihs, Stephan et al.
Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik. 2018.

Research output: Chapter in book/report/conference proceedingConference contributionResearch

Nophut, MM, Hupke, RJ, Preihs, S & Peissig, JK 2018, Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. in Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik. DAGA 2018, München, Germany, 19 Mar 2018. <https://www.dega-akustik.de/publikationen/online-proceedings>
Nophut, M. M., Hupke, R. J., Preihs, S., & Peissig, J. K. (2018). Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. In Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik https://www.dega-akustik.de/publikationen/online-proceedings
Nophut MM, Hupke RJ, Preihs S, Peissig JK. Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. In Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik. 2018
Nophut, Marcel Martin ; Hupke, Robert Johannes ; Preihs, Stephan et al. / Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications. Fortschritte der Akustik - DAGA 2018: 44. Deutsche Jahrestagung für Akustik. 2018.
Download
@inproceedings{ea0bfec1792547ed80127fc2de463d73,
title = "Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications",
abstract = "In augmented reality audio applications a superposition of environmental sounds and supplementary audio content is used to create auditory enhancements for the listener in a broad range of use cases. In some use cases environmental sounds and supplementary content may be highly correlated, for example in audience services at live events, where a live playback through PA speakers is enhanced by augmented reality audio content, e.g. to create an individualized live mix. Without temporal alignment of those signals a superposition causes comb filtering effects or confusing echoes.This contribution proposes an efficient method that is able to robustly detect a temporal offset of correlated audio signals. It is based on a recursive cross-correlation estimation and a peak detection algorithm. The method focuses on indoor music and speech events with their typically occurring problems like room reflections, crosstalk, tonal components and a large number of correlation lags. The obtained temporal offset is used to delay the supplementary audio content in order to achieve a temporal alignment of the signals.",
author = "Nophut, {Marcel Martin} and Hupke, {Robert Johannes} and Stephan Preihs and Peissig, {J{\"u}rgen Karl}",
year = "2018",
language = "English",
booktitle = "Fortschritte der Akustik - DAGA 2018",
note = "DAGA 2018 : 44. JAHRESTAGUNG F{\"U}R AKUSTIK ; Conference date: 19-03-2018 Through 22-03-2018",

}

Download

TY - GEN

T1 - Real-Time Estimation of Propagation Delays for Temporal Alignment of Audio Signals in Augmented Reality Applications

AU - Nophut, Marcel Martin

AU - Hupke, Robert Johannes

AU - Preihs, Stephan

AU - Peissig, Jürgen Karl

PY - 2018

Y1 - 2018

N2 - In augmented reality audio applications a superposition of environmental sounds and supplementary audio content is used to create auditory enhancements for the listener in a broad range of use cases. In some use cases environmental sounds and supplementary content may be highly correlated, for example in audience services at live events, where a live playback through PA speakers is enhanced by augmented reality audio content, e.g. to create an individualized live mix. Without temporal alignment of those signals a superposition causes comb filtering effects or confusing echoes.This contribution proposes an efficient method that is able to robustly detect a temporal offset of correlated audio signals. It is based on a recursive cross-correlation estimation and a peak detection algorithm. The method focuses on indoor music and speech events with their typically occurring problems like room reflections, crosstalk, tonal components and a large number of correlation lags. The obtained temporal offset is used to delay the supplementary audio content in order to achieve a temporal alignment of the signals.

AB - In augmented reality audio applications a superposition of environmental sounds and supplementary audio content is used to create auditory enhancements for the listener in a broad range of use cases. In some use cases environmental sounds and supplementary content may be highly correlated, for example in audience services at live events, where a live playback through PA speakers is enhanced by augmented reality audio content, e.g. to create an individualized live mix. Without temporal alignment of those signals a superposition causes comb filtering effects or confusing echoes.This contribution proposes an efficient method that is able to robustly detect a temporal offset of correlated audio signals. It is based on a recursive cross-correlation estimation and a peak detection algorithm. The method focuses on indoor music and speech events with their typically occurring problems like room reflections, crosstalk, tonal components and a large number of correlation lags. The obtained temporal offset is used to delay the supplementary audio content in order to achieve a temporal alignment of the signals.

M3 - Conference contribution

BT - Fortschritte der Akustik - DAGA 2018

T2 - DAGA 2018

Y2 - 19 March 2018 through 22 March 2018

ER -