Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Autoren

  • Shivam Saini
  • Jürgen Peissig

Organisationseinheiten

Externe Organisationen

  • Huawei Technologies Deutschland GmbH
Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des SammelwerksProceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023
Herausgeber (Verlag)Institute of Electrical and Electronics Engineers Inc.
ISBN (elektronisch)9798350323726
ISBN (Print)979-8-3503-2373-3
PublikationsstatusVeröffentlicht - 2023
Veranstaltung2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) - New Paltz, USA / Vereinigte Staaten
Dauer: 22 Okt. 202325 Okt. 2023

Publikationsreihe

NameIEEE Workshop on Applications of Signal Processing to Audio and Acoustics
ISSN (Print)1931-1168
ISSN (elektronisch)1947-1629

Abstract

For an accurate representation of a virtual sound source in an Augmented Reality environment, it is crucial to understand the acoustic properties of the current room where the source is to be rendered. Reverberation Time (RT60) and Clarity (C50) are two of the most significant parameters that could negatively impact the plausibility of the source when estimated incorrectly. We propose using an audio transformer to estimate these parameters blindly and solely from a single-channel noisy speech signal. Furthermore, to ensure efficiency and minimum computational complexity, the use of an efficient lightweight architecture is proposed for its suitability for mobile-friendly applications. We evaluate the proposed model with regard to its complexity and accuracy against the existing state-of-the-art approaches in unseen and real acoustic settings. Results demonstrate that the proposed model surpasses the traditional CNN models in terms of complexity, size, speed, and precision.

ASJC Scopus Sachgebiete

Zitieren

Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer. / Saini, Shivam; Peissig, Jürgen.
Proceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023. Institute of Electrical and Electronics Engineers Inc., 2023. (IEEE Workshop on Applications of Signal Processing to Audio and Acoustics).

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Saini, S & Peissig, J 2023, Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer. in Proceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Institute of Electrical and Electronics Engineers Inc., 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), New Paltz, New York, USA / Vereinigte Staaten, 22 Okt. 2023. https://doi.org/10.1109/WASPAA58266.2023.10248186
Saini, S., & Peissig, J. (2023). Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer. In Proceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023 (IEEE Workshop on Applications of Signal Processing to Audio and Acoustics). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/WASPAA58266.2023.10248186
Saini S, Peissig J. Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer. in Proceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023. Institute of Electrical and Electronics Engineers Inc. 2023. (IEEE Workshop on Applications of Signal Processing to Audio and Acoustics). doi: 10.1109/WASPAA58266.2023.10248186
Saini, Shivam ; Peissig, Jürgen. / Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer. Proceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023. Institute of Electrical and Electronics Engineers Inc., 2023. (IEEE Workshop on Applications of Signal Processing to Audio and Acoustics).
Download
@inproceedings{98d491996a484dfaad49b28a97594d35,
title = "Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer",
abstract = "For an accurate representation of a virtual sound source in an Augmented Reality environment, it is crucial to understand the acoustic properties of the current room where the source is to be rendered. Reverberation Time (RT60) and Clarity (C50) are two of the most significant parameters that could negatively impact the plausibility of the source when estimated incorrectly. We propose using an audio transformer to estimate these parameters blindly and solely from a single-channel noisy speech signal. Furthermore, to ensure efficiency and minimum computational complexity, the use of an efficient lightweight architecture is proposed for its suitability for mobile-friendly applications. We evaluate the proposed model with regard to its complexity and accuracy against the existing state-of-the-art approaches in unseen and real acoustic settings. Results demonstrate that the proposed model surpasses the traditional CNN models in terms of complexity, size, speed, and precision.",
keywords = "AudMobNet, Mobile Audio Transformer, Reverberation Time Estimation, Room Acoustic Parameter Estimation",
author = "Shivam Saini and J{\"u}rgen Peissig",
year = "2023",
doi = "10.1109/WASPAA58266.2023.10248186",
language = "English",
isbn = "979-8-3503-2373-3",
series = "IEEE Workshop on Applications of Signal Processing to Audio and Acoustics",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
booktitle = "Proceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023",
address = "United States",
note = "2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023 ; Conference date: 22-10-2023 Through 25-10-2023",

}

Download

TY - GEN

T1 - Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer

AU - Saini, Shivam

AU - Peissig, Jürgen

PY - 2023

Y1 - 2023

N2 - For an accurate representation of a virtual sound source in an Augmented Reality environment, it is crucial to understand the acoustic properties of the current room where the source is to be rendered. Reverberation Time (RT60) and Clarity (C50) are two of the most significant parameters that could negatively impact the plausibility of the source when estimated incorrectly. We propose using an audio transformer to estimate these parameters blindly and solely from a single-channel noisy speech signal. Furthermore, to ensure efficiency and minimum computational complexity, the use of an efficient lightweight architecture is proposed for its suitability for mobile-friendly applications. We evaluate the proposed model with regard to its complexity and accuracy against the existing state-of-the-art approaches in unseen and real acoustic settings. Results demonstrate that the proposed model surpasses the traditional CNN models in terms of complexity, size, speed, and precision.

AB - For an accurate representation of a virtual sound source in an Augmented Reality environment, it is crucial to understand the acoustic properties of the current room where the source is to be rendered. Reverberation Time (RT60) and Clarity (C50) are two of the most significant parameters that could negatively impact the plausibility of the source when estimated incorrectly. We propose using an audio transformer to estimate these parameters blindly and solely from a single-channel noisy speech signal. Furthermore, to ensure efficiency and minimum computational complexity, the use of an efficient lightweight architecture is proposed for its suitability for mobile-friendly applications. We evaluate the proposed model with regard to its complexity and accuracy against the existing state-of-the-art approaches in unseen and real acoustic settings. Results demonstrate that the proposed model surpasses the traditional CNN models in terms of complexity, size, speed, and precision.

KW - AudMobNet

KW - Mobile Audio Transformer

KW - Reverberation Time Estimation

KW - Room Acoustic Parameter Estimation

UR - http://www.scopus.com/inward/record.url?scp=85173012743&partnerID=8YFLogxK

U2 - 10.1109/WASPAA58266.2023.10248186

DO - 10.1109/WASPAA58266.2023.10248186

M3 - Conference contribution

AN - SCOPUS:85173012743

SN - 979-8-3503-2373-3

T3 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

BT - Proceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023

Y2 - 22 October 2023 through 25 October 2023

ER -