Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Shivam Saini
  • Jürgen Peissig

External Research Organisations

  • HUAWEI TECHNOLOGIES Duesseldorf GmbH
View graph of relations

Details

Original languageEnglish
Title of host publicationProceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (electronic)9798350323726
ISBN (print)979-8-3503-2373-3
Publication statusPublished - 2023
Event2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023 - New Paltz, United States
Duration: 22 Oct 202325 Oct 2023

Publication series

NameIEEE Workshop on Applications of Signal Processing to Audio and Acoustics
ISSN (Print)1931-1168
ISSN (electronic)1947-1629

Abstract

For an accurate representation of a virtual sound source in an Augmented Reality environment, it is crucial to understand the acoustic properties of the current room where the source is to be rendered. Reverberation Time (RT60) and Clarity (C50) are two of the most significant parameters that could negatively impact the plausibility of the source when estimated incorrectly. We propose using an audio transformer to estimate these parameters blindly and solely from a single-channel noisy speech signal. Furthermore, to ensure efficiency and minimum computational complexity, the use of an efficient lightweight architecture is proposed for its suitability for mobile-friendly applications. We evaluate the proposed model with regard to its complexity and accuracy against the existing state-of-the-art approaches in unseen and real acoustic settings. Results demonstrate that the proposed model surpasses the traditional CNN models in terms of complexity, size, speed, and precision.

Keywords

    AudMobNet, Mobile Audio Transformer, Reverberation Time Estimation, Room Acoustic Parameter Estimation

ASJC Scopus subject areas

Cite this

Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer. / Saini, Shivam; Peissig, Jürgen.
Proceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023. Institute of Electrical and Electronics Engineers Inc., 2023. (IEEE Workshop on Applications of Signal Processing to Audio and Acoustics).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Saini, S & Peissig, J 2023, Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer. in Proceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, Institute of Electrical and Electronics Engineers Inc., 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023, New Paltz, New York, United States, 22 Oct 2023. https://doi.org/10.1109/WASPAA58266.2023.10248186
Saini, S., & Peissig, J. (2023). Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer. In Proceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023 (IEEE Workshop on Applications of Signal Processing to Audio and Acoustics). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/WASPAA58266.2023.10248186
Saini S, Peissig J. Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer. In Proceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023. Institute of Electrical and Electronics Engineers Inc. 2023. (IEEE Workshop on Applications of Signal Processing to Audio and Acoustics). doi: 10.1109/WASPAA58266.2023.10248186
Saini, Shivam ; Peissig, Jürgen. / Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer. Proceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023. Institute of Electrical and Electronics Engineers Inc., 2023. (IEEE Workshop on Applications of Signal Processing to Audio and Acoustics).
Download
@inproceedings{98d491996a484dfaad49b28a97594d35,
title = "Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer",
abstract = "For an accurate representation of a virtual sound source in an Augmented Reality environment, it is crucial to understand the acoustic properties of the current room where the source is to be rendered. Reverberation Time (RT60) and Clarity (C50) are two of the most significant parameters that could negatively impact the plausibility of the source when estimated incorrectly. We propose using an audio transformer to estimate these parameters blindly and solely from a single-channel noisy speech signal. Furthermore, to ensure efficiency and minimum computational complexity, the use of an efficient lightweight architecture is proposed for its suitability for mobile-friendly applications. We evaluate the proposed model with regard to its complexity and accuracy against the existing state-of-the-art approaches in unseen and real acoustic settings. Results demonstrate that the proposed model surpasses the traditional CNN models in terms of complexity, size, speed, and precision.",
keywords = "AudMobNet, Mobile Audio Transformer, Reverberation Time Estimation, Room Acoustic Parameter Estimation",
author = "Shivam Saini and J{\"u}rgen Peissig",
year = "2023",
doi = "10.1109/WASPAA58266.2023.10248186",
language = "English",
isbn = "979-8-3503-2373-3",
series = "IEEE Workshop on Applications of Signal Processing to Audio and Acoustics",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
booktitle = "Proceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023",
address = "United States",
note = "2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023 ; Conference date: 22-10-2023 Through 25-10-2023",

}

Download

TY - GEN

T1 - Blind Room Acoustic Parameters Estimation Using Mobile Audio Transformer

AU - Saini, Shivam

AU - Peissig, Jürgen

PY - 2023

Y1 - 2023

N2 - For an accurate representation of a virtual sound source in an Augmented Reality environment, it is crucial to understand the acoustic properties of the current room where the source is to be rendered. Reverberation Time (RT60) and Clarity (C50) are two of the most significant parameters that could negatively impact the plausibility of the source when estimated incorrectly. We propose using an audio transformer to estimate these parameters blindly and solely from a single-channel noisy speech signal. Furthermore, to ensure efficiency and minimum computational complexity, the use of an efficient lightweight architecture is proposed for its suitability for mobile-friendly applications. We evaluate the proposed model with regard to its complexity and accuracy against the existing state-of-the-art approaches in unseen and real acoustic settings. Results demonstrate that the proposed model surpasses the traditional CNN models in terms of complexity, size, speed, and precision.

AB - For an accurate representation of a virtual sound source in an Augmented Reality environment, it is crucial to understand the acoustic properties of the current room where the source is to be rendered. Reverberation Time (RT60) and Clarity (C50) are two of the most significant parameters that could negatively impact the plausibility of the source when estimated incorrectly. We propose using an audio transformer to estimate these parameters blindly and solely from a single-channel noisy speech signal. Furthermore, to ensure efficiency and minimum computational complexity, the use of an efficient lightweight architecture is proposed for its suitability for mobile-friendly applications. We evaluate the proposed model with regard to its complexity and accuracy against the existing state-of-the-art approaches in unseen and real acoustic settings. Results demonstrate that the proposed model surpasses the traditional CNN models in terms of complexity, size, speed, and precision.

KW - AudMobNet

KW - Mobile Audio Transformer

KW - Reverberation Time Estimation

KW - Room Acoustic Parameter Estimation

UR - http://www.scopus.com/inward/record.url?scp=85173012743&partnerID=8YFLogxK

U2 - 10.1109/WASPAA58266.2023.10248186

DO - 10.1109/WASPAA58266.2023.10248186

M3 - Conference contribution

AN - SCOPUS:85173012743

SN - 979-8-3503-2373-3

T3 - IEEE Workshop on Applications of Signal Processing to Audio and Acoustics

BT - Proceedings of the 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2023 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2023

Y2 - 22 October 2023 through 25 October 2023

ER -