Direction of Arrival Estimation of Noisy Speech using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Authors

  • Nils Poschadel
  • Robert Hupke
  • Stephan Preihs
  • Jürgen Peissig
View graph of relations

Details

Original languageEnglish
Title of host publication29th European Signal Processing Conference, EUSIPCO 2021 - Proceedings
Pages211-215
Number of pages5
ISBN (electronic)9789082797060
Publication statusPublished - 2021
Event29th European Signal Processing Conference, EUSIPCO 2021 - Dublin, Ireland
Duration: 23 Aug 202127 Aug 2021

Publication series

NameEuropean Signal Processing Conference
Volume2021-August
ISSN (Print)2219-5491
ISSN (electronic)2076-1465

Abstract

Training convolutional recurrent neural networks on first-order Ambisonics signals is a well-known approach when estimating the direction of arrival for speech/sound signals. In this work, we investigate whether increasing the order of Ambisonics up to the fourth order further improves the estimation performance of convolutional recurrent neural networks. While our results on data based on simulated spatial room impulse responses show that the use of higher Ambisonics orders does have the potential to provide better localization results, no further improvement was shown on data based on real spatial room impulse responses from order two onwards. Rather, it seems to be crucial to extract meaningful features from the raw data. First order features derived from the acoustic intensity vector were superior to pure higher-order magnitude and phase features in almost all scenarios.

Keywords

    Convolutional recurrent neural network, Direction of arrival estimation, Higher-order ambisonics, Spherical harmonics

ASJC Scopus subject areas

Cite this

Direction of Arrival Estimation of Noisy Speech using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals. / Poschadel, Nils; Hupke, Robert; Preihs, Stephan et al.
29th European Signal Processing Conference, EUSIPCO 2021 - Proceedings. 2021. p. 211-215 (European Signal Processing Conference; Vol. 2021-August).

Research output: Chapter in book/report/conference proceedingConference contributionResearchpeer review

Poschadel, N, Hupke, R, Preihs, S & Peissig, J 2021, Direction of Arrival Estimation of Noisy Speech using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals. in 29th European Signal Processing Conference, EUSIPCO 2021 - Proceedings. European Signal Processing Conference, vol. 2021-August, pp. 211-215, 29th European Signal Processing Conference, EUSIPCO 2021, Dublin, Ireland, 23 Aug 2021. https://doi.org/10.48550/arXiv.2102.09853, https://doi.org/10.23919/EUSIPCO54536.2021.9616204
Poschadel, N., Hupke, R., Preihs, S., & Peissig, J. (2021). Direction of Arrival Estimation of Noisy Speech using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals. In 29th European Signal Processing Conference, EUSIPCO 2021 - Proceedings (pp. 211-215). (European Signal Processing Conference; Vol. 2021-August). https://doi.org/10.48550/arXiv.2102.09853, https://doi.org/10.23919/EUSIPCO54536.2021.9616204
Poschadel N, Hupke R, Preihs S, Peissig J. Direction of Arrival Estimation of Noisy Speech using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals. In 29th European Signal Processing Conference, EUSIPCO 2021 - Proceedings. 2021. p. 211-215. (European Signal Processing Conference). Epub 2021 Feb 19. doi: 10.48550/arXiv.2102.09853, 10.23919/EUSIPCO54536.2021.9616204
Poschadel, Nils ; Hupke, Robert ; Preihs, Stephan et al. / Direction of Arrival Estimation of Noisy Speech using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals. 29th European Signal Processing Conference, EUSIPCO 2021 - Proceedings. 2021. pp. 211-215 (European Signal Processing Conference).
Download
@inproceedings{fa6cdc7d5d6c4521a8061700d2849b64,
title = "Direction of Arrival Estimation of Noisy Speech using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals",
abstract = " Training convolutional recurrent neural networks on first-order Ambisonics signals is a well-known approach when estimating the direction of arrival for speech/sound signals. In this work, we investigate whether increasing the order of Ambisonics up to the fourth order further improves the estimation performance of convolutional recurrent neural networks. While our results on data based on simulated spatial room impulse responses show that the use of higher Ambisonics orders does have the potential to provide better localization results, no further improvement was shown on data based on real spatial room impulse responses from order two onwards. Rather, it seems to be crucial to extract meaningful features from the raw data. First order features derived from the acoustic intensity vector were superior to pure higher-order magnitude and phase features in almost all scenarios. ",
keywords = "Convolutional recurrent neural network, Direction of arrival estimation, Higher-order ambisonics, Spherical harmonics",
author = "Nils Poschadel and Robert Hupke and Stephan Preihs and J{\"u}rgen Peissig",
year = "2021",
doi = "10.48550/arXiv.2102.09853",
language = "English",
isbn = "978-1-6654-0900-1",
series = "European Signal Processing Conference",
pages = "211--215",
booktitle = "29th European Signal Processing Conference, EUSIPCO 2021 - Proceedings",
note = "29th European Signal Processing Conference, EUSIPCO 2021 ; Conference date: 23-08-2021 Through 27-08-2021",

}

Download

TY - GEN

T1 - Direction of Arrival Estimation of Noisy Speech using Convolutional Recurrent Neural Networks with Higher-Order Ambisonics Signals

AU - Poschadel, Nils

AU - Hupke, Robert

AU - Preihs, Stephan

AU - Peissig, Jürgen

PY - 2021

Y1 - 2021

N2 - Training convolutional recurrent neural networks on first-order Ambisonics signals is a well-known approach when estimating the direction of arrival for speech/sound signals. In this work, we investigate whether increasing the order of Ambisonics up to the fourth order further improves the estimation performance of convolutional recurrent neural networks. While our results on data based on simulated spatial room impulse responses show that the use of higher Ambisonics orders does have the potential to provide better localization results, no further improvement was shown on data based on real spatial room impulse responses from order two onwards. Rather, it seems to be crucial to extract meaningful features from the raw data. First order features derived from the acoustic intensity vector were superior to pure higher-order magnitude and phase features in almost all scenarios.

AB - Training convolutional recurrent neural networks on first-order Ambisonics signals is a well-known approach when estimating the direction of arrival for speech/sound signals. In this work, we investigate whether increasing the order of Ambisonics up to the fourth order further improves the estimation performance of convolutional recurrent neural networks. While our results on data based on simulated spatial room impulse responses show that the use of higher Ambisonics orders does have the potential to provide better localization results, no further improvement was shown on data based on real spatial room impulse responses from order two onwards. Rather, it seems to be crucial to extract meaningful features from the raw data. First order features derived from the acoustic intensity vector were superior to pure higher-order magnitude and phase features in almost all scenarios.

KW - Convolutional recurrent neural network

KW - Direction of arrival estimation

KW - Higher-order ambisonics

KW - Spherical harmonics

UR - http://www.scopus.com/inward/record.url?scp=85123160520&partnerID=8YFLogxK

U2 - 10.48550/arXiv.2102.09853

DO - 10.48550/arXiv.2102.09853

M3 - Conference contribution

SN - 978-1-6654-0900-1

T3 - European Signal Processing Conference

SP - 211

EP - 215

BT - 29th European Signal Processing Conference, EUSIPCO 2021 - Proceedings

T2 - 29th European Signal Processing Conference, EUSIPCO 2021

Y2 - 23 August 2021 through 27 August 2021

ER -