Vector-Quantized Zero-Delay Deep Autoencoders for the Compression of Electrical Stimulation Patterns of Cochlear Implants using STOI

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Autoren

Forschungs-netzwerk anzeigen

Details

OriginalspracheEnglisch
Titel des Sammelwerks2022 IEEE-EMBS Conference on Biomedical Engineering and Sciences
Untertitel(IECBES)
Herausgeber (Verlag)Institute of Electrical and Electronics Engineers Inc.
Seiten165-170
Seitenumfang6
ISBN (elektronisch)9781665494694
ISBN (Print)978-1-6654-9470-0
PublikationsstatusVeröffentlicht - 2022
Veranstaltung7th IEEE-EMBS Conference on Biomedical Engineering and Sciences, IECBES 2022 - Virtual, Online, Malaysia
Dauer: 7 Dez. 20229 Dez. 2022

Abstract

Cochlear implants (CIs) are battery-powered, surgically implanted hearing-aids capable of restoring a sense of hearing in people suffering from moderate to profound hearing loss. Wireless transmission of audio from or to signal processors of cochlear implants can be used to improve speech understanding and localization of CI users. Data compression algorithms can be used to conserve battery power in this wireless transmission. However, very low latency is a strict requirement, limiting severly the available source coding algorithms. Previously, instead of coding the audio, coding of the electrical stimulation patterns of CIs was proposed to optimize the trade-off between bit-rate, latency and quality. In this work, a zero-delay deep autoencoder (DAE) for the coding of the electrical stimulation patters of CIs is proposed. Combining for the first time bayesian optimization with numerical approximated gradients of a nondifferential speech intelligibility measure for CIs, the short-time intelligibility measure (STOI), an optimized DAE architecture was found and trained that achieved equal or superior speech understanding at zero delay, outperforming well-known audio codecs. The DAE achieved reference vocoder STOI scores at 13.5 kbit/s compared to 33.6 kbit/s for Opus and 24.5 kbit/s for AMR-WB.

ASJC Scopus Sachgebiete

Ziele für nachhaltige Entwicklung

Zitieren

Vector-Quantized Zero-Delay Deep Autoencoders for the Compression of Electrical Stimulation Patterns of Cochlear Implants using STOI. / Hinrichs, Reemt; Ortmann, Felix; Ostermann, Jorn.
2022 IEEE-EMBS Conference on Biomedical Engineering and Sciences : (IECBES). Institute of Electrical and Electronics Engineers Inc., 2022. S. 165-170.

Publikation: Beitrag in Buch/Bericht/Sammelwerk/KonferenzbandAufsatz in KonferenzbandForschungPeer-Review

Hinrichs, R, Ortmann, F & Ostermann, J 2022, Vector-Quantized Zero-Delay Deep Autoencoders for the Compression of Electrical Stimulation Patterns of Cochlear Implants using STOI. in 2022 IEEE-EMBS Conference on Biomedical Engineering and Sciences : (IECBES). Institute of Electrical and Electronics Engineers Inc., S. 165-170, 7th IEEE-EMBS Conference on Biomedical Engineering and Sciences, IECBES 2022, Virtual, Online, Malaysia, 7 Dez. 2022. https://doi.org/10.1109/IECBES54088.2022.10079466
Hinrichs, R., Ortmann, F., & Ostermann, J. (2022). Vector-Quantized Zero-Delay Deep Autoencoders for the Compression of Electrical Stimulation Patterns of Cochlear Implants using STOI. In 2022 IEEE-EMBS Conference on Biomedical Engineering and Sciences : (IECBES) (S. 165-170). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/IECBES54088.2022.10079466
Hinrichs R, Ortmann F, Ostermann J. Vector-Quantized Zero-Delay Deep Autoencoders for the Compression of Electrical Stimulation Patterns of Cochlear Implants using STOI. in 2022 IEEE-EMBS Conference on Biomedical Engineering and Sciences : (IECBES). Institute of Electrical and Electronics Engineers Inc. 2022. S. 165-170 doi: 10.1109/IECBES54088.2022.10079466
Hinrichs, Reemt ; Ortmann, Felix ; Ostermann, Jorn. / Vector-Quantized Zero-Delay Deep Autoencoders for the Compression of Electrical Stimulation Patterns of Cochlear Implants using STOI. 2022 IEEE-EMBS Conference on Biomedical Engineering and Sciences : (IECBES). Institute of Electrical and Electronics Engineers Inc., 2022. S. 165-170
Download
@inproceedings{0017287cc1484b42b1df998cb9a92128,
title = "Vector-Quantized Zero-Delay Deep Autoencoders for the Compression of Electrical Stimulation Patterns of Cochlear Implants using STOI",
abstract = "Cochlear implants (CIs) are battery-powered, surgically implanted hearing-aids capable of restoring a sense of hearing in people suffering from moderate to profound hearing loss. Wireless transmission of audio from or to signal processors of cochlear implants can be used to improve speech understanding and localization of CI users. Data compression algorithms can be used to conserve battery power in this wireless transmission. However, very low latency is a strict requirement, limiting severly the available source coding algorithms. Previously, instead of coding the audio, coding of the electrical stimulation patterns of CIs was proposed to optimize the trade-off between bit-rate, latency and quality. In this work, a zero-delay deep autoencoder (DAE) for the coding of the electrical stimulation patters of CIs is proposed. Combining for the first time bayesian optimization with numerical approximated gradients of a nondifferential speech intelligibility measure for CIs, the short-time intelligibility measure (STOI), an optimized DAE architecture was found and trained that achieved equal or superior speech understanding at zero delay, outperforming well-known audio codecs. The DAE achieved reference vocoder STOI scores at 13.5 kbit/s compared to 33.6 kbit/s for Opus and 24.5 kbit/s for AMR-WB.",
keywords = "autoencoder, cochlear implants, hyperparameter optimization, Kiefer-Wolfowitz",
author = "Reemt Hinrichs and Felix Ortmann and Jorn Ostermann",
note = "Funding Information: This work was supported by the DFG Cluster of Excellence EXC 1077/1 Hearing4all and funded by the German Research Foundation (DFG) - Project number: 381895691.; 7th IEEE-EMBS Conference on Biomedical Engineering and Sciences, IECBES 2022 ; Conference date: 07-12-2022 Through 09-12-2022",
year = "2022",
doi = "10.1109/IECBES54088.2022.10079466",
language = "English",
isbn = "978-1-6654-9470-0",
pages = "165--170",
booktitle = "2022 IEEE-EMBS Conference on Biomedical Engineering and Sciences",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
address = "United States",

}

Download

TY - GEN

T1 - Vector-Quantized Zero-Delay Deep Autoencoders for the Compression of Electrical Stimulation Patterns of Cochlear Implants using STOI

AU - Hinrichs, Reemt

AU - Ortmann, Felix

AU - Ostermann, Jorn

N1 - Funding Information: This work was supported by the DFG Cluster of Excellence EXC 1077/1 Hearing4all and funded by the German Research Foundation (DFG) - Project number: 381895691.

PY - 2022

Y1 - 2022

N2 - Cochlear implants (CIs) are battery-powered, surgically implanted hearing-aids capable of restoring a sense of hearing in people suffering from moderate to profound hearing loss. Wireless transmission of audio from or to signal processors of cochlear implants can be used to improve speech understanding and localization of CI users. Data compression algorithms can be used to conserve battery power in this wireless transmission. However, very low latency is a strict requirement, limiting severly the available source coding algorithms. Previously, instead of coding the audio, coding of the electrical stimulation patterns of CIs was proposed to optimize the trade-off between bit-rate, latency and quality. In this work, a zero-delay deep autoencoder (DAE) for the coding of the electrical stimulation patters of CIs is proposed. Combining for the first time bayesian optimization with numerical approximated gradients of a nondifferential speech intelligibility measure for CIs, the short-time intelligibility measure (STOI), an optimized DAE architecture was found and trained that achieved equal or superior speech understanding at zero delay, outperforming well-known audio codecs. The DAE achieved reference vocoder STOI scores at 13.5 kbit/s compared to 33.6 kbit/s for Opus and 24.5 kbit/s for AMR-WB.

AB - Cochlear implants (CIs) are battery-powered, surgically implanted hearing-aids capable of restoring a sense of hearing in people suffering from moderate to profound hearing loss. Wireless transmission of audio from or to signal processors of cochlear implants can be used to improve speech understanding and localization of CI users. Data compression algorithms can be used to conserve battery power in this wireless transmission. However, very low latency is a strict requirement, limiting severly the available source coding algorithms. Previously, instead of coding the audio, coding of the electrical stimulation patterns of CIs was proposed to optimize the trade-off between bit-rate, latency and quality. In this work, a zero-delay deep autoencoder (DAE) for the coding of the electrical stimulation patters of CIs is proposed. Combining for the first time bayesian optimization with numerical approximated gradients of a nondifferential speech intelligibility measure for CIs, the short-time intelligibility measure (STOI), an optimized DAE architecture was found and trained that achieved equal or superior speech understanding at zero delay, outperforming well-known audio codecs. The DAE achieved reference vocoder STOI scores at 13.5 kbit/s compared to 33.6 kbit/s for Opus and 24.5 kbit/s for AMR-WB.

KW - autoencoder

KW - cochlear implants

KW - hyperparameter optimization

KW - Kiefer-Wolfowitz

UR - http://www.scopus.com/inward/record.url?scp=85152382534&partnerID=8YFLogxK

U2 - 10.1109/IECBES54088.2022.10079466

DO - 10.1109/IECBES54088.2022.10079466

M3 - Conference contribution

AN - SCOPUS:85152382534

SN - 978-1-6654-9470-0

SP - 165

EP - 170

BT - 2022 IEEE-EMBS Conference on Biomedical Engineering and Sciences

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 7th IEEE-EMBS Conference on Biomedical Engineering and Sciences, IECBES 2022

Y2 - 7 December 2022 through 9 December 2022

ER -

Von denselben Autoren