Y Tang
Evaluating a distortion-weighted glimpsing metric for predicting binaural speech intelligibility in rooms
Tang, Y; Hughes, RJ; Fazenda, BM; Cox, TJ
Authors
RJ Hughes
Dr Bruno Fazenda B.M.Fazenda@salford.ac.uk
Associate Professor/Reader
Prof Trevor Cox T.J.Cox@salford.ac.uk
Professor
Abstract
A distortion-weighted glimpse proportion metric (BiDWGP) for predicting binaural speech intelligibility were evaluated in simulated anechoic and reverberant conditions, with and without a noise masker. The predictive performance of BiDWGP was compared to four reference binaural intelligibility metrics, which were extended from the Speech Intelligibility Index (SII) and the Speech Transmission Index (STI). In the anechoic sound field, BiDWGP demonstrated high accuracy in predicting binaural intelligibility for individual maskers (ρ ≥ 0.95) and across maskers (ρ ≥ 0.94). The reference metrics however performed less well in across-masker prediction (0.54 ≤ ρ ≤ 0.86) despite reasonable accuracy for individual maskers. In reverberant rooms, BiDWGP was more stable in all test conditions (ρ ≥ 0.87) than the reference metrics, which showed different predictive patterns: the binaural STIs were more robust for the stationary than for the fluctuating noise masker, whilst the binaural SII displayed the opposite behaviour. The study shows that the new BiDWGP metric can provide similar or even more robust predictive power than the current standard metrics
Citation
Tang, Y., Hughes, R., Fazenda, B., & Cox, T. (2016). Evaluating a distortion-weighted glimpsing metric for predicting binaural speech intelligibility in rooms. Speech Communication, 82, 26-37. https://doi.org/10.1016/j.specom.2016.04.003
Journal Article Type | Article |
---|---|
Acceptance Date | Apr 14, 2016 |
Online Publication Date | May 28, 2016 |
Publication Date | Sep 1, 2016 |
Deposit Date | Apr 18, 2016 |
Publicly Available Date | Jun 10, 2016 |
Journal | Speech Communication |
Print ISSN | 0167-6393 |
Publisher | Elsevier |
Volume | 82 |
Pages | 26-37 |
DOI | https://doi.org/10.1016/j.specom.2016.04.003 |
Publisher URL | http://dx.doi.org/10.1016/j.specom.2016.04.003 |
Related Public URLs | http://www.journals.elsevier.com/speech-communication/ |
Files
1-s2.0-S0167639315300959-main.pdf
(1.1 Mb)
PDF
Licence
http://creativecommons.org/licenses/by/4.0/
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
You might also like
Speech-to-screen : spatial separation of dialogue from noise towards improved speech intelligibility for the small screen
(2018)
Presentation / Conference
An audio-visual system for object-based audio : from recording to listening
(2018)
Journal Article
Object-based reverberation for spatial audio
(2017)
Journal Article
Characterization of acoustic diffusion using refracto-vibrometry
(2016)
Journal Article
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search