Y Tang
A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones
Tang, Y; Liu, Q; Wang, W; Cox, TJ
Abstract
A non-intrusive method is introduced to predict binaural speech intelligibility in noise directly from signals captured using a pair of microphones. The approach combines signal processing techniques in blind source separation
and localisation, with an intrusive objective intelligibility measure (OIM). Therefore, unlike classic intrusive OIMs, this method does not require a clean reference speech signal and knowing the location of the sources to operate.
The proposed approach is able to estimate intelligibility in stationary and fluctuating noises, when the noise masker is presented as a point or diffused source, and is spatially separated from the target speech source on a horizontal
plane. The performance of the proposed method was evaluated in two rooms. When predicting subjective intelligibility measured as word recognition rate, this method showed reasonable predictive accuracy with correlation coefficients above 0.82, which is comparable to that of a reference intrusive OIM in most of the conditions. The proposed approach offers a solution for fast binaural intelligibility prediction, and therefore has practical potential
to be deployed in situations where on-site speech intelligibility is a concern.
Journal Article Type | Article |
---|---|
Acceptance Date | Dec 11, 2017 |
Online Publication Date | Dec 13, 2017 |
Publication Date | Feb 1, 2018 |
Deposit Date | Dec 12, 2017 |
Publicly Available Date | Dec 14, 2017 |
Journal | Speech Communication |
Print ISSN | 0167-6393 |
Publisher | Elsevier |
Volume | 96 |
Pages | 116-128 |
DOI | https://doi.org/10.1016/j.specom.2017.12.005 |
Publisher URL | http://dx.doi.org/10.1016/j.specom.2017.12.005 |
Related Public URLs | https://www.journals.elsevier.com/speech-communication |
Files
1-s2.0-S0167639317302248-main final VOR.pdf
(1.4 Mb)
PDF
Licence
http://creativecommons.org/licenses/by/4.0/
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
You might also like
Improving the measurement and acoustic performance of transparent face masks and shields
(2022)
Journal Article
Using scale modelling to assess the prehistoric acoustics of stonehenge
(2020)
Journal Article
Fast speech intelligibility estimation using a neural network trained via distillation
(2020)
Presentation / Conference
Pupil dilation reveals changes in listening effort due to energetic and informational masking
(2019)
Presentation / Conference
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search