Dr Simone Graetzer S.N.Graetzer@salford.ac.uk
Research Fellow
The effect of additive white Gaussian noise and high-pass filtering on speech intelligibility at signal-to-noise ratios (SNRs) from -26 to 0 dB was evaluated using British English talkers and normal hearing listeners. SNRs below -10 dB were considered as they are relevant to speech security applications. Eight objective metrics were assessed: Short-Time Objective Intelligibility (STOI), a proposed variant termed STOI+, Extended Short-Time Objective Intelligibility (ESTOI), Normalised Covariance Metric (NCM), Normalised Sub-band Envelope Correlation metric (NSEC), two metrics derived from the Coherence Speech Intelligibility Index (CSII), and an envelope-based regression method Speech Transmission Index (STI). For speech and noise mixtures associated with intelligibility scores ranging from 0% to 98%, STOI+ performed at least as well as other metrics, and under some conditions better than STOI, ESTOI, STI, NSEC, CSIIMid and CSIIHigh. Both STOI+ and NCM were associated with relatively low prediction error and bias for intelligibility prediction at SNRs from -26 to 0 dB. STI performed least well in terms of correlation with intelligibility scores, prediction error, bias and reliability. Logistic regression modelling demonstrated that high-pass filtering, which increases the proportion of high to low frequency energy, was detrimental to intelligibility for SNRs between -5 and -17 dB inclusive.
Graetzer, S., & Hopkins, C. (2021). Intelligibility prediction for speech mixed with white Gaussian noise at low signal-to-noise ratios. The Journal of the Acoustical Society of America (Online), 149(2), 1346-1362. https://doi.org/10.1121/10.0003557
Journal Article Type | Article |
---|---|
Acceptance Date | Jan 28, 2021 |
Publication Date | Feb 25, 2021 |
Deposit Date | Mar 5, 2021 |
Publicly Available Date | Mar 5, 2021 |
Journal | The Journal of the Acoustical Society of America (JASA) |
Print ISSN | 0001-4966 |
Volume | 149 |
Issue | 2 |
Pages | 1346-1362 |
DOI | https://doi.org/10.1121/10.0003557 |
Publisher URL | https://doi.org/10.1121/10.0003557 |
Related Public URLs | http://asa.scitation.org/journal/jas |
1346_1.pdf
(4.8 Mb)
PDF
Licence
http://creativecommons.org/licenses/by/4.0/
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
Understanding human responses to air source heat pump noise: examining tonal changes and operating cycles
(2024)
Presentation / Conference Contribution
Higher-fidelity analysis of air source heat pump noise via scanning intensity measurement
(2024)
Presentation / Conference Contribution
The 2nd Clarity Prediction Challenge: A Machine Learning Challenge for Hearing Aid Intelligibility Prediction
(2024)
Presentation / Conference Contribution
The ICASSP SP Cadenza Challenge: Music Demixing/Remixing for Hearing Aids
(2024)
Presentation / Conference Contribution
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
Apache License Version 2.0 (http://www.apache.org/licenses/)
Apache License Version 2.0 (http://www.apache.org/licenses/)
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search