Y Tang
A study on the relationship between the intelligibility and quality of algorithmically-modified speech for normal hearing listeners
Tang, Y; Arnold, C; Cox, TJ
Abstract
This study investigates the relationship between the intelligibility and quality of modified speech in noise and in quiet. Speech signals were processed by seven algorithms designed to increase speech intelligibility in noise without altering speech intensity. In three noise maskers, including both stationary and fluctuating noise at two signal-to-noise ratios (SNR), listeners identified keywords from unmodified or modified sentences. The intelligibility performance of each type of speech was measured as the listeners’ word recognition rate in each condition, while the quality was rated as a mean opinion score. In quiet, only the perceptual quality of each type of speech was assessed. The results suggest that when listening in noise, modification performance on improving intelligibility is more important than its potential negative impact on speech quality. However, when listening in quiet or at SNRs in which intelligibility is no longer an issue to listeners, the impact to speech quality due to modification becomes a concern.
Citation
Tang, Y., Arnold, C., & Cox, T. (2017). A study on the relationship between the intelligibility and quality of algorithmically-modified speech for normal hearing listeners. Journal of Otorhinolaryngology, Hearing and Balance Medicine, 1(1), https://doi.org/10.3390/ohbm1010005
Journal Article Type | Article |
---|---|
Acceptance Date | Dec 7, 2017 |
Publication Date | Dec 8, 2017 |
Deposit Date | Dec 8, 2017 |
Publicly Available Date | Dec 8, 2017 |
Journal | Journal of Otorhinolaryngology, Hearing and Balance Medicine |
Publisher | MDPI |
Volume | 1 |
Issue | 1 |
DOI | https://doi.org/10.3390/ohbm1010005 |
Publisher URL | http://dx.doi.org/10.3390/ohbm1010005 |
Related Public URLs | http://www.mdpi.com/journal/ohbm |
Files
ohbm-01-00005.pdf
(500 Kb)
PDF
Licence
http://creativecommons.org/licenses/by/4.0/
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
You might also like
Improving the measurement and acoustic performance of transparent face masks and shields
(2022)
Journal Article
Using scale modelling to assess the prehistoric acoustics of stonehenge
(2020)
Journal Article
Fast speech intelligibility estimation using a neural network trained via distillation
(2020)
Presentation / Conference
Pupil dilation reveals changes in listening effort due to energetic and informational masking
(2019)
Presentation / Conference