Skip to main content

Research Repository

Advanced Search

Evaluating the predictions of objective intelligibility metrics for modified and synthetic speech

Tang, Y; Cooke, M; Valentini-Botinhao, C

Authors

Y Tang

M Cooke

C Valentini-Botinhao



Abstract

Several modification algorithms that alter natural or synthetic speech with the goal of improving intelligibility in noise have been proposed recently. A key requirement of many modification techniques is the ability to predict intelligibility, both offline during algorithm development, and online, in order to determine the optimal modification for the current noise context. While existing objective intelligibility metrics (OIMs) have good predictive power for unmodified natural speech in stationary and fluctuating noise, little is known about their effectiveness for other forms of speech. The current study evaluated how well seven OIMs predict listener responses in three large datasets of modified and synthetic speech which together represent 396 combinations of speech modification, masker type and signal-to-noise ratio. The chief finding is a clear reduction in predictive power for most OIMs when faced with modified and synthetic speech. Modifications introducing durational changes are particularly harmful to intelligibility predictors. OIMs that measure masked audibility tend to over-estimate intelligibility in the presence of fluctuating maskers relative to stationary maskers, while OIMs that estimate the distortion caused by the masker to a clean speech prototype exhibit the reverse pattern.

Citation

Tang, Y., Cooke, M., & Valentini-Botinhao, C. (2015). Evaluating the predictions of objective intelligibility metrics for modified and synthetic speech. Computer Speech and Language, 35, 73-92. https://doi.org/10.1016/j.csl.2015.06.002

Journal Article Type Article
Acceptance Date Jun 11, 2015
Publication Date Jun 20, 2015
Deposit Date Nov 23, 2015
Journal Computer Speech & Language
Print ISSN 0885-2308
Publisher Elsevier
Volume 35
Pages 73-92
DOI https://doi.org/10.1016/j.csl.2015.06.002
Publisher URL http://dx.doi.org/10.1016/j.csl.2015.06.002
Related Public URLs http://www.journals.elsevier.com/computer-speech-and-language/
Additional Information Funders : EU
Projects : the LISTA Project
Grant Number: 256230

Downloadable Citations