P Kendrick
Monaural room acoustic parameters from music and speech
Kendrick, P; Cox, TJ; Li, FF; Zhang, Y; Chambers, JA
Abstract
This paper compares two methods for extracting room acoustic parameters from reverberated speech and music. An approach which uses statistical machine learning, previously developed for speech, is extended to work with music. For speech, reverberation time estimations are within a perceptual difference limen of the true value. For music, virtually all early decay time estimations are within a difference limen of the true value. The estimation accuracy is not good enough in other cases due to differences between the simulated data set used to develop the empirical model and real rooms. The second method carries out a maximum likelihood estimation on decay phases at the end of notes or speech utterances. This paper extends the method to estimate parameters relating to the balance of early and late energies in the impulse response. For reverberation time and speech, the method provides estimations which are within the perceptual difference limen of the true value. For other parameters such as clarity, the estimations are not sufficiently accurate due to the natural reverberance of the excitation signals. Speech is a better test signal than music because of the greater periods of silence in the signal, although music is needed for low frequency measurement.
Citation
Kendrick, P., Cox, T., Li, F., Zhang, Y., & Chambers, J. (2008). Monaural room acoustic parameters from music and speech. The Journal of the Acoustical Society of America (Online), 124(1), 278-287. https://doi.org/10.1121/1.2931960
Journal Article Type | Article |
---|---|
Acceptance Date | Apr 23, 2008 |
Publication Date | Jul 1, 2008 |
Deposit Date | Nov 25, 2009 |
Publicly Available Date | Jun 6, 2017 |
Journal | The Journal of the Acoustical Society of America (JASA) |
Print ISSN | 0001-4966 |
Peer Reviewed | Peer Reviewed |
Volume | 124 |
Issue | 1 |
Pages | 278-287 |
DOI | https://doi.org/10.1121/1.2931960 |
Keywords | architectural acoustics, learning (artificial intelligence), maximum likelihood estimation, musical acoustics, reverberation, speech |
Publisher URL | http://dx.doi.org/10.1121/1.2931960 |
Related Public URLs | http://asa.scitation.org/journal/jas |
Additional Information | Funders : Engineering and Physical Sciences Research Council (EPSRC) |
Files
Published Version
(415 Kb)
PDF
You might also like
Improving the measurement and acoustic performance of transparent face masks and shields
(2022)
Journal Article
Using scale modelling to assess the prehistoric acoustics of stonehenge
(2020)
Journal Article
Fast speech intelligibility estimation using a neural network trained via distillation
(2020)
Presentation / Conference
Pupil dilation reveals changes in listening effort due to energetic and informational masking
(2019)
Presentation / Conference
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search