Skip to main content

Research Repository

Advanced Search

Monaural room acoustic parameters from music and speech

Kendrick, P; Cox, TJ; Li, FF; Zhang, Y; Chambers, JA

Monaural room acoustic parameters from music and speech Thumbnail


Authors

P Kendrick

FF Li

Y Zhang

JA Chambers



Abstract

This paper compares two methods for extracting room acoustic parameters from reverberated speech and music. An approach which uses statistical machine learning, previously developed for speech, is extended to work with music. For speech, reverberation time estimations are within a perceptual difference limen of the true value. For music, virtually all early decay time estimations are within a difference limen of the true value. The estimation accuracy is not good enough in other cases due to differences between the simulated data set used to develop the empirical model and real rooms. The second method carries out a maximum likelihood estimation on decay phases at the end of notes or speech utterances. This paper extends the method to estimate parameters relating to the balance of early and late energies in the impulse response. For reverberation time and speech, the method provides estimations which are within the perceptual difference limen of the true value. For other parameters such as clarity, the estimations are not sufficiently accurate due to the natural reverberance of the excitation signals. Speech is a better test signal than music because of the greater periods of silence in the signal, although music is needed for low frequency measurement.

Citation

Kendrick, P., Cox, T., Li, F., Zhang, Y., & Chambers, J. (2008). Monaural room acoustic parameters from music and speech. ˜The œJournal of the Acoustical Society of America (Online), 124(1), 278-287. https://doi.org/10.1121/1.2931960

Journal Article Type Article
Acceptance Date Apr 23, 2008
Publication Date Jul 1, 2008
Deposit Date Nov 25, 2009
Publicly Available Date Jun 6, 2017
Journal The Journal of the Acoustical Society of America (JASA)
Print ISSN 0001-4966
Peer Reviewed Peer Reviewed
Volume 124
Issue 1
Pages 278-287
DOI https://doi.org/10.1121/1.2931960
Keywords architectural acoustics, learning (artificial intelligence), maximum likelihood estimation, musical acoustics, reverberation, speech
Publisher URL http://dx.doi.org/10.1121/1.2931960
Related Public URLs http://asa.scitation.org/journal/jas
Additional Information Funders : Engineering and Physical Sciences Research Council (EPSRC)

Files






You might also like



Downloadable Citations