Monaural room acoustic parameters from music and speech

Kendrick, P; Cox, TJ; Li, FF; Zhang, Y; Chambers, JA

doi:10.1121/1.2931960

Monaural room acoustic parameters from music and speech

Kendrick, P; Cox, TJ; Li, FF; Zhang, Y; Chambers, JA

Authors

P Kendrick

Prof Trevor Cox T.J.Cox@salford.ac.uk
Professor

FF Li

Y Zhang

JA Chambers

Abstract

This paper compares two methods for extracting room acoustic parameters from reverberated speech and music. An approach which uses statistical machine learning, previously developed for speech, is extended to work with music. For speech, reverberation time estimations are within a perceptual difference limen of the true value. For music, virtually all early decay time estimations are within a difference limen of the true value. The estimation accuracy is not good enough in other cases due to differences between the simulated data set used to develop the empirical model and real rooms. The second method carries out a maximum likelihood estimation on decay phases at the end of notes or speech utterances. This paper extends the method to estimate parameters relating to the balance of early and late energies in the impulse response. For reverberation time and speech, the method provides estimations which are within the perceptual difference limen of the true value. For other parameters such as clarity, the estimations are not sufficiently accurate due to the natural reverberance of the excitation signals. Speech is a better test signal than music because of the greater periods of silence in the signal, although music is needed for low frequency measurement.

Citation

Kendrick, P., Cox, T., Li, F., Zhang, Y., & Chambers, J. (2008). Monaural room acoustic parameters from music and speech. The Journal of the Acoustical Society of America (Online), 124(1), 278-287. https://doi.org/10.1121/1.2931960

Journal Article Type	Article
Acceptance Date	Apr 23, 2008
Publication Date	Jul 1, 2008
Deposit Date	Nov 25, 2009
Publicly Available Date	Jun 6, 2017
Journal	The Journal of the Acoustical Society of America (JASA)
Print ISSN	0001-4966
Peer Reviewed	Peer Reviewed
Volume	124
Issue	1
Pages	278-287
DOI	https://doi.org/10.1121/1.2931960
Keywords	architectural acoustics, learning (artificial intelligence), maximum likelihood estimation, musical acoustics, reverberation, speech
Publisher URL	http://dx.doi.org/10.1121/1.2931960
Related Public URLs	http://asa.scitation.org/journal/jas
Additional Information	Funders : Engineering and Physical Sciences Research Council (EPSRC)