Skip to main content

Research Repository

Advanced Search

Outputs (28)

Fast speech intelligibility estimation using a neural network trained via distillation (2020)
Presentation / Conference
Cox, T., Bailey, W., & Tang, Y. (2020, January). Fast speech intelligibility estimation using a neural network trained via distillation. Poster presented at 12th Speech in Noise Workshop, Toulouse, France

Objective measures of speech intelligibility have many uses, including the evaluation of degradation during transmission and the development of processing algorithms. One intrusive approach is to use a method based on the audibility of speech glimpse... Read More about Fast speech intelligibility estimation using a neural network trained via distillation.

Pupil dilation reveals changes in listening effort due to energetic and informational masking (2019)
Presentation / Conference
Woodcock, J., Fazenda, B., Cox, T., & Davies, W. (2019, September). Pupil dilation reveals changes in listening effort due to energetic and informational masking. Presented at ICA 2019, Aachen, Germany

Pupil dilation has previously been shown to be a useful involuntary marker of listening effort. An inverse relationship between pupil diameter and signal to noise ratio has been shown when speech is energetically masked by noise. The work reported he... Read More about Pupil dilation reveals changes in listening effort due to energetic and informational masking.

Background adaptation for improved listening experience in broadcasting (2019)
Presentation / Conference
Tang, Y., Cox, T., Fazenda, B., Liu, Q., & Wang, W. (2019, May). Background adaptation for improved listening experience in broadcasting. Presented at 44th International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2019, Brighton, UK

The intelligibility of speech in noise can be improved by modifying the speech. But with object-based audio, there is the possibility of altering the background sound while leaving the speech unaltered. This may prove a less intrusive approach, affor... Read More about Background adaptation for improved listening experience in broadcasting.

Generalisation in environmental sound classification : the ‘making sense of sounds’ data set and challenge (2019)
Presentation / Conference
Kroos, C., Bones, O., Cao, Y., Harris, L., Jackson, P., Davies, W., …Plumbley, M. (2019, May). Generalisation in environmental sound classification : the ‘making sense of sounds’ data set and challenge. Presented at 44th International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2019), Brighton, UK

Humans are able to identify a large number of environmental sounds and categorise them according to high-level semantic categories, e.g. urban sounds or music. They are also capable of generalising from past experience to new sounds when applying the... Read More about Generalisation in environmental sound classification : the ‘making sense of sounds’ data set and challenge.

Perceptual audio evaluation of media device orchestration using the multi-stimulus ideal profile method (2018)
Presentation / Conference
Wilson, A., Cox, T., Zacharov, N., & Pike, C. (2018, October). Perceptual audio evaluation of media device orchestration using the multi-stimulus ideal profile method. Presented at Audio Engineering Society 145th Convention, New York, USA

The evaluation of object-based audio reproduction methods in a real-world context remains a challenge as it is difficult to separate the effects of the reproduction system from the effects of the audio mix rendered for that system. This is often comp... Read More about Perceptual audio evaluation of media device orchestration using the multi-stimulus ideal profile method.

Improving intelligibility prediction under informational masking using an auditory saliency model (2018)
Presentation / Conference
Tang, Y., & Cox, T. (2018, September). Improving intelligibility prediction under informational masking using an auditory saliency model. Presented at International Conference on Digital Audio Effects, Aveiro, Portugal

The reduction of speech intelligibility in noise is usually dominated by energetic masking (EM) and informational masking (IM). Most state-of-the-art objective intelligibility measures (OIM) estimate intelligibility by quantifying EM. Few measures m... Read More about Improving intelligibility prediction under informational masking using an auditory saliency model.

Speech-to-screen : spatial separation of dialogue from noise towards improved speech intelligibility for the small screen (2018)
Presentation / Conference
Demonte, P., Tang, Y., Hughes, R., Cox, T., Fazenda, B., & Shirley, B. (2018, May). Speech-to-screen : spatial separation of dialogue from noise towards improved speech intelligibility for the small screen. Presented at 144th International Pro Audio Convention (AES Milan 2018), Milan, Italy

Can externalizing dialogue when in the presence of stereo background noise improve speech intelligibility? This has been investigated for audio over headphones using head-tracking in order to explore potential future developments for small-screen dev... Read More about Speech-to-screen : spatial separation of dialogue from noise towards improved speech intelligibility for the small screen.

An evidence-based soundscape taxonomy (2017)
Presentation / Conference
Bones, O., Cox, T., & Davies, W. (2017, July). An evidence-based soundscape taxonomy. Presented at 24th International Congress on Sound and Vibration ICSV24, London, UK

In an attempt to cultivate standardization in soundscape reporting Brown, Kang and Gjestland offered an influential schema by which the acoustic environment is divided initially into indoor and outdoor environments, and within each into further cate... Read More about An evidence-based soundscape taxonomy.

Extended simulations of wind noise contamination of amplitude modulation ratings (2017)
Presentation / Conference
von Hünerbein, S., Kendrick, P., & Cox, T. (2017, May). Extended simulations of wind noise contamination of amplitude modulation ratings. Presented at Wind Turbine Noise 2017, Rotterdam, NL

Microphone wind noise can corrupt outdoor measurements and recordings and especially the rating of Amplitude Modulation (AM) depth. In a previous study simulations of synthesised wind turbine sounds in wind noise have shown that even at relatively lo... Read More about Extended simulations of wind noise contamination of amplitude modulation ratings.

A glimpse-based approach for predicting binaural intelligibility with single and multiple maskers in anechoic conditions (2015)
Presentation / Conference
Tang, Y., Cooke, M., Fazenda, B., & Cox, T. (2015, September). A glimpse-based approach for predicting binaural intelligibility with single and multiple maskers in anechoic conditions. Presented at Interspeech 2015, Dresden, Germany

A distortion-weighted glimpsing metric developed for estimating monaural speech intelligibility is extended to predict binaural speech intelligibility in noise. Two aspects of binaural listen- ing, the better ear effect and the binaural advantage, ar... Read More about A glimpse-based approach for predicting binaural intelligibility with single and multiple maskers in anechoic conditions.