Skip to main content

Research Repository

Advanced Search

All Outputs (8)

Perceptual audio evaluation of media device orchestration using the multi-stimulus ideal profile method (2018)
Presentation / Conference
Wilson, A., Cox, T., Zacharov, N., & Pike, C. (2018, October). Perceptual audio evaluation of media device orchestration using the multi-stimulus ideal profile method. Presented at Audio Engineering Society 145th Convention, New York, USA

The evaluation of object-based audio reproduction methods in a real-world context remains a challenge as it is difficult to separate the effects of the reproduction system from the effects of the audio mix rendered for that system. This is often comp... Read More about Perceptual audio evaluation of media device orchestration using the multi-stimulus ideal profile method.

Improving intelligibility prediction under informational masking using an auditory saliency model (2018)
Presentation / Conference
Tang, Y., & Cox, T. (2018, September). Improving intelligibility prediction under informational masking using an auditory saliency model. Presented at International Conference on Digital Audio Effects, Aveiro, Portugal

The reduction of speech intelligibility in noise is usually dominated by energetic masking (EM) and informational masking (IM). Most state-of-the-art objective intelligibility measures (OIM) estimate intelligibility by quantifying EM. Few measures m... Read More about Improving intelligibility prediction under informational masking using an auditory saliency model.

Sound categories : category formation and evidence-based taxonomies (2018)
Journal Article
Bones, O., Cox, T., & Davies, W. (2018). Sound categories : category formation and evidence-based taxonomies. Frontiers in Psychology, 9, #1277. https://doi.org/10.3389/fpsyg.2018.01277

Five evidence-based taxonomies of everyday sounds frequently reported in the soundscape literature have been generated. An online sorting and category-labelling method that elicits rather than prescribes descriptive words was used. A total of N=242 p... Read More about Sound categories : category formation and evidence-based taxonomies.

Qualitative evaluation of media device orchestration for immersive spatial audio reproduction (2018)
Journal Article
Francombe, J., Woodcock, J., Hughes, R., Mason, R., Franck, A., Pike, C., …Hilton, A. (2018). Qualitative evaluation of media device orchestration for immersive spatial audio reproduction. Journal of the Audio Engineering Society, 66(6), 414-429. https://doi.org/10.17743/jaes.2018.0027

The challenge of installing and setting up dedicated spatial audio systems can make it difficult to deliver immersive listening experiences to the general public. However, the proliferation of smart mobile devices and the rise of the Internet of Thin... Read More about Qualitative evaluation of media device orchestration for immersive spatial audio reproduction.

Speech-to-screen : spatial separation of dialogue from noise towards improved speech intelligibility for the small screen (2018)
Presentation / Conference
Demonte, P., Tang, Y., Hughes, R., Cox, T., Fazenda, B., & Shirley, B. (2018, May). Speech-to-screen : spatial separation of dialogue from noise towards improved speech intelligibility for the small screen. Presented at 144th International Pro Audio Convention (AES Milan 2018), Milan, Italy

Can externalizing dialogue when in the presence of stereo background noise improve speech intelligibility? This has been investigated for audio over headphones using head-tracking in order to explore potential future developments for small-screen dev... Read More about Speech-to-screen : spatial separation of dialogue from noise towards improved speech intelligibility for the small screen.

Elicitation of expert knowledge to inform object-based audio rendering to different systems (2018)
Journal Article
rendering to different systems. Journal of the Audio Engineering Society, 66(1/2), 44-59. https://doi.org/10.17743/jaes.2018.0001

Object-based audio presents the opportunity to optimise audio reproduction for different listening scenarios. Vector base amplitude panning (VBAP) is typically used to render object-based scenes. Optimizing this process based on knowledge of the perc... Read More about Elicitation of expert knowledge to inform object-based audio rendering to different systems.

Automatic speech-to-background ratio selection to maintain speech intelligibility in broadcasts using an objective intelligibility metric (2018)
Journal Article
Tang, Y., Fazenda, B., & Cox, T. (2018). Automatic speech-to-background ratio selection to maintain speech intelligibility in broadcasts using an objective intelligibility metric. Applied Sciences, 8(1), 59. https://doi.org/10.3390/app8010059

While mixing, sound producers and audio professionals empirically set the speech-to-background ratio (SBR) based on rules of thumb and their own perception of sounds. There is no guarantee that the speech content will be intelligible for the general... Read More about Automatic speech-to-background ratio selection to maintain speech intelligibility in broadcasts using an objective intelligibility metric.

An audio-visual system for object-based audio : from recording to listening (2018)
Journal Article
Coleman, P., Franck, A., Francombe, J., Liu, Q., de Campos, T., Hughes, R., …Hilton, A. (2018). An audio-visual system for object-based audio : from recording to listening. IEEE Transactions on Multimedia, 20(8), 1919-1931. https://doi.org/10.1109/TMM.2018.2794780

Object-based audio is an emerging representation for audio content, where content is represented in a reproduction format-agnostic way and, thus, produced once for consumption on many different kinds of devices. This affords new opportunities for im... Read More about An audio-visual system for object-based audio : from recording to listening.