Skip to main content

Research Repository

Advanced Search

Prof Trevor Cox's Outputs (172)

Speech-to-screen : spatial separation of dialogue from noise towards improved speech intelligibility for the small screen (2018)
Presentation / Conference
Demonte, P., Tang, Y., Hughes, R., Cox, T., Fazenda, B., & Shirley, B. (2018, May). Speech-to-screen : spatial separation of dialogue from noise towards improved speech intelligibility for the small screen. Presented at 144th International Pro Audio Convention (AES Milan 2018), Milan, Italy

Can externalizing dialogue when in the presence of stereo background noise improve speech intelligibility? This has been investigated for audio over headphones using head-tracking in order to explore potential future developments for small-screen dev... Read More about Speech-to-screen : spatial separation of dialogue from noise towards improved speech intelligibility for the small screen.

Elicitation of expert knowledge to inform object-based audio rendering to different systems (2018)
Journal Article
rendering to different systems. Journal of the Audio Engineering Society, 66(1/2), 44-59. https://doi.org/10.17743/jaes.2018.0001

Object-based audio presents the opportunity to optimise audio reproduction for different listening scenarios. Vector base amplitude panning (VBAP) is typically used to render object-based scenes. Optimizing this process based on knowledge of the perc... Read More about Elicitation of expert knowledge to inform object-based audio rendering to different systems.

Data for 'Automatic speech-to-background ratio selection to maintain speech intelligibility in broadcasts using an objective intelligibility metric' (2018)
Data

This repository contains the stimuli that were used in Expt. I, II and III and corresponding results in the following article.

Tang, Y., Fazenda, B.M. and Cox, T.J. (2018). "Automatic Speech-to-Background Ratio Selection to Maintain Speech Intelli... Read More about Data for 'Automatic speech-to-background ratio selection to maintain speech intelligibility in broadcasts using an objective intelligibility metric'.

Automatic speech-to-background ratio selection to maintain speech intelligibility in broadcasts using an objective intelligibility metric (2018)
Journal Article

While mixing, sound producers and audio professionals empirically set the speech-to-background ratio (SBR) based on rules of thumb and their own perception of sounds. There is no guarantee that the speech content will be intelligible for the general... Read More about Automatic speech-to-background ratio selection to maintain speech intelligibility in broadcasts using an objective intelligibility metric.

Data and supporting figures for JAES journal paper "Intelligent rendering of object-based audio: Elicitation of Expert Knowledge to Inform Object-Based Audio Rendering to Different Systems" (2018)
Data

This repository contains the stimuli and data underlying the JAES publication "Elicitation of Expert Knowledge to Inform Object-Based Audio Rendering to Different Systems".

The zip archive "stimuli.wav" contains the four clips that were used in th... Read More about Data and supporting figures for JAES journal paper "Intelligent rendering of object-based audio: Elicitation of Expert Knowledge to Inform Object-Based Audio Rendering to Different Systems".

Data for 'A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones' (2018)
Data

This repository contains the stimuli that were used to elicit listener responses of speech intelligibility in noise, and the implementation of the main components of the proposed method

Tang, Y., Liu, Q., Wang, W. and Cox, T. J. (2017). "A non-int... Read More about Data for 'A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones'.

Data for 'A study on the relationship between the intelligibility and quality of algorithmically-modified speech for normal hearing listeners' (2017)
Data

This repository contains the audio files that were used to elicit listener responses to speech intelligibility and quality, presented in the following work:

Tang, Y.; Arnold, C.; Cox, T.J. A Study on the Relationship between the Intelligibility an... Read More about Data for 'A study on the relationship between the intelligibility and quality of algorithmically-modified speech for normal hearing listeners'.

A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones (2017)
Journal Article

A non-intrusive method is introduced to predict binaural speech intelligibility in noise directly from signals captured using a pair of microphones. The approach combines signal processing techniques in blind source separation
and localisation, with... Read More about A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones.

A study on the relationship between the intelligibility and quality of algorithmically-modified speech for normal hearing listeners (2017)
Journal Article
Tang, Y., Arnold, C., & Cox, T. (2017). A study on the relationship between the intelligibility and quality of algorithmically-modified speech for normal hearing listeners. Journal of Otorhinolaryngology, Hearing and Balance Medicine, 1(1), https://doi.org/10.3390/ohbm1010005

This study investigates the relationship between the intelligibility and quality of modified speech in noise and in quiet. Speech signals were processed by seven algorithms designed to increase speech intelligibility in noise without altering speech... Read More about A study on the relationship between the intelligibility and quality of algorithmically-modified speech for normal hearing listeners.

An evidence-based soundscape taxonomy (2017)
Presentation / Conference
Bones, O., Cox, T., & Davies, W. (2017, July). An evidence-based soundscape taxonomy. Presented at 24th International Congress on Sound and Vibration ICSV24, London, UK

In an attempt to cultivate standardization in soundscape reporting Brown, Kang and Gjestland offered
an influential schema by which the acoustic environment is divided initially into indoor and outdoor environments, and within each into further cate... Read More about An evidence-based soundscape taxonomy.

Metadiffusers : deep-subwavelength sound diffusers (2017)
Journal Article
Jiménez, N., Cox, T., Romero-García, V., & Groby, J. (2017). Metadiffusers : deep-subwavelength sound diffusers. Scientific reports, 7(1), 5389. https://doi.org/10.1038/s41598-017-05710-5

We present deep-subwavelength diffusing surfaces based on acoustic metamaterials, namely metadiffusers. These sound diffusers are rigidly backed slotted panels, with each slit being loaded by an array of Helmholtz resonators. Strong dispersion is pro... Read More about Metadiffusers : deep-subwavelength sound diffusers.

Clang, chitter, crunch : perceptual organisation of onomatopoeia (2017)
Journal Article
Bones, O., Davies, W., & Cox, T. (2017). Clang, chitter, crunch : perceptual organisation of onomatopoeia. ˜The œJournal of the Acoustical Society of America (Online), 141(5), 3694-3694. https://doi.org/10.1121/1.4988048

A method has been developed that utilizes a sound-sorting and labeling procedure, with correspondence analysis of participant-generated descriptive terms, to elicit perceptual categories of sound. Unlike many other methods for identifying perceptual... Read More about Clang, chitter, crunch : perceptual organisation of onomatopoeia.

A user-centered taxonomy of factors contributing to the listener experience of reproduced audio (2017)
Journal Article
Woodcock, J., Davies, W., & Cox, T. (2017). A user-centered taxonomy of factors contributing to the listener experience of reproduced audio. ˜The œJournal of the Acoustical Society of America (Online), 141(5), 3464-3464. https://doi.org/10.1121/1.4987193

The traditional paradigm for the assessment of audio quality is that of a listener positioned in the geometric center of a standardized loudspeaker setup, fully attending to the reproduced sound scene. However, this is not how listeners generally int... Read More about A user-centered taxonomy of factors contributing to the listener experience of reproduced audio.

Extended simulations of wind noise contamination of amplitude modulation ratings (2017)
Presentation / Conference
von Hünerbein, S., Kendrick, P., & Cox, T. (2017, May). Extended simulations of wind noise contamination of amplitude modulation ratings. Presented at Wind Turbine Noise 2017, Rotterdam, NL

Microphone wind noise can corrupt outdoor measurements and recordings and especially the rating of Amplitude Modulation (AM) depth. In a previous study simulations of synthesised wind turbine sounds in wind noise have shown that even at relatively lo... Read More about Extended simulations of wind noise contamination of amplitude modulation ratings.

Results and audio materials from an experiment into the cognitive categorisation of everyday sounds within urban soundscapes (2017)
Data

This fileset contains data relating to the Applied Acoustics paper "A cognitive framework for the categorisation of auditory objects in urban soundscapes"

The zip file "data.zip" contains the following:

./soundscape_recordings contains the pro... Read More about Results and audio materials from an experiment into the cognitive categorisation of everyday sounds within urban soundscapes.