Research Repository

All Outputs (124)

Speech-to-screen : spatial separation of dialogue from noise towards improved speech intelligibility for the small screen (2018)
Presentation / Conference
Demonte, P., Tang, Y., Hughes, R., Cox, T., Fazenda, B., & Shirley, B. (2018, May). Speech-to-screen : spatial separation of dialogue from noise towards improved speech intelligibility for the small screen. Presented at 144th International Pro Audio Convention (AES Milan 2018), Milan, Italy

Can externalizing dialogue when in the presence of stereo background noise improve speech intelligibility? This has been investigated for audio over headphones using head-tracking in order to explore potential future developments for small-screen dev...

Elicitation of expert knowledge to inform object-based audio rendering to different systems (2018)
Journal Article
rendering to different systems. Journal of the Audio Engineering Society, 66(1/2), 44-59. https://doi.org/10.17743/jaes.2018.0001

Object-based audio presents the opportunity to optimise audio reproduction for different listening scenarios. Vector base amplitude panning (VBAP) is typically used to render object-based scenes. Optimizing this process based on knowledge of the perc...

Automatic speech-to-background ratio selection to maintain speech intelligibility in broadcasts using an objective intelligibility metric (2018)
Journal Article
Tang, Y., Fazenda, B., & Cox, T. (2018). Automatic speech-to-background ratio selection to maintain speech intelligibility in broadcasts using an objective intelligibility metric. Applied Sciences, 8(1), 59. https://doi.org/10.3390/app8010059

While mixing, sound producers and audio professionals empirically set the speech-to-background ratio (SBR) based on rules of thumb and their own perception of sounds. There is no guarantee that the speech content will be intelligible for the general...

An audio-visual system for object-based audio : from recording to listening (2018)
Journal Article
Coleman, P., Franck, A., Francombe, J., Liu, Q., de Campos, T., Hughes, R., …Hilton, A. (2018). An audio-visual system for object-based audio : from recording to listening. IEEE Transactions on Multimedia, 20(8), 1919-1931. https://doi.org/10.1109/TMM.2018.2794780

Object-based audio is an emerging representation for audio content, where content is represented in a reproduction format-agnostic way and, thus, produced once for consumption on many different kinds of devices. This affords new opportunities for im...

A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones (2017)
Journal Article
Tang, Y., Liu, Q., Wang, W., & Cox, T. (2018). A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones. Speech Communication, 96, 116-128. https://doi.org/10.1016/j.specom.2017.12.005

A non-intrusive method is introduced to predict binaural speech intelligibility in noise directly from signals captured using a pair of microphones. The approach combines signal processing techniques in blind source separation and localisation, with...

A study on the relationship between the intelligibility and quality of algorithmically-modified speech for normal hearing listeners (2017)
Journal Article
Tang, Y., Arnold, C., & Cox, T. (2017). A study on the relationship between the intelligibility and quality of algorithmically-modified speech for normal hearing listeners. Journal of Otorhinolaryngology, Hearing and Balance Medicine, 1(1), https://doi.org/10.3390/ohbm1010005

This study investigates the relationship between the intelligibility and quality of modified speech in noise and in quiet. Speech signals were processed by seven algorithms designed to increase speech intelligibility in noise without altering speech...

An evidence-based soundscape taxonomy (2017)
Presentation / Conference
Bones, O., Cox, T., & Davies, W. (2017, July). An evidence-based soundscape taxonomy. Presented at 24th International Congress on Sound and Vibration ICSV24, London, UK

In an attempt to cultivate standardization in soundscape reporting Brown, Kang and Gjestland offered an influential schema by which the acoustic environment is divided initially into indoor and outdoor environments, and within each into further cate...

Metadiffusers : deep-subwavelength sound diffusers (2017)
Journal Article
Jiménez, N., Cox, T., Romero-García, V., & Groby, J. (2017). Metadiffusers : deep-subwavelength sound diffusers. Scientific Reports, 7(1), 5389. https://doi.org/10.1038/s41598-017-05710-5

We present deep-subwavelength diffusing surfaces based on acoustic metamaterials, namely metadiffusers. These sound diffusers are rigidly backed slotted panels, with each slit being loaded by an array of Helmholtz resonators. Strong dispersion is pro...

Clang, chitter, crunch : perceptual organisation of onomatopoeia (2017)
Journal Article
Bones, O., Davies, W., & Cox, T. (2017). Clang, chitter, crunch : perceptual organisation of onomatopoeia. The Journal of the Acoustical Society of America (Online), 141(5), 3694-3694. https://doi.org/10.1121/1.4988048

A method has been developed that utilizes a sound-sorting and labeling procedure, with correspondence analysis of participant-generated descriptive terms, to elicit perceptual categories of sound. Unlike many other methods for identifying perceptual...

A user-centered taxonomy of factors contributing to the listener experience of reproduced audio (2017)
Journal Article
Woodcock, J., Davies, W., & Cox, T. (2017). A user-centered taxonomy of factors contributing to the listener experience of reproduced audio. The Journal of the Acoustical Society of America (Online), 141(5), 3464-3464. https://doi.org/10.1121/1.4987193

The traditional paradigm for the assessment of audio quality is that of a listener positioned in the geometric center of a standardized loudspeaker setup, fully attending to the reproduced sound scene. However, this is not how listeners generally int...

Extended simulations of wind noise contamination of amplitude modulation ratings (2017)
Presentation / Conference
von Hünerbein, S., Kendrick, P., & Cox, T. (2017, May). Extended simulations of wind noise contamination of amplitude modulation ratings. Presented at Wind Turbine Noise 2017, Rotterdam, NL

Microphone wind noise can corrupt outdoor measurements and recordings and especially the rating of Amplitude Modulation (AM) depth. In a previous study simulations of synthesised wind turbine sounds in wind noise have shown that even at relatively lo...

A cognitive framework for the categorisation of auditory objects in urban soundscapes (2017)
Journal Article
Woodcock, J., Davies, W., & Cox, T. (2017). A cognitive framework for the categorisation of auditory objects in urban soundscapes. Applied Acoustics, 121, 56-64. https://doi.org/10.1016/j.apacoust.2017.01.027

Categorisation is a fundamental cognitive process that plays a central role in everyday behaviour and action. Whereas previous studies have investigated the categorisation of isolated everyday sounds, this paper presents an experiment to investiga...

Toward an evidence-based taxonomy of everyday sounds (2016)
Journal Article
Bones, O., Cox, T., & Davies, W. (2016). Toward an evidence-based taxonomy of everyday sounds. The Journal of the Acoustical Society of America (Online), 140(4), 3266-3266. https://doi.org/10.1121/1.4970357

An organizing account of everyday sounds could greatly simplify the management of audio data. The job of an audio database manager will typically involve assigning a combination of textual descriptors, and perhaps allocating to a predefined category....

A metric for predicting binaural speech intelligibility in stationary noise and competing speech maskers (2016)
Journal Article
Tang, Y., Cooke, M., Fazenda, B., & Cox, T. (2016). A metric for predicting binaural speech intelligibility in stationary noise and competing speech maskers. The Journal of the Acoustical Society of America (Online), 140(3), 1858-1870. https://doi.org/10.1121/1.4962484

One criterion in the design of binaural sound scenes in audio production is the extent to which the intended speech message is correctly understood. Object-based audio broadcasting systems have permitted sound editors to gain more access to the metad...

Acoustic absorbers and diffusers : theory, design and application [third edition] (2016)
Book
Cox, T., & D'Antonio, P. (2016). Acoustic absorbers and diffusers : theory, design and application [third edition]. Boca Raton: CRC Press

This definitive guide covers the design and application of absorbers and diffusers in acoustics. Surface diffusion is a relatively young subject area, and diffuser design, application and characterisation are often not well understood. Although there...

The effect of microphone wind noise on the amplitude modulation of wind turbine noise and its mitigation (2016)
Journal Article
Kendrick, P., von Hünerbein, S., & Cox, T. (2016). The effect of microphone wind noise on the amplitude modulation of wind turbine noise and its mitigation. The Journal of the Acoustical Society of America (Online), 140(1), EL79. https://doi.org/10.1121/1.4955010

Microphone wind noise can corrupt outdoor recordings even when wind shields are used. When monitoring wind turbine noise, microphone wind noise is almost inevitable because measurements cannot be made in still conditions. The effect of microphone win...

Categorization of broadcast audio objects in complex auditory scenes (2016)
Journal Article
Woodcock, J., Davies, W., Cox, T., & Melchior, F. (2016). Categorization of broadcast audio objects in complex auditory scenes. Journal of the Audio Engineering Society, 64(6), 380-394. https://doi.org/10.17743/jaes.2016.0007

This paper presents a series of experiments to determine a categorization framework for broadcast audio objects. Object-based audio is becoming an ever more important paradigm for the representation of complex sound scenes. However, there is a lack of...

Perception and automated assessment of audio quality in user generated content (2016)
Conference Proceeding
quality in user generated content. In Quality of Multimedia Experience (QoMEX), 2016 Eighth International Conference on 6-8 June 2016. https://doi.org/10.1109/QoMEX.2016.7498974

Technology to record sound, available in personal devices such as smartphones or video recording devices, is now ubiquitous. However, the production quality of the sound on this user-generated content is often very poor: distorted, noisy, with garble...

Evaluating a distortion-weighted glimpsing metric for predicting binaural speech intelligibility in rooms (2016)
Journal Article
Tang, Y., Hughes, R., Fazenda, B., & Cox, T. (2016). Evaluating a distortion-weighted glimpsing metric for predicting binaural speech intelligibility in rooms. Speech Communication, 82, 26-37. https://doi.org/10.1016/j.specom.2016.04.003

A distortion-weighted glimpse proportion metric (BiDWGP) for predicting binaural speech intelligibility was evaluated in simulated anechoic and reverberant conditions, with and without a noise masker. The predictive performance of BiDWGP was compare...