Skip to main content

Research Repository

Advanced Search

All Outputs (69)

The cadenza woodwind dataset: Synthesised quartets for music information retrieval and machine learning. (2024)
Journal Article

This paper presents the Cadenza Woodwind Dataset. This publicly available data is synthesised audio for woodwind quartets including renderings of each instrument in isolation. The data was created to be used as training data within Cadenza's second o... Read More about The cadenza woodwind dataset: Synthesised quartets for music information retrieval and machine learning..

Muddy, muddled, or muffled? Understanding the perception of audio quality in music by hearing aid users (2024)
Journal Article

Introduction: Previous work on audio quality evaluation has demonstrated a developing convergence of the key perceptual attributes underlying judgments of quality, such as timbral, spatial and technical attributes. However, across existing research t... Read More about Muddy, muddled, or muffled? Understanding the perception of audio quality in music by hearing aid users.

Improving the measurement and acoustic performance of transparent face masks and shields (2022)
Journal Article
Cox, T. J., Dodgson, G., Harris, L., Perugia, E., Stone, M. A., & Walsh, M. (2022). Improving the measurement and acoustic performance of transparent face masks and shields. ˜The œJournal of the Acoustical Society of America (Online), 151(5), 2931-2944. https://doi.org/10.1121/10.0010384

Opaque face masks harm communication by preventing speech-reading (lip-reading) and attenuating high-frequency sound. Although transparent masks and shields (visors) with clear plastic inserts allow speech-reading, they usually create more sound atte... Read More about Improving the measurement and acoustic performance of transparent face masks and shields.

Dataset of British English speech recordings for psychoacoustics and speech processing research : the Clarity Speech Corpus (2022)
Journal Article

This paper presents the Clarity Speech Corpus, a publicly available, forty speaker British English speech dataset. The corpus was created for the purpose of running listening tests to gauge speech intelligibility and quality in the Clarity Project, w... Read More about Dataset of British English speech recordings for psychoacoustics and speech processing research : the Clarity Speech Corpus.

Using scale modelling to assess the prehistoric acoustics of stonehenge (2020)
Journal Article
Cox, T., Fazenda, B., & Greaney, S. (2020). Using scale modelling to assess the prehistoric acoustics of stonehenge. Journal of Archaeological Science, 122, 105218. https://doi.org/10.1016/j.jas.2020.105218

With social rituals usually involving sound, an archaeological understanding of a site requires the acoustics to be assessed. This paper demonstrates how this can be done with acoustic scale models. Scale modelling is an established method in archite... Read More about Using scale modelling to assess the prehistoric acoustics of stonehenge.

The effects of classroom noise on the reading comprehension of adolescents (2019)
Journal Article
Connolly, D., Dockrell, J., Shield, B., Conetta, R., Mydlarz, C., & Cox, T. (2019). The effects of classroom noise on the reading comprehension of adolescents. ˜The œJournal of the Acoustical Society of America (Online), 145(1), 372-381. https://doi.org/10.1121/1.5087126

An investigation has been carried out to examine the impact of different levels of classroom noise on adolescents’ performance on reading and vocabulary-learning tasks. A total of 976 English high school pupils (564 aged 11 to 13 years and 412 aged 1... Read More about The effects of classroom noise on the reading comprehension of adolescents.

Elicitation of expert knowledge to inform object-based audio rendering to different systems (2018)
Journal Article
rendering to different systems. Journal of the Audio Engineering Society, 66(1/2), 44-59. https://doi.org/10.17743/jaes.2018.0001

Object-based audio presents the opportunity to optimise audio reproduction for different listening scenarios. Vector base amplitude panning (VBAP) is typically used to render object-based scenes. Optimizing this process based on knowledge of the perc... Read More about Elicitation of expert knowledge to inform object-based audio rendering to different systems.

Automatic speech-to-background ratio selection to maintain speech intelligibility in broadcasts using an objective intelligibility metric (2018)
Journal Article

While mixing, sound producers and audio professionals empirically set the speech-to-background ratio (SBR) based on rules of thumb and their own perception of sounds. There is no guarantee that the speech content will be intelligible for the general... Read More about Automatic speech-to-background ratio selection to maintain speech intelligibility in broadcasts using an objective intelligibility metric.

A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones (2017)
Journal Article

A non-intrusive method is introduced to predict binaural speech intelligibility in noise directly from signals captured using a pair of microphones. The approach combines signal processing techniques in blind source separation
and localisation, with... Read More about A non-intrusive method for estimating binaural speech intelligibility from noise-corrupted signals captured by a pair of microphones.

A study on the relationship between the intelligibility and quality of algorithmically-modified speech for normal hearing listeners (2017)
Journal Article
Tang, Y., Arnold, C., & Cox, T. (2017). A study on the relationship between the intelligibility and quality of algorithmically-modified speech for normal hearing listeners. Journal of Otorhinolaryngology, Hearing and Balance Medicine, 1(1), https://doi.org/10.3390/ohbm1010005

This study investigates the relationship between the intelligibility and quality of modified speech in noise and in quiet. Speech signals were processed by seven algorithms designed to increase speech intelligibility in noise without altering speech... Read More about A study on the relationship between the intelligibility and quality of algorithmically-modified speech for normal hearing listeners.

Metadiffusers : deep-subwavelength sound diffusers (2017)
Journal Article
Jiménez, N., Cox, T., Romero-García, V., & Groby, J. (2017). Metadiffusers : deep-subwavelength sound diffusers. Scientific reports, 7(1), 5389. https://doi.org/10.1038/s41598-017-05710-5

We present deep-subwavelength diffusing surfaces based on acoustic metamaterials, namely metadiffusers. These sound diffusers are rigidly backed slotted panels, with each slit being loaded by an array of Helmholtz resonators. Strong dispersion is pro... Read More about Metadiffusers : deep-subwavelength sound diffusers.

A user-centered taxonomy of factors contributing to the listener experience of reproduced audio (2017)
Journal Article
Woodcock, J., Davies, W., & Cox, T. (2017). A user-centered taxonomy of factors contributing to the listener experience of reproduced audio. ˜The œJournal of the Acoustical Society of America (Online), 141(5), 3464-3464. https://doi.org/10.1121/1.4987193

The traditional paradigm for the assessment of audio quality is that of a listener positioned in the geometric center of a standardized loudspeaker setup, fully attending to the reproduced sound scene. However, this is not how listeners generally int... Read More about A user-centered taxonomy of factors contributing to the listener experience of reproduced audio.

Clang, chitter, crunch : perceptual organisation of onomatopoeia (2017)
Journal Article
Bones, O., Davies, W., & Cox, T. (2017). Clang, chitter, crunch : perceptual organisation of onomatopoeia. ˜The œJournal of the Acoustical Society of America (Online), 141(5), 3694-3694. https://doi.org/10.1121/1.4988048

A method has been developed that utilizes a sound-sorting and labeling procedure, with correspondence analysis of participant-generated descriptive terms, to elicit perceptual categories of sound. Unlike many other methods for identifying perceptual... Read More about Clang, chitter, crunch : perceptual organisation of onomatopoeia.

Toward an evidence-based taxonomy of everyday sounds (2016)
Journal Article
Bones, O., Cox, T., & Davies, W. (2016). Toward an evidence-based taxonomy of everyday sounds. ˜The œJournal of the Acoustical Society of America (Online), 140(4), 3266-3266. https://doi.org/10.1121/1.4970357

An organizing account of everyday sounds could greatly simplify the management of audio data. The job of an audio database manager will typically involve assigning a combination of textual descriptors, and perhaps allocating to a predefined category.... Read More about Toward an evidence-based taxonomy of everyday sounds.

A metric for predicting binaural speech intelligibility in stationary noise and competing speech maskers (2016)
Journal Article

One criterion in the design of binaural sound scenes in audio production is the extent to which the intended speech message is correctly understood. Object-based audio broadcasting systems have permitted sound editors to gain more access to the metad... Read More about A metric for predicting binaural speech intelligibility in stationary noise and competing speech maskers.