AD Wilson
Evaluation and modelling of perceived audio quality in popular music, towards intelligent music production
Wilson, AD
Abstract
This thesis addresses three fundamental questions: What is mixing? What makes a high-quality mix? How can high-quality mixes be automatically generated? While these may seem essential to the very foundations of intelligent music production, this thesis argues that they have not been sufficiently addressed in previous studies. An important contribution is the questioning of previously-held definitions of a 'mix'. Experiments were conducted in which participants used traditional mixing interfaces to create mixes using gain, panning and equalisation. The data was analysed in a novel 'mix-space', 'panning-space' and 'tone-space' in order to determine if there is a consensus in how these tools are used. Methods were developed to create mixes by populating the mix-space according to parametric models. These mixes were characterised by signal features, the distributions of which suggest tolerance bounds for automated mixing systems. This was complemented by a study of real-world music mixes, containing hundreds of mixes each for ten songs, collected from on-line communities. Mixes were shown to vary along four dimensions: loudness/dynamics, brightness, bass and stereo width. The variations between individual mix engineers were also studied, indicating a small effect of the mix engineer on mix preference ratings (eta2 = 0.021). Perceptual audio evaluation revealed that listeners appreciate 'quality' in a variety of ways, depending on the circumstances. In commercially-released music, 'quality' was related to the loudness/dynamic dimension. In mixes, 'quality' is highly correlated with 'preference'. To create mixes which maximised perceived quality, a novel semi-automatic mixing system was developed using evolutionary computation, wherein a population of mixes, generated in the mix-space, is guided by the subjective evaluations of the listener. This system was evaluated by a panel of users, who used it to create their ideal mixes, rather than the technically-correct mixes which previous systems strived for. It is hoped that this thesis encourages the community to pursue subjectively motivated methods when designing systems for music-mixing.
Citation
Wilson, A. Evaluation and modelling of perceived audio quality in popular music, towards intelligent music production. (Thesis). University of Salford
Thesis Type | Thesis |
---|---|
Deposit Date | Jan 19, 2018 |
Publicly Available Date | Jan 19, 2018 |
Files
AlexWilson - PhD Thesis - FINAL.pdf
(10 Mb)
PDF
You might also like
Using scale modelling to assess the prehistoric acoustics of stonehenge
(2020)
Journal Article
Misleading description of first and second order ambisonic systems
(2020)
Journal Article
Pupil dilation reveals changes in listening effort due to energetic and informational masking
(2019)
Presentation / Conference
Adding the room to the mix : perceptual aspects of modal resonance in live audio
(2019)
Presentation / Conference
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search