JS Woodcock
Elicitation of expert knowledge to inform object-based audio
rendering to different systems
Woodcock, JS; Davies, WJ; Melchior, F; Cox, TJ
Authors
Prof Bill Davies W.Davies@salford.ac.uk
Professor
F Melchior
Prof Trevor Cox T.J.Cox@salford.ac.uk
Professor
Abstract
Object-based audio presents the opportunity to optimise audio reproduction for different listening scenarios. Vector base amplitude panning (VBAP) is typically used to render object-based scenes. Optimizing this process based on knowledge of the perception and practices of experts could result in significant improvements to the end user's listening experience. An experiment was conducted to investigate how content creators perceive changes in the perceptual attributes of the same content rendered to systems with different numbers of channels, and to determine what they would do differently to standard VBAP and matrix based downmixes to minimize these changes. Text mining and clustering of the content creators' responses revealed 6 general mix processes: the spatial spread of individual objects, EQ and processing, reverberation, position, bass, and level. Logistic regression models show the relationships between the mix processes, perceived changes in perceptual attributes, and the rendering method/speaker layout. The relative frequency of use for the different mix processes was found to differ between categories of audio object suggesting that any downmix rules should be object category specific. These results give insight into how object-based audio can be used to improve listener experience and provide the first template for doing this across different reproduction systems.
Citation
rendering to different systems. Journal of the Audio Engineering Society, 66(1/2), 44-59. https://doi.org/10.17743/jaes.2018.0001
Journal Article Type | Article |
---|---|
Acceptance Date | Jan 3, 2018 |
Online Publication Date | Feb 14, 2018 |
Publication Date | Feb 14, 2018 |
Deposit Date | Jan 5, 2018 |
Publicly Available Date | Mar 1, 2018 |
Journal | Journal of the Audio Engineering Society |
Print ISSN | 1549-4950 |
Publisher | Audio Engineering Society |
Volume | 66 |
Issue | 1/2 |
Pages | 44-59 |
DOI | https://doi.org/10.17743/jaes.2018.0001 |
Publisher URL | http://dx.doi.org/10.17743/jaes.2018.0001 |
Related Public URLs | http://www.aes.org/journal/ |
Additional Information | Funders : Engineering and Physical Sciences Research Council (EPSRC) Projects : S3A Grant Number: EP/L000539/1 |
Files
JWoodcock_JAES_manuscript.pdf
(558 Kb)
PDF
19375.pdf
(404 Kb)
PDF
Licence
http://creativecommons.org/licenses/by/4.0/
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
You might also like
Improving the measurement and acoustic performance of transparent face masks and shields
(2022)
Journal Article
Using scale modelling to assess the prehistoric acoustics of stonehenge
(2020)
Journal Article
Fast speech intelligibility estimation using a neural network trained via distillation
(2020)
Presentation / Conference
Pupil dilation reveals changes in listening effort due to energetic and informational masking
(2019)
Presentation / Conference
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search