Dr Bruno Fazenda B.M.Fazenda@salford.ac.uk
Associate Professor/Reader
Perception and automated assessment of audio
quality in user generated content
Fazenda, BM; Jackson, I; Kendrick, P; Cox, TJ; Li, FF
Authors
I Jackson
P Kendrick
Prof Trevor Cox T.J.Cox@salford.ac.uk
Professor
FF Li
Abstract
Technology to record sound, available in personal devices such as smartphones or video recording devices, is now ubiquitous. However, the production quality of the sound on this user-generated content is often very poor: distorted, noisy, with garbled speech or indistinct music. Our interest lies in the causes of the poor recording, especially what happens between the sound source and the electronic signal emerging from the microphone, and finding an automated method to warn the user of such problems. Typical problems, such as distortion, wind noise, microphone handling noise and frequency response, were tested. A perceptual model has been developed from subjective tests on the perceived quality of such errors and data measured from a training dataset composed of various audio files. It is shown that perceived quality is associated with distortion and frequency response, with wind and handling noise being just slightly less important. In addition, the contextual content of the audio sample was found to modulate perceived quality at similar levels to degradations such as wind and rendering those introduced by handling noise negligible.
Presentation Conference Type | Conference Paper (published) |
---|---|
Start Date | Jul 1, 2016 |
Publication Date | Jun 27, 2016 |
Deposit Date | Nov 17, 2016 |
Book Title | Quality of Multimedia Experience (QoMEX), 2016 Eighth International Conference on 6-8 June 2016 |
ISBN | 9781509003549 |
DOI | https://doi.org/10.1109/QoMEX.2016.7498974 |
Publisher URL | http://dx.doi.org/10.1109/QoMEX.2016.7498974 |
Related Public URLs | http://ieeexplore.ieee.org/xpl/mostRecentIssue.jsp?punumber=7495595 |
Additional Information | Event Type : Conference |
You might also like
Using scale modelling to assess the prehistoric acoustics of stonehenge
(2020)
Journal Article
Misleading description of first and second order ambisonic systems
(2020)
Journal Article
Pupil dilation reveals changes in listening effort due to energetic and informational masking
(2019)
Presentation / Conference
Adding the room to the mix : perceptual aspects of modal resonance in live audio
(2019)
Presentation / Conference