Loudness differences for Voice-over-Voice audio in TV and streaming
Geary, D; Torcoli, M; Paulus, J; Simon, C; Straninger, D; Travaglini, A; Shirley, BG
Authors
D Geary
M Torcoli
J Paulus
C Simon
D Straninger
A Travaglini
Dr Ben Shirley B.G.Shirley@salford.ac.uk
Associate Professor/Reader
Abstract
Voice-over-Voice (VoV) is a common mixing practice observed in news reports and documentaries, where a foreground voice is mixed on top of a background voice, e.g., to translate an interview. This is achieved by ducking the background voice so that the foreground voice is more intelligible, while still allowing the listener to perceive the presence and tone of the background voice. Currently there is little published research on ducking practices for VoV or on technical details such as the Loudness Difference (LD) between foreground and background speech. This paper investigates the ducking practices of nine expert audio engineers and the preferred LDs of 13 non-expert listeners of ages 57 years and older. Results highlight a clear difference between the LDs used by the experts and those preferred by the non-expert listeners. Experts tended toward LDs of 11.5–17 LU, while non-experts preferred a range of 20–30 LU. Based on these results, a minimum LD of 20 LU is recommended for VoV. High inter-subject variance due to personal preference was observed. This variance makes a substantial case for the introduction of personalization in broadcast and streaming. The audiovisual material used for the tests is provided at https://www.audiolabs-erlangen.de/resources/2020-VoV-DB.
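As a rough illustration of the arithmetic behind the recommended minimum LD, the sketch below computes how much a background voice would need to be attenuated to reach a target LD. It is not taken from the paper. It assumes that the integrated loudness of each voice (in LUFS) has already been measured elsewhere, that LD is the foreground loudness minus the background loudness, and that a static gain of g dB shifts the measured loudness by g LU. The function names and the example loudness values are hypothetical.

```python
def loudness_difference(fg_lufs: float, bg_lufs: float) -> float:
    """Loudness Difference (LD) in LU: foreground minus background loudness."""
    return fg_lufs - bg_lufs


def ducking_gain_db(fg_lufs: float, bg_lufs: float,
                    target_ld_lu: float = 20.0) -> float:
    """Gain in dB to apply to the background so the LD reaches the target.

    Returns 0.0 when the LD already meets or exceeds the target, and a
    negative value (attenuation) otherwise. Assumes a static gain of g dB
    changes the measured loudness by g LU.
    """
    shortfall = target_ld_lu - loudness_difference(fg_lufs, bg_lufs)
    return min(0.0, -shortfall)


def db_to_linear(gain_db: float) -> float:
    """Convert a gain in dB to a linear amplitude factor."""
    return 10.0 ** (gain_db / 20.0)


# Hypothetical measurements: foreground at -23 LUFS, background at -37 LUFS.
fg, bg = -23.0, -37.0
gain = ducking_gain_db(fg, bg)  # LD is 14 LU, so 6 dB more attenuation is needed
print(f"LD = {loudness_difference(fg, bg):.1f} LU, extra ducking = {gain:.1f} dB")
print(f"linear factor for background samples = {db_to_linear(gain):.3f}")
```

With these example values the background sits 14 LU below the foreground, inside the range the experts tended to use, so a further 6 dB of attenuation would be needed to reach the 20 LU minimum. Given the high inter-subject variance reported, `target_ld_lu` would be a natural parameter to expose for personalization.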
Journal Article Type | Article |
---|---|
Acceptance Date | Sep 21, 2020 |
Online Publication Date | Dec 21, 2020 |
Publication Date | Nov 1, 2020 |
Deposit Date | Apr 21, 2021 |
Publicly Available Date | Apr 21, 2021 |
Journal | Journal of the Audio Engineering Society |
Print ISSN | 1549-4950 |
Publisher | Audio Engineering Society |
Volume | 68 |
Issue | 11 |
Pages | 810-818 |
DOI | https://doi.org/10.17743/jaes.2020.0022 |
Publisher URL | https://doi.org/10.17743/jaes.2020.0022 |
Related Public URLs | http://www.aes.org/journal/ |
Additional Information | Corporate Creators : Fraunhofer Institute for Integrated Circuits IIS, Erlangen, Germany |
Files
VoV_Loudness_JAES_Author_Copy_revised.pdf
(151 Kb)
PDF