PJ Duncan
Audio information extraction from arbitrary sound recordings
Duncan, PJ; Mohammed, DY; Li, FF
Authors
DY Mohammed
FF Li
Abstract
Numerous archives of entertainment soundtracks and other recordings such as environmental noise samples have imposed a big data challenge in audio related industries. This necessitates the use of machine audition and retrieval tools to extract semantic information for various applications. Speech recognition, environmental noise classification and music information retrieval tools haven been developed in the past for specific purposes. Combined use of these tools to process arbitrary sound recordings remains challenging: overlap of diverse sources mitigates the classification, resulting in poor recognition and/or missing content. Following a review of a universal framework for arbitrary soundtrack information mining proposed by the authors, a new solution to the overlapped sound sources has been developed in this paper by iterative signal cleaning techniques. The system classifies the arbitrary audio signals into music, speech, ambient sounds and silence, allowing overlap. Validation tests have shown that the new techniques can reduce or eliminate information losses in machine audition, hence improving the usability of machine audition in processing real-world audio archives. This paper will also discuss the dataset and principles, present the validation results and discuss potential applications.
Citation
Duncan, P., Mohammed, D., & Li, F. Audio information extraction from arbitrary sound recordings. Presented at 22nd International Congress on Sound and Vibration (ICSV22), Florence, Italy
Presentation Conference Type | Other |
---|---|
Conference Name | 22nd International Congress on Sound and Vibration (ICSV22) |
Conference Location | Florence, Italy |
End Date | Jul 16, 2015 |
Acceptance Date | Feb 1, 2015 |
Publication Date | Jul 14, 2015 |
Deposit Date | Jun 8, 2015 |
Publisher URL | https://iiav.org/archives_icsv_last/2015_icsv22/content/papers/papers/full_paper_1004_20150311175400892.pdf |
Related Public URLs | http://icsv22.org/ https://iiav.org/archives_icsv_last/2015_icsv22/index.html |
Additional Information | Event Type : Conference Funders : Ph.D sponsored by Iraqi governmnent |
You might also like
Microphone wind noise reduction using singular spectrum analysis techniques
(2017)
Presentation / Conference
Mitigating wind noise in outdoor microphone signals using a singular spectral subspace method
(2017)
Presentation / Conference