Skip to main content

Research Repository

Advanced Search

Audio information extraction from arbitrary sound recordings

Duncan, PJ; Mohammed, DY; Li, FF

Authors

PJ Duncan

DY Mohammed

FF Li



Abstract

Numerous archives of entertainment soundtracks and other recordings such as environmental noise samples have imposed a big data challenge in audio related industries. This necessitates the use of machine audition and retrieval tools to extract semantic information for various applications. Speech recognition, environmental noise classification and music information retrieval tools haven been developed in the past for specific purposes. Combined use of these tools to process arbitrary sound recordings remains challenging: overlap of diverse sources mitigates the classification, resulting in poor recognition and/or missing content. Following a review of a universal framework for arbitrary soundtrack information mining proposed by the authors, a new solution to the overlapped sound sources has been developed in this paper by iterative signal cleaning techniques. The system classifies the arbitrary audio signals into music, speech, ambient sounds and silence, allowing overlap. Validation tests have shown that the new techniques can reduce or eliminate information losses in machine audition, hence improving the usability of machine audition in processing real-world audio archives. This paper will also discuss the dataset and principles, present the validation results and discuss potential applications.

Citation

Duncan, P., Mohammed, D., & Li, F. Audio information extraction from arbitrary sound recordings. Presented at 22nd International Congress on Sound and Vibration (ICSV22), Florence, Italy

Presentation Conference Type Other
Conference Name 22nd International Congress on Sound and Vibration (ICSV22)
Conference Location Florence, Italy
End Date Jul 16, 2015
Acceptance Date Feb 1, 2015
Publication Date Jul 14, 2015
Deposit Date Jun 8, 2015
Publisher URL https://iiav.org/archives_icsv_last/2015_icsv22/content/papers/papers/full_paper_1004_20150311175400892.pdf
Related Public URLs http://icsv22.org/
https://iiav.org/archives_icsv_last/2015_icsv22/index.html
Additional Information Event Type : Conference
Funders : Ph.D sponsored by Iraqi governmnent