DY Mohammed
Audio content analysis in the presence of overlapped classes : a non-exclusive segmentation approach to mitigate information losses
Mohammed, DY; Duncan, PJ; Li, FF
Authors
PJ Duncan
FF Li
Abstract
Soundtracks of multimedia files are rich in information, and much content-related metadata can be extracted from them. There is a pressing demand for automated classification, identification and information mining of audio content. A segment of an audio soundtrack can be speech, music, event sounds or a combination of these. Many individual algorithms exist for the recognition and analysis of speech, music or event sounds, allowing embedded information to be retrieved in a semantic fashion. A systematic review shows that no universal system exists that is optimised to extract the maximum amount of information for further text mining and inference. Mainstream algorithms typically work with a single class of sound, e.g. speech, music or event sounds, and classification methods are predominantly exclusive (detecting one class at a time), so much information is lost when two or three classes overlap.
A universal open architecture for audio content and scene analysis has been proposed by the authors. To mitigate information losses in overlapped content, non-exclusive segmentation approaches were adopted. This paper presents one possible implementation of the universal open architecture as a paradigm, showing how the architecture can integrate existing methods and workflows while maximising the extractable semantic information.
In the current work, overlapped content is identified and segmented from carefully tailored feature spaces, and a family of decision trees is used to generate a content score. Results show that the developed system, when compared with well-established audio content analysers, can identify, and thus extract information from, many more speech and music segments. The full paper will discuss the methods, detail the results and illustrate how the system works.
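The difference between exclusive and non-exclusive segmentation can be illustrated with a minimal Python sketch. It is not the authors' implementation: the per-frame scores are made up and simply stand in for whatever content score the decision trees produce, and the 0.5 threshold, the class names and all function names are assumptions chosen for illustration.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical class names, following the three sound types in the abstract.
CLASSES = ("speech", "music", "event")


def exclusive_labels(scores: List[Dict[str, float]]) -> List[str]:
    """Exclusive labelling: keep only the highest-scoring class per frame."""
    return [max(CLASSES, key=lambda c: frame[c]) for frame in scores]


def non_exclusive_labels(
    scores: List[Dict[str, float]], threshold: float = 0.5
) -> List[Set[str]]:
    """Non-exclusive labelling: keep every class whose score reaches the threshold."""
    return [{c for c in CLASSES if frame[c] >= threshold} for frame in scores]


def segments(labels: List[Set[str]], cls: str) -> List[Tuple[int, int]]:
    """Merge consecutive frames containing `cls` into (start, end) pairs, end exclusive."""
    out: List[Tuple[int, int]] = []
    start = None
    for i, active in enumerate(labels):
        if cls in active and start is None:
            start = i
        elif cls not in active and start is not None:
            out.append((start, i))
            start = None
    if start is not None:
        out.append((start, len(labels)))
    return out


# Made-up per-frame scores: speech fades out while music fades in.
frame_scores = [
    {"speech": 0.9, "music": 0.1, "event": 0.0},
    {"speech": 0.8, "music": 0.7, "event": 0.1},
    {"speech": 0.6, "music": 0.9, "event": 0.2},
    {"speech": 0.1, "music": 0.9, "event": 0.0},
]

excl = exclusive_labels(frame_scores)
multi = non_exclusive_labels(frame_scores)
speech_segments = segments(multi, "speech")
music_segments = segments(multi, "music")
```

With these illustrative scores, the exclusive labelling assigns speech to frames 0 and 1 and music to frames 2 and 3, whereas the non-exclusive labelling keeps speech over frames 0 to 2 and music over frames 1 to 3, so the overlapped frames remain available to both a speech analyser and a music analyser.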
Citation
Mohammed, D., Duncan, P., & Li, F. (2015, August). Audio content analysis in the presence of overlapped classes : a non-exclusive segmentation approach to mitigate information losses. Presented at Global Summit and Expo on Multimedia & Applications, Birmingham, UK
Presentation Conference Type | Speech |
---|---|
Conference Name | Global Summit and Expo on Multimedia & Applications |
Conference Location | Birmingham, UK |
Start Date | Aug 10, 2015 |
End Date | Aug 11, 2015 |
Deposit Date | Jul 7, 2017 |
Publicly Available Date | Jul 7, 2017 |
DOI | https://doi.org/10.4172/2165-7866.S1.002 |
Publisher URL | http://dx.doi.org/10.4172/2165-7866.S1.002 |
Related Public URLs | https://www.omicsgroup.org/journals/ArchiveJITSE/multimedia-and-applications-2015-proceedings.php |
Additional Information | Event Type : Conference |
Files
Audio content analysis in the presence of overlapped classes - a non-exclusive segmentation approach to mitigate information losses2..pdf (292 Kb)
PDF
Version: Abstract
Audio content analysis in the presence of overlapped classes - a non-exclusive segmentation approach to mitigate information losses2..docx (125 Kb)
Document
Version: Abstract