Skip to main content

Research Repository

Advanced Search

Audio content analysis in the presence of overlapped classes : a non-exclusive segmentation approach to mitigate information losses

Mohammed, DY; Duncan, PJ; Li, FF

Audio content analysis in the presence of overlapped classes : a non-exclusive segmentation approach to mitigate information losses Thumbnail


Authors

DY Mohammed

PJ Duncan

FF Li



Abstract

Soundtracks of multimedia files are information rich, from which much content-related metadata can be extracted. There is a pressing demand for automated classification, identification and information mining of audio content. A segment of the audio soundtrack can be either speech, music, event sounds or a combination of them.There exist many individual algorithms for the recognition and analysis of speech, music or event sounds, allowing for embedded information to be retrieved in a semantic fashion. A systematic review shows that a universal system that is optimised to extract the maximum amount of information for further text mining and inference does not exist. Mainstream algorithms typically work with a single class of sound, e.g. speech, music or even sounds and classification methods are predominantly exclusive (detects one class at a time) and losing much of information when two or three classes are overlapped.
A universal open architecture for audio content and scene analysis has been proposed by the authors. To mitigate information losses in overlapped content, non-exclusive segmentation approaches were adopted. This paper is presented from one possible implementation deploying the universal open architecture as a paradigm to show how the universal open architecture can integrate existing methods and workflow but maximise extractable semantic information.
In the current work, overlapped content is identified and segmented from carefully tailored feature spaces and a family of decision trees are used to generate a content score. Results show that the developed system, when compared with well established audio content analysers, can identify and thus extract information from much more speech and music segments. The full paper will discuss the methods, detail the results and illustrate how the system works.

Citation

Mohammed, D., Duncan, P., & Li, F. (2015, August). Audio content analysis in the presence of overlapped classes : a non-exclusive segmentation approach to mitigate information losses. Presented at Global Summit and Expo on Multimedia & Applications, Birmingham, UK

Presentation Conference Type Speech
Conference Name Global Summit and Expo on Multimedia & Applications
Conference Location Birmingham, UK
Start Date Aug 10, 2015
End Date Aug 11, 2015
Deposit Date Jul 7, 2017
Publicly Available Date Jul 7, 2017
DOI https://doi.org/10.4172/2165-7866.S1.002
Publisher URL http://dx.doi.org/10.4172/2165-7866.S1.002
Related Public URLs https://www.omicsgroup.org/journals/ArchiveJITSE/multimedia-and-applications-2015-proceedings.php
Additional Information Event Type : Conference

Files

Audio content analysis in the presence of overlapped classes - a non-exclusive segmentation approach to mitigate information losses2..pdf (292 Kb)
PDF

Version
Abstract


Audio content analysis in the presence of overlapped classes - a non-exclusive segmentation approach to mitigate information losses2..docx (125 Kb)
Document

Version
Abstract





You might also like



Downloadable Citations