Audio content analysis in the presence of overlapped classes : a non-exclusive segmentation approach to mitigate information losses

Mohammed, DY; Duncan, PJ; Li, FF

doi:10.4172/2165-7866.S1.002

Audio content analysis in the presence of overlapped classes : a non-exclusive segmentation approach to mitigate information losses

Mohammed, DY; Duncan, PJ; Li, FF

Authors

DY Mohammed

PJ Duncan

FF Li

Abstract

Soundtracks of multimedia files are information rich, from which much content-related metadata can be extracted. There is a pressing demand for automated classification, identification and information mining of audio content. A segment of the audio soundtrack can be either speech, music, event sounds or a combination of them.There exist many individual algorithms for the recognition and analysis of speech, music or event sounds, allowing for embedded information to be retrieved in a semantic fashion. A systematic review shows that a universal system that is optimised to extract the maximum amount of information for further text mining and inference does not exist. Mainstream algorithms typically work with a single class of sound, e.g. speech, music or even sounds and classification methods are predominantly exclusive (detects one class at a time) and losing much of information when two or three classes are overlapped.
A universal open architecture for audio content and scene analysis has been proposed by the authors. To mitigate information losses in overlapped content, non-exclusive segmentation approaches were adopted. This paper is presented from one possible implementation deploying the universal open architecture as a paradigm to show how the universal open architecture can integrate existing methods and workflow but maximise extractable semantic information.
In the current work, overlapped content is identified and segmented from carefully tailored feature spaces and a family of decision trees are used to generate a content score. Results show that the developed system, when compared with well established audio content analysers, can identify and thus extract information from much more speech and music segments. The full paper will discuss the methods, detail the results and illustrate how the system works.

Citation

Mohammed, D., Duncan, P., & Li, F. (2015, August). Audio content analysis in the presence of overlapped classes : a non-exclusive segmentation approach to mitigate information losses. Presented at Global Summit and Expo on Multimedia & Applications, Birmingham, UK

Presentation Conference Type	Speech
Conference Name	Global Summit and Expo on Multimedia & Applications
Conference Location	Birmingham, UK
Start Date	Aug 10, 2015
End Date	Aug 11, 2015
Deposit Date	Jul 7, 2017
Publicly Available Date	Jul 7, 2017
DOI	https://doi.org/10.4172/2165-7866.S1.002
Publisher URL	http://dx.doi.org/10.4172/2165-7866.S1.002
Related Public URLs	https://www.omicsgroup.org/journals/ArchiveJITSE/multimedia-and-applications-2015-proceedings.php
Additional Information	Event Type : Conference

Files

Audio content analysis in the presence of overlapped classes - a non-exclusive segmentation approach to mitigate information losses2..pdf (292 Kb)
PDF

Version
Abstract

Audio content analysis in the presence of overlapped classes - a non-exclusive segmentation approach to mitigate information losses2..docx (125 Kb)
Document

Version
Abstract