A system for semantic information extraction from mixed soundtracks deploying MARSYAS framework
Mohammed, DY; Duncan, PJ; Al-Maathidi, MM; Li, FF
Authors
DY Mohammed
PJ Duncan
MM Al-Maathidi
FF Li
Abstract
Ever-increasing volumes of media content, and the desire to extract information from media archives, motivate research into semantic audio information mining. Much research in this field concerns the development of bespoke systems in which soundtracks are exclusively classified and segmented, and a specific type of sound is recognized and analyzed. This approach, however, hinders the complete extraction of all relevant semantic information and audio scene analysis. The current study addresses soundtracks with overlapping music, speech and ambient sounds, and explores how MARSYAS (Music Analysis, Retrieval and Synthesis for Audio Signals) can be extended to mixed and overlapped soundtrack applications. MARSYAS has been adapted to this application by adding speech cleaning algorithms. The proposed system can analyze arbitrary soundtracks and timestamp the occurrences of music and speech, allowing overlaps, in the form of a “sound score” from which further recognition methods can extract music score and text information. Validation tests have shown that the new system handles overlapping cases and is therefore capable of extracting more information than other existing methods.
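The abstract does not specify how the "sound score" is represented, so the following is only a minimal sketch of the idea under stated assumptions: per-frame activity flags for each sound class (here hypothetical `music` and `speech` tracks) are merged into timestamped segments, and each class keeps its own track so that segments may overlap. The names `Segment`, `build_sound_score` and `overlaps`, and the hop size, are illustrative and are not part of MARSYAS or of the authors' system.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Segment:
    label: str    # sound class, e.g. "music" or "speech" (hypothetical labels)
    start: float  # segment start time in seconds
    end: float    # segment end time in seconds


def build_sound_score(frame_labels: Dict[str, List[bool]],
                      hop_seconds: float) -> List[Segment]:
    """Merge per-frame activity flags into timestamped segments.

    Each label is processed independently, so segments of different
    labels are allowed to overlap in time.
    """
    segments: List[Segment] = []
    for label, flags in frame_labels.items():
        start_index = None
        for i, active in enumerate(flags):
            if active and start_index is None:
                start_index = i  # a run of active frames begins
            elif not active and start_index is not None:
                segments.append(
                    Segment(label, start_index * hop_seconds, i * hop_seconds))
                start_index = None
        if start_index is not None:  # run continues to the end of the track
            segments.append(
                Segment(label, start_index * hop_seconds,
                        len(flags) * hop_seconds))
    return sorted(segments, key=lambda s: (s.start, s.label))


def overlaps(score: List[Segment], first: str,
             second: str) -> List[Tuple[float, float]]:
    """Return the (start, end) intervals where both labels are active."""
    result = []
    for a in score:
        if a.label != first:
            continue
        for b in score:
            if b.label != second:
                continue
            start, end = max(a.start, b.start), min(a.end, b.end)
            if start < end:
                result.append((start, end))
    return sorted(result)


# Assumed frame-level decisions from some upstream classifier, 0.5 s per frame.
frames = {
    "music":  [True, True, True, True, False, False],
    "speech": [False, False, True, True, True, False],
}
score = build_sound_score(frames, hop_seconds=0.5)
music_and_speech = overlaps(score, "music", "speech")
```

Keeping one track per class, rather than a single label per frame, is what lets such a representation record speech over background music instead of forcing an exclusive choice between the two.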
Citation
Mohammed, D., Duncan, P., Al-Maathidi, M., & Li, F. A system for semantic information extraction from mixed soundtracks deploying MARSYAS framework. In 2015 IEEE 13th International Conference on Industrial Informatics (INDIN) (1084-1089). IEEE. https://doi.org/10.1109/INDIN.2015.7281886
Deposit Date | May 9, 2016 |
---|---|
Pages | 1084-1089 |
Book Title | 2015 IEEE 13th International Conference on Industrial Informatics (INDIN) |
ISBN | 9781479966493 |
DOI | https://doi.org/10.1109/INDIN.2015.7281886 |
Publisher URL | http://dx.doi.org/10.1109/INDIN.2015.7281886 |
Additional Information | Event Type : Conference |