Improving robustness of speaker recognition in noisy and reverberant conditions via training

Al-Noori, AH; Al-Karawi, KA; Li, FF

doi:10.1109/EISIC.2015.20

Audio content feature selection and classification : a random forests and decision tree approach (2015)
Presentation / Conference
AI-Maathidi, M., & Li, F. (2015, December). Audio content feature selection and classification : a random forests and decision tree approach. Presented at IEEE International Conference on Progress in Informatics and Computing (PIC), Nanjing, China

Content information can be extracted from soundtracks of multimedia files. A good audio classifier as a preprocessor is crucial in such applications. Efforts have been made to develop effective and efficient audio content classifiers in which feat... Read More about Audio content feature selection and classification : a random forests and decision tree approach.

Automatic Speaker Recognition System in Adverse Conditions — Implication of Noise and Reverberation on System Performance (2015)
Journal Article
A. Al-Karawi, K., H. Al-Noori, A., Li, F., & Ritchings, T. (2015). Automatic Speaker Recognition System in Adverse Conditions — Implication of Noise and Reverberation on System Performance. International journal of information and electronics engineering (Singapore : Online), 5(6), 423-427. https://doi.org/10.7763/IJIEE.2015.V5.571

Speaker recognition has been developed and evolved over the past few decades into a supposedly mature technique. Existing methods typically utilize robust features extracted from clean speech. In real-world applications, especially security and for... Read More about Automatic Speaker Recognition System in Adverse Conditions — Implication of Noise and Reverberation on System Performance.

Microphone handling noise : measurements of perceptual threshold and effects on audio quality (2015)
Journal Article
Kendrick, P., Jackson, I., Fazenda, B., Cox, T., & Li, F. (2015). Microphone handling noise : measurements of perceptual threshold and effects on audio quality. PLoS ONE, 10(10), e0140256. https://doi.org/10.1371/journal.pone.0140256

A psychoacoustic experiment was carried out to test the effects of microphone handling noise on perceived audio quality. Handling noise is a problem affecting both amateurs using their smartphones and cameras, as well as professionals using separate... Read More about Microphone handling noise : measurements of perceptual threshold and effects on audio quality.

Perceived audio quality of sounds degraded by non-linear distortions and single-ended assessment using HASQI (2015)
Journal Article
assessment using HASQI. Journal of the Audio Engineering Society, 63(9), 698-712. https://doi.org/10.17743/jaes.2015.0068

For field recordings and user generated content recorded on phones, tablets, and other mobile devices nonlinear distortions caused by clipping and limiting at pre-amplification stages, and dynamic range control (DRC) are common causes of poor audio... Read More about Perceived audio quality of sounds degraded by non-linear distortions and single-ended assessment using HASQI.

Audio content analysis in the presence of overlapped classes : a non-exclusive segmentation approach to mitigate information losses (2015)
Presentation / Conference
Mohammed, D., Duncan, P., & Li, F. (2015, August). Audio content analysis in the presence of overlapped classes : a non-exclusive segmentation approach to mitigate information losses. Presented at Global Summit and Expo on Multimedia & Applications, Birmingham, UK

Soundtracks of multimedia files are information rich, from which much content-related metadata can be extracted. There is a pressing demand for automated classification, identification and information mining of audio content. A segment of the audio s... Read More about Audio content analysis in the presence of overlapped classes : a non-exclusive segmentation approach to mitigate information losses.

Improving headphone user experience in ubiquitous multimedia content consumption : a universal cross-feed filter (2015)
Presentation / Conference
Li, F. (2015, June). Improving headphone user experience in ubiquitous multimedia content consumption : a universal cross-feed filter. Presented at IEEE BMSB 2015, Ghent, Belgium

High performance audio and video codecs and ever increasing bandwidths of data communications networks have enabled multi-platform delivery of high definition media content via transmission channels such as terrestrial broadcast, broadband IP net... Read More about Improving headphone user experience in ubiquitous multimedia content consumption : a universal cross-feed filter.

Independent Component Analysis Methods to Improve Electrocardiogram Patterns Recognition in the Presence of Non-Trivial Artifacts (2015)
Journal Article
Sarfraz, M., Li, F., & Khan, A. (2015). Independent Component Analysis Methods to Improve Electrocardiogram Patterns Recognition in the Presence of Non-Trivial Artifacts. Journal of medical and bioengineering, 4(3), 221-226. https://doi.org/10.12720/jomb.4.3.221-226

Electrocardiogram (ECG) signals are affected by various kinds of noise and artifacts that may impede correct recognition by automated monitoring or diagnosis systems. Independent component analysis (ICA) is considered as a new technique suitable for... Read More about Independent Component Analysis Methods to Improve Electrocardiogram Patterns Recognition in the Presence of Non-Trivial Artifacts.

Microphone handling noise database (2015)
Dataset
Kendrick, P., Jackson, I., Fazenda, B., Cox, T., & Li, F. Microphone handling noise database. [Dataset]

Microphone handling noise can reduce the quality of audio recordings. A perceptual study into this effect has been carried out and the subjective response quantified. This database contains audio recordings of handling noises from 8 different micro... Read More about Microphone handling noise database.

Improving robustness of speaker recognition in noisy and reverberant conditions via training (2015)
Book Chapter
Al-Noori, A., Al-Karawi, K., & Li, F. (2015). Improving robustness of speaker recognition in noisy and reverberant conditions via training. In 2015 European Intelligence and Security Informatics Conference (180-180). IEEE. https://doi.org/10.1109/EISIC.2015.20

Speaker recognition can be used as a security means to authenticate the speaker or as a forensic tool to determine who is likely to be the talker. For such critical applications, robustness or reliability of the system is crucial. In spite of the dev... Read More about Improving robustness of speaker recognition in noisy and reverberant conditions via training.

All Outputs (9)