V Vinoth Kumar
A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network
Vinoth Kumar, V; Palaiahnakote, Shivakumara; Khan, Surbhi Bhatia; Almusharraf, Ahlam
Authors
Dr Shivakumara Palaiahnakote S.Palaiahnakote@salford.ac.uk
Lecturer
Dr Surbhi Khan S.Khan138@salford.ac.uk
Lecturer
Ahlam Almusharraf
Abstract
Creating a computational device to identify human emotions via voice analysis represents a notable achievement in the sector of human-computer interaction, especially within the healthcare domain. We propose a new lightweight model for addressing challenges of emotions recognition. The model works based on CNN with change of kernel processing. The proposed model performs a direct matching to recognize speech emotions of different eight categories using a statistical model named Analysis of Variance (ANOVA) as kernel for features extraction and Cosine Similarity Measurement (CSM) as activation function for CNN model. This proposed model contains eight-folded single-layered intermediate neurons, and each neuron can segregate speech emotion pattern using CSM from the voice convergence matrix to explore a part of the solution from the whole solution. Experiment results demonstrates that the proposed model outperforms compared with multiple layered existing CNN methods in identifying the emotional state of a speaker.
Citation
Vinoth Kumar, V., Palaiahnakote, S., Khan, S. B., & Almusharraf, A. (2024). A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network. Malaysian journal of computer science, 37(1), 89–106. https://doi.org/10.22452/mjcs.vol37no1.5
Journal Article Type | Article |
---|---|
Acceptance Date | Mar 28, 2024 |
Online Publication Date | Jan 31, 2024 |
Publication Date | Jan 31, 2024 |
Deposit Date | Mar 28, 2024 |
Publicly Available Date | May 1, 2024 |
Journal | Malaysian Journal of Computer Science |
Print ISSN | 0127-9084 |
Peer Reviewed | Peer Reviewed |
Volume | 37 |
Issue | 1 |
Pages | 89–106 |
DOI | https://doi.org/10.22452/mjcs.vol37no1.5 |
Keywords | Analysis of Variance; Speech Emotion Recognition; Deep Learning; CNN; Cosine- similarity measurement |
Files
Published Version
(1.6 Mb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by-sa/4.0/
Manuscript MJCS-Shiva-6-Cleaned
(1.6 Mb)
PDF
You might also like
An Adaptive Xception Model for Classification of Brain Tumors
(2024)
Journal Article
Altered Handwritten Text Detection in Document Images Using Deep Learning
(2024)
Journal Article
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search