V Vinoth Kumar
A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network
Vinoth Kumar, V; Palaiahnakote, Shivakumara; Khan, Surbhi Bhatia; Almusharraf, Ahlam
Authors
Dr Shivakumara Palaiahnakote S.Palaiahnakote@salford.ac.uk
Lecturer in Computer Vision
Dr Surbhi Khan S.Khan138@salford.ac.uk
Lecturer in Data Science
Ahlam Almusharraf
Abstract
Creating a computational device to identify human emotions via voice analysis represents a notable achievement in the sector of human-computer interaction, especially within the healthcare domain. We propose a new lightweight model for addressing challenges of emotions recognition. The model works based on CNN with change of kernel processing. The proposed model performs a direct matching to recognize speech emotions of different eight categories using a statistical model named Analysis of Variance (ANOVA) as kernel for features extraction and Cosine Similarity Measurement (CSM) as activation function for CNN model. This proposed model contains eight-folded single-layered intermediate neurons, and each neuron can segregate speech emotion pattern using CSM from the voice convergence matrix to explore a part of the solution from the whole solution. Experiment results demonstrates that the proposed model outperforms compared with multiple layered existing CNN methods in identifying the emotional state of a speaker.
Citation
Vinoth Kumar, V., Palaiahnakote, S., Khan, S. B., & Almusharraf, A. (2024). A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network. Malaysian journal of computer science, 37(1), 89–106. https://doi.org/10.22452/mjcs.vol37no1.5
Journal Article Type | Article |
---|---|
Acceptance Date | Mar 28, 2024 |
Online Publication Date | Jan 31, 2024 |
Publication Date | Jan 31, 2024 |
Deposit Date | Mar 28, 2024 |
Publicly Available Date | May 1, 2024 |
Journal | Malaysian Journal of Computer Science |
Print ISSN | 0127-9084 |
Peer Reviewed | Peer Reviewed |
Volume | 37 |
Issue | 1 |
Pages | 89–106 |
DOI | https://doi.org/10.22452/mjcs.vol37no1.5 |
Keywords | Analysis of Variance; Speech Emotion Recognition; Deep Learning; CNN; Cosine- similarity measurement |
Files
Published Version
(1.6 Mb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by-sa/4.0/
Manuscript MJCS-Shiva-6-Cleaned
(1.6 Mb)
PDF
You might also like
Exploring Topic Coherence with PCC-LDA and BERT for Contextual Word Generation
(2024)
Journal Article
Enhancing Image Security via Block Cyclic Construction and DNA Based LFSR
(2024)
Journal Article
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search