V Vinoth Kumar
A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network
Vinoth Kumar, V; Palaiahnakote, Shivakumara; Khan, Surbhi Bhatia; Almusharraf, Ahlam
Authors
Dr Shivakumara Palaiahnakote S.Palaiahnakote@salford.ac.uk
Lecturer
Surbhi Bhatia Khan
Ahlam Almusharraf
Abstract
Creating a computational device to identify human emotions via voice analysis represents a notable achievement in the sector of human-computer interaction, especially within the healthcare domain. We propose a new lightweight model for addressing challenges of emotions recognition. The model works based on CNN with change of kernel processing. The proposed model performs a direct matching to recognize speech emotions of different eight categories using a statistical model named Analysis of Variance (ANOVA) as kernel for features extraction and Cosine Similarity Measurement (CSM) as activation function for CNN model. This proposed model contains eight-folded single-layered intermediate neurons, and each neuron can segregate speech emotion pattern using CSM from the voice convergence matrix to explore a part of the solution from the whole solution. Experiment results demonstrates that the proposed model outperforms compared with multiple layered existing CNN methods in identifying the emotional state of a speaker.
Citation
Vinoth Kumar, V., Palaiahnakote, S., Khan, S. B., & Almusharraf, A. (in press). A New Approach for Speech Emotion Recognition using Single Layered Convolutional Neural Network. Malaysian journal of computer science,
Journal Article Type | Article |
---|---|
Acceptance Date | Mar 28, 2024 |
Deposit Date | Mar 28, 2024 |
Journal | Malaysian Journal of Computer Science |
Print ISSN | 0127-9084 |
Peer Reviewed | Peer Reviewed |
Keywords | Analysis of Variance; Speech Emotion Recognition; Deep Learning; CNN; Cosine- similarity measurement |
This file is under embargo due to copyright reasons.
Contact S.Palaiahnakote@salford.ac.uk to request a copy for personal use.
You might also like
Classification of aesthetic natural scene images using statistical and semantic features
(2023)
Journal Article
An Episodic Learning Network for Text Detection on Human Bodies in Sports Images
(2022)
Journal Article
A Conformable Moments-Based Deep Learning System for Forged Handwriting Detection
(2022)
Journal Article
Downloadable Citations
About USIR
Administrator e-mail: library-research@salford.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search