Neural Echoes: Speech Emotion Detection Using Machine Learning

Authors

  • Vijaya Grace Garaga
  • Yandamuri Sujana Sri Padmaja
  • Vande Rama Satya Kalyani
  • Dammala S C V N Gangadhar
  • Cherukumilli Bhuvaneswari

Keywords:

Accuracy, Classification Algorithms, Convolutional Neural Networks (CNNs), Feature Extraction, Recurrent Neural Networks (RNNs), Speech Emotion Recognition (SER)

Abstract

Speech Emotion Recognition (SER) has gained prominence because of its diverse applications and the difficulty of inferring emotional content from speech. An accuracy of 98% in SER highlights the effectiveness of advanced feature extraction and classification techniques. Key methods include Mel-Frequency Cepstral Coefficients (MFCCs) for feature extraction and classification algorithms such as Support Vector Machines (SVMs), Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), including Long Short-Term Memory (LSTM) networks, and Transformers. Hybrid approaches, such as combining multiple classifiers and fusing features, further improve accuracy. This level of performance underscores the value of integrating these algorithms to address the inherent subjectivity of emotion detection from speech signals.
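
As an illustration of the MFCC feature-extraction step named in the abstract, the following is a minimal sketch in Python with NumPy. It is not the authors' implementation. The parameter values (16 kHz sampling rate, 400-sample frames, 160-sample hop, 512-point FFT, 26 mel filters, 13 coefficients, 0.97 pre-emphasis) are common defaults assumed here, and the function names are hypothetical. The DCT is left unnormalized for brevity.

```python
import numpy as np

def hz_to_mel(f):
    # Common mel-scale approximation (an assumption; variants exist).
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    # Triangular filters spaced evenly on the mel scale from 0 Hz to Nyquist.
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0),
                             n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):      # rising edge of the triangle
            fbank[i - 1, k] = (k - left) / (center - left)
        for k in range(center, right):     # falling edge of the triangle
            fbank[i - 1, k] = (right - k) / (right - center)
    return fbank

def dct_matrix(n_out, n_in):
    # Unnormalized DCT-II basis: rows are output coefficients.
    n = np.arange(n_in)
    k = np.arange(n_out)[:, None]
    return np.cos(np.pi * k * (2 * n + 1) / (2 * n_in))

def mfcc(signal, sample_rate=16000, frame_len=400, hop=160,
         n_fft=512, n_filters=26, n_coeffs=13):
    # Assumes the signal holds at least frame_len samples.
    signal = np.asarray(signal, dtype=float)
    # Pre-emphasis boosts high frequencies before framing.
    emphasized = np.append(signal[0], signal[1:] - 0.97 * signal[:-1])
    # Slice the signal into overlapping frames and apply a Hamming window.
    n_frames = 1 + (len(emphasized) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = emphasized[idx] * np.hamming(frame_len)
    # Power spectrum of each frame (zero-padded to n_fft).
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2 / n_fft
    # Mel filterbank energies, log compression, then DCT to decorrelate.
    energies = power @ mel_filterbank(n_filters, n_fft, sample_rate).T
    log_energies = np.log(energies + 1e-10)
    return log_energies @ dct_matrix(n_coeffs, n_filters).T

def utterance_features(signal, **kwargs):
    # One simple, assumed way to get a fixed-length vector per utterance:
    # concatenate the per-coefficient mean and standard deviation over frames.
    coeffs = mfcc(signal, **kwargs)
    return np.concatenate([coeffs.mean(axis=0), coeffs.std(axis=0)])
```

With these defaults, a one-second signal sampled at 16 kHz yields a 98 x 13 matrix of frame-level coefficients, and `utterance_features` reduces it to a 26-dimensional vector that could be passed to a classifier such as an SVM.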

Published

26-03-2025

How to Cite

Neural Echoes: Speech Emotion Detection Using Machine Learning. (2025). International Journal of Information Technology and Computer Engineering, 13(1), 743-752. https://ijitce.org/index.php/ijitce/article/view/977