Neural Echoes: Speech Emotion Detection Using Machine Learning

Vijaya Grace Garaga; Yandamuri Sujana Sri Padmaja; Vande Rama Satya Kalyani; Dammala S C V N Gangadhar; Cherukumilli Bhuvaneswari

Authors

Vijaya Grace Garaga Author
Yandamuri Sujana Sri Padmaja Author
Vande Rama Satya Kalyani Author
Dammala S C V N Gangadhar Author
Cherukumilli Bhuvaneswari Author

Keywords:

Accuracy, Classification Algorithms, Convolutional Neural Networks (CNNs), Feature Extraction, Recurrent Neural Networks (RNNs), Speech Emotion Recognition (SER)

Abstract

Speech Emotion Recognition (SER) has gained prominence due to its diverse applications and the complexities of analyzing emotional content from speech. Achieving 98% accuracy in SER highlights the effectiveness of advanced techniques in feature extraction and classification. Key methods include Mel-Frequency Cepstral Coefficients (MFCCs) for feature extraction, and various classification algorithms such as Support Vector Machines (SVMs), Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs) including Long Short-Term Memory (LSTM) networks, and Transformers. Hybrid approaches, like combining multiple classifiers and feature fusion, further enhance accuracy. This high level of performance underscores the impact of integrating sophisticated algorithms to overcome the challenges in subjective emotion detection from speech signals.