Emotional_Speech_Recognition

This repository will contain the implementation for the paper "Emotional Speech Recognition with Pre-trained Deep Visual Models" very soon.

Paper Reference

If you're interested in understanding the details of the approach, please refer to the paper:

📄 "Emotional Speech Recognition with Pre-trained Deep Visual Models" 🔗 Read on arXiv 📑 Download PDF

Abstract

This work explores a novel approach to emotional speech recognition (ESR) by leveraging pre-trained deep visual models. Instead of traditional speech processing methods, the technique involves:

Converting acoustic features into image representations.
Utilizing pre-trained deep learning models (such as VGG-16) designed for computer vision to classify emotions.
Demonstrating state-of-the-art results on the Berlin EMO-DB dataset.

Stay tuned for the code implementation! 🚀

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
README.md		README.md
Train_Model_VGG_Transfer_Learning_Best_85.ipynb		Train_Model_VGG_Transfer_Learning_Best_85.ipynb
Train_Model_VGG_Transfer_Learning_Stable_89.ipynb		Train_Model_VGG_Transfer_Learning_Stable_89.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Emotional_Speech_Recognition

Paper Reference

Abstract

About

Releases

Packages

Languages

License

mehdi-mirzapour/Emotional_Speech_Recognition

Folders and files

Latest commit

History

Repository files navigation

Emotional_Speech_Recognition

Paper Reference

Abstract

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages