CFP last date
20 February 2025
Reseach Article

Real-Time Emotion Recognition using Deep Learning: A Comprehensive Approach

by Khushi Jhunjhunwala, Devansh Banka, Haider Kachwalla, Dhirendra Mishra
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 56
Year of Publication: 2024
Authors: Khushi Jhunjhunwala, Devansh Banka, Haider Kachwalla, Dhirendra Mishra
10.5120/ijca2024924296

Khushi Jhunjhunwala, Devansh Banka, Haider Kachwalla, Dhirendra Mishra . Real-Time Emotion Recognition using Deep Learning: A Comprehensive Approach. International Journal of Computer Applications. 186, 56 ( Dec 2024), 45-52. DOI=10.5120/ijca2024924296

@article{ 10.5120/ijca2024924296,
author = { Khushi Jhunjhunwala, Devansh Banka, Haider Kachwalla, Dhirendra Mishra },
title = { Real-Time Emotion Recognition using Deep Learning: A Comprehensive Approach },
journal = { International Journal of Computer Applications },
issue_date = { Dec 2024 },
volume = { 186 },
number = { 56 },
month = { Dec },
year = { 2024 },
issn = { 0975-8887 },
pages = { 45-52 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number56/real-time-emotion-recognition-using-deep-learning-a-comprehensive-approach/ },
doi = { 10.5120/ijca2024924296 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-12-27T02:46:01.275274+05:30
%A Khushi Jhunjhunwala
%A Devansh Banka
%A Haider Kachwalla
%A Dhirendra Mishra
%T Real-Time Emotion Recognition using Deep Learning: A Comprehensive Approach
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 56
%P 45-52
%D 2024
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The work integrates a real-time facial emotion recognition system using the RAF-DB dataset, which contains 15,350 images annotated for seven basic emotions. Different aspects of dataset preparation and data augmentation, architectures to be used for the model, and performance evaluation on emotion classification are discussed within this study. A CNN is used in the emotion detection that can be drawn from live video streams in very high efficiency. Even though it is varied and contains images of various age ranges and demographic differences, the dataset nonetheless presents challenges like class imbalance and issues regarding the privacy and bias of data. Significant improvement in a model's generalization ability can be seen after using data augmentation techniques like rescaling, shearing, zooming, and horizontal flipping. Best accuracy was obtained at epoch 25, which was 78.32% for validation with three hidden layers and a filter of 3x3. This model therefore maintains an equilibrium between accuracy and real-time performance. The performance of the proposed model was tested across various demographic groups, and also with variations in the accuracy presented in most cases of the detection of fear and disgust. The research has compared its proposed model with existing models such as VGGNet and ResNet to give prominence to the computational efficiency, making it fit for real-time applications in situations where a dataset would be relatively small and limits the amount of computational resources available. The findings are important to put emotions recognition into practice in the real world, bringing to limelight the need for balancing model performance with ethical applicability in the field of AI.

References
  1. D. Duncan, G. Shine, and C. English, "Facial emotion recognition in real time," Stanford University, 2020.. Available: duncand@stanford.edu
  2. W. Mellouka and W. Handouzia, "Facial emotion recognition using deep learning: Review and insights," in *Proc. 2nd Int. Workshop Future Internet Everything (FIoE)*, Leuven, Belgium, 2020, pp. 1-8.
  3. S. Kaur and N. Kulkarni, "Emotion recognition – A review," CSE, MIT SOE, MIT ADT University, Pune, India, 2020.
  4. J. A. Ballesteros, G. M. Ramírez, F. Moreira, A. Solano, and C. A. Pelaez, "Facial emotion recognition through artificial intelligence," University of Cauca, Colombia, 2020.
  5. S. M. Saleem Abdullah, S. Y. Ameen, M. A. M.sadeeq, and S. R. M. Zeebaree, "Multimodal emotion recognition using deep learning," 2020.
  6. S. J. Oh, J. Y. Lee, and D. K. Kim, "The design of CNN architectures for optimal six basic emotion classification using multiple physiological signals," 2020.
  7. P. Vanneste, J. Oramas, T. Verelst, T. Tuytelaars, A. Raes, F. Depaepe, and W. Van den Noortgate, "Computer vision and human behaviour, emotion and cognition detection: A use case on student engagement," 2020.
  8. H. I. James, J. J. Anto Arnold, J. M. Masilla Ruban, M. Tamilarasan, and R. Saranya, "Emotion-based music recommendation system," Valliammai Engineering College, Chennai, Tamil Nadu, 2020.
  9. E. M. Onyema, P. K. Shukla, S. Dalal, M. N. Mathur, M. Zakariah, and B. Tiwari, "Enhancement of patient facial recognition through deep learning algorithm: ConvNet," 2020.
  10. A. Khopkar and A. A. Saxena, "Facial expression recognition using CNN with Keras," DES’s Navinchandra Mehta Institute of Technology and Development & Research Scholar Pacific University, Udaipur, India, 2020.
  11. A. Agrawal and N. Mittal, "Using CNN for facial expression recognition: A study of the effects of kernel size and number of filters on accuracy," 2020.
  12. I.-K. Choi, H.-E. Ahn, and J. Yoo, "Facial expression classification using deep convolutional neural network," 2020.
  13. Mollahosseini, A., Hasani, B., & Mahoor, M. H. (2017). AffectNet: A database for facial expression, valence, and arousal computing in the wild. IEEE Transactions on Affective Computing, 10(1), 18-31.
  14. Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT press.
  15. Viola, P., & Jones, M. (2001). Rapid object detection using a boosted cascade of simple features. Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 511-518.
  16. He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 770-778).
  17. Simonyan, K., & Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556.
Index Terms

Computer Science
Information Sciences

Keywords

Real-time Emotion Detection CNN Deep Learning Imbalanced Dataset Facial Expression Recognition Accuracy