CFP last date
20 May 2024
Reseach Article

A Comparative Study on the Effects of Pooling on FER CNN Models

by Muskan Agrawal, Padmavati Shrivastasva, Rahul R. Pillai, Chirag Budhwani, Shivam Khare
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 184 - Number 37
Year of Publication: 2022
Authors: Muskan Agrawal, Padmavati Shrivastasva, Rahul R. Pillai, Chirag Budhwani, Shivam Khare
10.5120/ijca2022922463

Muskan Agrawal, Padmavati Shrivastasva, Rahul R. Pillai, Chirag Budhwani, Shivam Khare . A Comparative Study on the Effects of Pooling on FER CNN Models. International Journal of Computer Applications. 184, 37 ( Nov 2022), 7-14. DOI=10.5120/ijca2022922463

@article{ 10.5120/ijca2022922463,
author = { Muskan Agrawal, Padmavati Shrivastasva, Rahul R. Pillai, Chirag Budhwani, Shivam Khare },
title = { A Comparative Study on the Effects of Pooling on FER CNN Models },
journal = { International Journal of Computer Applications },
issue_date = { Nov 2022 },
volume = { 184 },
number = { 37 },
month = { Nov },
year = { 2022 },
issn = { 0975-8887 },
pages = { 7-14 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume184/number37/32554-2022922463/ },
doi = { 10.5120/ijca2022922463 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T01:23:21.842586+05:30
%A Muskan Agrawal
%A Padmavati Shrivastasva
%A Rahul R. Pillai
%A Chirag Budhwani
%A Shivam Khare
%T A Comparative Study on the Effects of Pooling on FER CNN Models
%J International Journal of Computer Applications
%@ 0975-8887
%V 184
%N 37
%P 7-14
%D 2022
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Emotion recognition has attracted much attention in Artificial Intelligence in order to make machines understand emotional sentiments, with many industries trying hard to incorporate emotion recognition technologies into their products. The easiest way to detect a person’s emotion is recognizing their facial expressions. In this work, the researchers tend to use FER as a problem and use the Deep Convolutional Neural Network (DCNN), which extracts the features automatically and therefore surpasses and outperforms the limitations of traditional machine learning. This work provides a comparative study of various existing pre-trained model architectures. MobileNetV2, MobileNetV3_Small, NASNetMobile, ResNet50, ResNet50V2, ResNet152V2, DenseNet169 and DenseNet201 with modification in their pooling layer to achieve high accuracy and have the potential for implementation in embedded systems. In this project, various deep learning pre-trained models were trained, tested, and compared on a modified subset of the FER 2013 Dataset for Face Emotion Recognition under all the conditions of pooling, i.e., None, Min, Avg, and Max. FER 2013 being one of the most challenging dataset, and due to limited run-time cost available, the MobileNetV2 model gave the highest testing accuracy of 83.64% with a training accuracy of 97.87% on average pooling. The models were compared on the following evaluation metrics: Accuracy, Loss, Precision, Recall and F1-score. For a practical approach, they integrate the model into a mobile application so that models can be run on devices in real time.

References
  1. Nandhini Abirami, R., Durai Raj Vincent, P. M., Srinivasan, K., Tariq, U., & Chang, C. Y. (2021). Deep CNN and Deep GAN in Computational Visual Perception-Driven Image Analysis. Complexity, 2021.
  2. De Silva, L. C., Miyasato, T., & Nakatsu, R. (1997, September). Facial emotion recognition using multi-modal information. In Proceedings of ICICS, 1997 International Conference on Information, Communications and Signal Processing. Theme: Trends in Information Systems Engineering and Wireless Multimedia Communications (Cat. (Vol. 1, pp. 397-401). IEEE.
  3. kay, F. T. (2020). Facial expression recognition with deep learning. arXiv preprint arXiv:2004.11823.
  4. Hung, J. C., Lin, K. C., & Lai, N. X. (2019). Recognizing learning emotion based on convolutional neural networks and transfer learning. Applied Soft Computing, 84, 105724
  5. Rescigno, M., Spezialetti, M., & Rossi, S. (2020). Personalized models for facial emotion recognition through transfer learning. Multimedia Tools and Applications, 79(47), 35811-35828.
  6. Hu, L., & Ge, Q. (2020, June). Automatic facial expression recognition based on MobileNetV2 in Real-time. In Journal of Physics: Conference Series (Vol. 1549, No. 2, p. 022136). IOP Publishing.
  7. Singh, A., Srivastav, A. P., Choudhary, P., & Raj, S. (2021, April). Facial emotion recognition using convolutional neural networks. In 2021 2nd International Conference on Intelligent Engineering and Management (ICIEM) (pp. 486-490). IEEE.
  8. Benamara, N. K., Val-Calvo, M., Álvarez-Sánchez, J. R., Díaz-Morcillo, A., Vicente, J. M. F., Fernández-Jover, E., & Stambouli, T. B. (2019, June). Real-time emotional recognition for sociable robotics based on deep neural networks ensemble. In International Work-Conference on the Interplay Between Natural and Artificial Computation (pp. 171-180). Springer, Cham.
  9. Li, B., & Lima, D. (2021). Facial expression recognition via ResNet-50. International Journal of Cognitive Computing in Engineering, 2, 57-64.
  10. Agrawal, A., & Mittal, N. (2020). Using CNN for facial expression recognition: a study of the effects of kernel size and number of filters on accuracy. The Visual Computer, 36(2), 405-412.
  11. Pham, L., Vu, T. H., & Tran, T. A. (2021, January). Facial Expression Recognition Using Residual Masking Network. In 2020 25th International Conference on Pattern Recognition (ICPR) (pp. 4513-4519). IEEE.
  12. Howard, A., Sandler, M., Chu, G., Chen, L. C., Chen, B., Tan, M., ... & Adam, H. (2019). Searching for mobilenetv3. In Proceedings of the IEEE/CVF International Conference on Computer Vision (pp. 1314-1324).
  13. Zoph, B., Vasudevan, V., Shlens, J., & Le, Q. V. (2018). Learning transferable architectures for scalable image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 8697-8710).
  14. Huang, G., Liu, Z., Van Der Maaten, L., & Weinberger, K. Q. (2017). Densely connected convolutional networks. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 4700-4708).
  15. He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770-778).
  16. Mbuthia, J. W. M. FaceNet: Facial Expression Recognition based on Deep Convolutional Neural Networks.
  17. Gondkar, A., Gandhi, R., & Jadhav, N. (2021, October). Facial Emotion Recognition using Transfer Learning: A Comparative Study. In 2021 2nd Global Conference for Advancement in Technology (GCAT) (pp. 1-6). IEEE.
  18. Porușniuc, G. C., Leon, F., Timofte, R., & Miron, C. (2019, November). Convolutional neural networks architectures for facial expression recognition. In 2019 E-Health and Bioengineering Conference (EHB) (pp. 1-6). IEEE.
  19. Yang, L., Zhang, H., Li, D., Xiao, F., & Yang, S. (2021, September). Facial Expression Recognition Based on Transfer Learning and SVM. In Journal of Physics: Conference Series (Vol. 2025, No. 1, p. 012015). IOP Publishing.
  20. Akhand, M. A. H., Roy, S., Siddique, N., Kamal, M. A. S., & Shimamura, T. (2021). Facial Emotion Recognition Using Transfer Learning in the Deep CNN. Electronics, 10(9), 1036.
  21. BELAL, F. (2020). BENCHMARKING OF CONVOLUTIONAL NEURAL NETWORKS FOR FACIAL EXPRESSIONS RECOGNITION. Journal of Theoretical and Applied Information Technology, 98(18).
  22. Sun, J., Slang, S., Elboth, T., Greiner, T. L., McDonald, S., & Gelius, L. J. (2020). Attenuation of marine seismic interference noise employing a customized U‐Net. Geophysical Prospecting, 68(3), 845-871.
  23. Goodfellow, I. J., Erhan, D., Carrier, P. L., Courville, A., Mirza, M., Hamner, B., ... & Bengio, Y. (2013, November). Challenges in representation learning: A report on three machine learning contests. In International conference on neural information processing (pp. 117-124). Springer, Berlin, Heidelber.
Index Terms

Computer Science
Information Sciences

Keywords

Transfer Learning Pooling Face Emotion Recognition