Performance Evaluation of Resnet Model on Sign Language Recognition

Millicent Agangiba; Ezekiel M. Martey; William A. Agangiba; Obed Appiah

Call for Paper

June Edition

IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2024

Submit your paper

Know more

The week's pick

Enhancing Privacy Preservation: Multi-Attribute Protection with P-Sensitive K-Anonymity

Twinkle Patel Kiran Amin

Random Articles

Feasible Study on Pattern Matching Algorithms based on Intrusion Detection Systems

June

2014

Modeling and Economic Analysis of Energy Generation from Biomass Energy

December

2014

M-Pass: Web Authentication Protocol Resistant to Malware and Phishing

April

2014

Performance Analysis on the Effect of Doping Concentration in Copper Indium Gallium Selenide (CIGS) Thin-film Solar Cell

March

2015

Reseach Article

Performance Evaluation of Resnet Model on Sign Language Recognition

by Millicent Agangiba, Ezekiel M. Martey, William A. Agangiba, Obed Appiah

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 184 - Number 43

Year of Publication: 2023

Authors: Millicent Agangiba, Ezekiel M. Martey, William A. Agangiba, Obed Appiah

10.5120/ijca2023922534

Millicent Agangiba, Ezekiel M. Martey, William A. Agangiba, Obed Appiah . Performance Evaluation of Resnet Model on Sign Language Recognition. International Journal of Computer Applications. 184, 43 ( Jan 2023), 22-27. DOI=10.5120/ijca2023922534

@article{ 10.5120/ijca2023922534,

author = { Millicent Agangiba, Ezekiel M. Martey, William A. Agangiba, Obed Appiah },

title = { Performance Evaluation of Resnet Model on Sign Language Recognition },

journal = { International Journal of Computer Applications },

issue_date = { Jan 2023 },

volume = { 184 },

number = { 43 },

month = { Jan },

year = { 2023 },

issn = { 0975-8887 },

pages = { 22-27 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume184/number43/32597-2023922534/ },

doi = { 10.5120/ijca2023922534 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-07T01:23:53.185900+05:30

%A Millicent Agangiba

%A Ezekiel M. Martey

%A William A. Agangiba

%A Obed Appiah

%T Performance Evaluation of Resnet Model on Sign Language Recognition

%J International Journal of Computer Applications

%@ 0975-8887

%V 184

%N 43

%P 22-27

%D 2023

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Communication is an important tool for sharing one’s ideas and thoughts and as such its role in our everyday lives cannot be over emphasised. Sign language is a form of communication used by the deaf and those hard-of-hearing. However, a challenge arises when deaf people have to communicate their ideas to those in the mainstream population. An automatic translator can be an effective way to address this problem. In this study, the performance of the ResNet model and its variants are evaluated on two different datasets. The first dataset contains images of American Sign language (ASL) data and the second dataset consists of images of Indian Sign language (ISL). The is a one-handed sign language, while ISL is mainly a two-handed sign language with complex shapes. ResNet variants such as Resnet18, ResNet34, ResNet50, ResNet101 and ResNet152 have been tested on these standard datasets. We conducted experiments by using deep neural networks to make recommendations and predictions in sign language. Experimental results using a standard dataset demonstrate that the model with 152 layers achieves the highest accuracy.

References

Bickenbach, J.E., Cieza, A. and Sabariego, C., (2016), “Disability and Public Health” Int. J. Environ. Res. Public Health, Vol. 13, pp. 123-132.
Groce, N.E., 2018. Global disability: an emerging issue. The Lancet Global Health, 6(7), pp.e724-e725.
Agangiba, M., “Accessibility of E-government Services for Persons with Disabilities in Developing Countries- The Case of Ghana ”, Unpublished Doctoral Thesis, Department of Information Systems, University of Cape Town, South Africa, 290pp.
Gedam, S. and Shrawankar, U. (2017), "Challenges and opportunities in fingerspelling recognition in the air", In International Conference on Innovative Mechanisms for Industry Applications, Bengaluru, India, pp. 60 – 65.
Nair, A.V. and Bindu, V., (2013), “A review on Indian sign language recognition”, International journal of computer applications, Vol. 73, No. 22, pp. 33-38.
Mahesh, M., Jayaprakash, A. and Geetha, M., 2017, September. Sign language translator for mobile platforms. In 2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) (pp. 1176-1181). IEEE.
Brown, L. D., Hua, H., and Gao, C. 2003. A widget framework for augmented interaction in SCAPE.
Bousbai, K. and Merah, M., (2019), “A Comparative Study of Hand Gestures Recognition Based on MobileNetV2 and ConvNet Models”, In International Conference on Image and Signal Processing and their Applications (ISPA), Mostaganem, Algeria, pp. 1-6.
Kusters, A., De Meulder, M., and O’Brien, D. (2017), Innovations in deaf studies: The role of deaf scholars, Oxford University Press, 416 pp.
Bhujbal, V.P., and Warhade, K.K., (2018), “Hand sign recognition-based communication system for speech disable people", ICICCS 2018, In Proceedings of the 2nd International Conference on Intelligent Computing and Control Systems, Madurai, India, pp. 348 – 352.
Singleton, J. L., Remillard, E. T., Mitzner, T. L., and Rogers, W. A. (2019), “Everyday technology use among older deaf adults” Disability and Rehabilitation: Assistive Technology, Vol. 14, No. 4, pp. 325-332.
Dhiman, R., Joshi, G. and Krishna, C.R., (2021), “A deep learning approach for Indian sign language gestures classification with different backgrounds” In Journal of Physics: Conference Series, Vol. 1950, No. 1, pp. 1-15
Mannan, A., Abbasi, A., Javed, A. R., Ahsan, A., Gadekallu, T. R., & Xin, Q. (2022). Hypertuned deep convolutional neural network for sign language recognition. Computational Intelligence and Neuroscience.
Lum, K.Y., Goh, Y.H. and Lee, Y.B., 2020. American Sign Language recognition based on MobileNetV2. Adv. Sci. Technol. Eng. Syst., 5(6), pp.481-488.
Agrawal, M., Ainapure, R., Agrawal, S., Bhosale, S., & Desai, S. (2020, October). Models for hand gesture recognition using deep learning. In 2020 IEEE 5th International Conference on Computing Communication and Automation (ICCCA) (pp. 589-594). IEEE.
He, K., Zhang, X., Ren, S. and Sun, J. (2016), “Deep residual learning for image recognition", In Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, pp. 770 – 778.
Rathi, P., Kuwar Gupta, R., Agarwal, S. and Shukla, A., 2020, February. Sign language recognition using resnet50 deep neural network architecture. In 5th International Conference on Next Generation Computing Technologies (NGCT-2019).
Saleh, Y. & Issa, G. (2020). “Arabic Sign Language Recognition through Deep Neural Networks Fine-Tuning”. International Association of Online Engineering. https://www.learntechlib.org/p/217934/. Accessed: 21 June 2022
Alleema, N., & Chandrasekaran, S. (2022). Recognition of American Sign Language Using Modified Deep Residual CNN with Modified Canny Edge Segmentation.
Huang, G., Sun, Y., Liu, Z., Sedra, D., Weinberger, K.Q. (2016), “Deep networks with stochastic depth”, In Conference on Computer Vision, Amsterdam, The Netherlands, pp. 646–661.
Veit, A.; Wilber, M.J.; Belongie, S. (2016), “Residual networks behave like ensembles of relatively shallow networks”, In Advances in Neural Information Processing Systems; NIPS, Montreal, QC, Canada, pp. 550–558.
Wu, Z., Shen, C., Van Den Hengel, A. (2019), “Wider or deeper: Revisiting the resnet model for visual recognition”, Pattern Recognition, Vol. 90, 119-133.
Glorot, X.; Bordes, A.; Bengio, Y, (2011), “Deep sparse rectifier neural networks”, In International Conference on Artificial Intelligence and Statistics, Lauderdale, FL, USA, pp. 315–323.
Ioffe, S., Szegedy, C. (2015), “Batch normalization: Accelerating deep network training by reducing internal covariate shift”, www.arxiv.org. Accessed: September 15, 2021.
Xie, S., Girshick, R., Dollár, P., Tu, Z., He, K. (2017), “Aggregated residual transformations for deep neural networks”, In Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, pp. 1492–1500.
Akash, K., (2016), “Image data set for alphabets in the American Sign Language”, www.kaggle.com. Accessed: August 4, 2021.
Sonawane V. (2018), Indian Sign Language Dataset. www.kaggle.com. Accessed: August 20, 2021.
Khun, M. and Johnson, K. (2013), Applied Predictive Modeling, Springer, Basel, 600pp.
Krizhevsky, A., Sutskever, I. and Hinton, G. E. (2012), “ImageNet Classification with Deep Convolutional Neural Networks”, In Advances in Neural Information Processing Systems, Lake Tahoe, Nevada, USA, pp. 1097 – 1105.
Rumelhart, D. E., Hinton, G. E. and Williams, R. J. (1986), “Learning representations by back-propagating errors", Nature, Vol. 323, pp. 533 – 536.
Loshchilov, I. and Hutter, F. (2019), “Decoupled Weight Decay Regularization”, In International Conference on Learning Representations (ICLR), New Orleans, Louisiana, USA, pp. 1-19.
Kingma, D. P. and Ba, J. (2015), “Adam: A Method for Stochastic Optimization" In International Conference on Learning Representations (ICLR), San Diego, CA, USA, pp. 1-15.
Wilson, A. C., Roelofs, R., Stern, M., Srebro, N. and Recht, B. (2017), “The Marginal Value of Adaptive Gradient Methods in Machine Learning”, In Conference on Neural Information Processing System, Long Beach, CA, USA, pp. 1-14.
Smith, L. N. (2018), A disciplined approach to neural network hyper-parameters: Part 1 - learning rate, batch size, momentum, and weight decay, US Naval, 21pp.

Index Terms

Computer Science

Information Sciences

Keywords

Deep Neural Network ResNet American Sign Language Indian Sign Language Image Recognition