CFP last date
20 May 2024
Reseach Article

A Convolution Neural Network for Optical Character Recognition and Subsequent Machine Translation

by Goutam Sarker, Swagata Ghosh
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 182 - Number 30
Year of Publication: 2018
Authors: Goutam Sarker, Swagata Ghosh
10.5120/ijca2018918203

Goutam Sarker, Swagata Ghosh . A Convolution Neural Network for Optical Character Recognition and Subsequent Machine Translation. International Journal of Computer Applications. 182, 30 ( Dec 2018), 23-27. DOI=10.5120/ijca2018918203

@article{ 10.5120/ijca2018918203,
author = { Goutam Sarker, Swagata Ghosh },
title = { A Convolution Neural Network for Optical Character Recognition and Subsequent Machine Translation },
journal = { International Journal of Computer Applications },
issue_date = { Dec 2018 },
volume = { 182 },
number = { 30 },
month = { Dec },
year = { 2018 },
issn = { 0975-8887 },
pages = { 23-27 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume182/number30/30219-2018918203/ },
doi = { 10.5120/ijca2018918203 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T01:12:55.060537+05:30
%A Goutam Sarker
%A Swagata Ghosh
%T A Convolution Neural Network for Optical Character Recognition and Subsequent Machine Translation
%J International Journal of Computer Applications
%@ 0975-8887
%V 182
%N 30
%P 23-27
%D 2018
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Optical character recognition has been a longstanding challenging research topic in the broad area of machine learning and pattern recognition. In the present paper, we investigate the problem of textual image recognition and translation, which is among the most daunting challenges in image-based sequence recognition. A convolutional neural network (CNN) architecture, which integrates optical character recognition and natural language translation into a unified framework, is proposed. The accuracy of both OCR output and subsequent translation is moderate and satisfactory. The proposed system for OCR and subsequent translation is an effective, efficient and most promising one.

References
  1. J. Wang and X. Hu, “Gated Recurrent Convolution Neural Network for OCR”, 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA, 2017.
  2. B. Shi, X. Bai and C. Yao, “An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition”, Wuhan, China, 2015.
  3. Otsu N., “A Threshold Selection Method from Gray-Level Histograms”, IEEE Transactions on Systems, Man, and Cybernetics ( Volume: 9 , Issue: 1 , Jan. 1979 ), 1979.
  4. Andy Thomas, “Keras LSTM tutorial -- How to easily build a powerful deep learning language model”, Adventures in Machine Learning, www.adventuresinmachinelearning.com
  5. Ian Goodfellow, “Deep Learning”, MIT Press
  6. Sarker G., “A Treatise on Artificial Intelligence”, ISBN : 978-93-5321-793-8 ; 1, 325, 2018
  7. Sarker G., “Some Studies on Convolutional Neural Networks”, International Journal of Computer Applications – Vol. 182, DOI 10.5120/ijca201891, October 2018.
  8. Sarker, G., Dhua, S., Besra, M. (2015), “An Optimal Clustering for Fuzzy Categorization of Cursive Handwritten Text with Weight Learning in Textual Attributes”, 2015.
  9. Sarker, G., Dhua, S., Besra, M., “A Learning Based Handwritten Text Categorization”, 2015.
  10. Sarker, G., Dhua, S., Besra, M., “A Programming based Handwritten Text Identification”, 2015.
  11. Sarker, G., “A Weight LearningTechnique for Cursive Handwritten Text Categorization with Fuzzy Confusion Matrix”, 2nd International Conference on Control, Instrumentation, Energy & Communication (CIEC), 2016.
  12. Sarker G., “A New Technique for Extraction based Text Summarization”, 31st Indian Engineering Congress, Kolkata, 2016.
  13. Sarker G., Roy K., “A modified RBF Network with Optimal Clustering for Face Identification and Localization”, 2013.
  14. Sarker G., Roy K., “An RBF Network with Optimal Clustering for Face Identification”, 2013.
  15. Sarker G., Kundu S., “A modified Radial Basis Function Network for Fingerprint Identification and Localization”, 2013.
  16. Bhakta D., Sarker G., “A Radial Basis Function Network for Face Identification and Subsequent Localization”, 2013.
  17. Roy K., Sarker G., “A Location Invariant Face Identification and Localization with Modified RBF Network”, 2013.
  18. Bhakta D., Sarker G., “A Rotation and Location Invariant Face Identification and Localization with or without Occlusion using modified RBFN”, 2013.
  19. Sarker G., Sharma S., “A Heuristic Based RBFN for Location and Rotation Invariant Clear and Occluded Face Identification”, 2014.
  20. Kundu S., Sarker G., “A Modified RBFN Based on Heuristic Based Clustering for Location Invariant Fingerprint Rcognition and Localization with or without Occlusion”, 2014.
  21. Kundu S., Sarker G., “A Modified BP Network using Malsburg Learning for Rotation and Localization Invariant Fingerprint Recognition and Localization with or without Occlusion”, 2014..
  22. Bhakta D., Sarker G., “A Boosting Based Multiple Classifier System with Modified RBFN for Facial Expression Identification”, 2014.
  23. Bhakta D., Sarker G., “A Method of Learning Based Boosting in Multiple Classifier for Clear and Occluded Face Identification”, 2015.
  24. Bhakta D., Sarker G., “A New Learning Based Boosting in Multiple Classifier System for Colour Facial Expression Identification”, 2015.
  25. Kundu S., Sarker G., “A Programming Based Boosting in Super Classifier for Fingerprint Recognition”, 2015.
  26. Sarker, G.,Bhakta, D., “An Unsupervised OCA based RBFN for Clear and Occluded Face Identification, Intelligent Computing and Applications”, Advances in Intelligent Systems and Computing, 2015.
  27. Kundu, S., Sarker, G., “A Modified SOM Based RBFN for Rotation Invariant Clear and Occluded Fingerprint Recognition”, Intelligent Computing and Applications, Advances in Intelligent Systems and Computing, 2015.
  28. Kundu, S., Sarker, G., “An Efficient Integrator Based on Template Matching Technique for Person Authentication using Different Biometrics”, Indian Journal of Science and Technology, 2016.
  29. Kundu, S., Sarker, G., “A Multi-level Integrator with Programming Based Boosting for Person Authentication using Different Biometrics”, 2016.
  30. Kundu, S., Sarker, G., “A Person Authentication System using a Biometric Based Efficient Multi-Level Integrator”, International Journal of Control Theory and Applications (IJCTA, 2017.
  31. Sarker, G. Bhakta, D. A Mega Super Classifier with Fuzzy Categorization in Face and Facial Expression Identification - Int. Journal of Engineering Trends and Technology, 2017.
  32. Kundu, S. Sarker G., “An Efficient Multi Classifier Based on Fast RBFN for Biometric Identification” (2014), Advanced Computing, Networking and Informatics - Vol. 1.
Index Terms

Computer Science
Information Sciences

Keywords

Recurrent CNN LSTM CNN based LSTM Performance Evaluation Accuracy