CFP last date
20 May 2024
Reseach Article

Detecting Handwritten Text from Forms using Deep Learning

by Shailendra Singh Kathait, Chirag Sehra
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 175 - Number 34
Year of Publication: 2020
Authors: Shailendra Singh Kathait, Chirag Sehra
10.5120/ijca2020919760

Shailendra Singh Kathait, Chirag Sehra . Detecting Handwritten Text from Forms using Deep Learning. International Journal of Computer Applications. 175, 34 ( Dec 2020), 7-14. DOI=10.5120/ijca2020919760

@article{ 10.5120/ijca2020919760,
author = { Shailendra Singh Kathait, Chirag Sehra },
title = { Detecting Handwritten Text from Forms using Deep Learning },
journal = { International Journal of Computer Applications },
issue_date = { Dec 2020 },
volume = { 175 },
number = { 34 },
month = { Dec },
year = { 2020 },
issn = { 0975-8887 },
pages = { 7-14 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume175/number34/31667-2020919760/ },
doi = { 10.5120/ijca2020919760 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:40:13.524466+05:30
%A Shailendra Singh Kathait
%A Chirag Sehra
%T Detecting Handwritten Text from Forms using Deep Learning
%J International Journal of Computer Applications
%@ 0975-8887
%V 175
%N 34
%P 7-14
%D 2020
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Digital Image Processing is an expeditiously emerging field possessing a large number of applications in science and engineering aspects. One of the most used applications in almost every sector is Optical Character Recognition (OCR). OCR is the electronic conversion of handwritten text into digital format which makes information processing from printed papers to data records easy, thus helping to electronically edit, search and store printed texts into machines. This text can then be used in variety of applications like machine translation, speech-to-text, pattern recognition etc. OCR as a piece of software applies pre-processing to improve the recognition in images. This pre-processing step includes skewness correction, despeckling, layout analysis and line and word detection. OCR saves tons of manual effort by recognizing handwritten text with word level detection resulting in an accuracy of 81% to 90%. With form processing, one can capture information in digital format that can save time, labor and money. This helps in achieving a better accuracy in detection. Such systems range from minor application forms to large scale survey forms. Deep Learning algorithms dealing with computer vision related tasks can be used to build a recognition engine.

References
  1. Shailendra Singh Kathait and Shubhrita Tiwari. Application of Image Processing and Convolution Networks in Intelligent Character Recognition for Digitized Forms Processing. Valiance Solutions Pvt. Ltd., Noida, Uttar Pradesh, 2018
  2. Dipti Deodhare, NNR Ranga Suri and R. Amit. Preprocessing and Image Enhancement Algorithms for a Form-based Intelligent Character Recognition System. Technomathematics Research Foundation, 2005
  3. Maureen Caudill. Neural Network Primer: Part I. AI Expert, Feb. 1989
  4. Ian Goodfellow and Yoshua Bengio and Aaron Courville. Deep Learning MIT Press, http://www.deeplearningbook.org, 2016
  5. Alex Krizhevsky, Geoffrey E. Hinton and Ilya Sutskever. ImageNet Classification with Deep Convolutional Neural Networks. University of Toronto.
  6. Christian Szegedy, Vincent Vanhoucke, Sergey Ioffe, Jonathon Shlens, Zbigniew Wojna Rethinking the Inception Architecture for Computer Vision https://arxiv.org/abs/1512.00567
  7. Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich Going Deeper with Convolutions
  8. Karen Simonyan, Andrew Zisserman Very Deep Convolutional Networks for Large-Scale Image Recognition https://arxiv.org/abs/1409.1556
  9. ImageNet: 2016 Stanford Vision Lab, Stanford University, Princeton University http://www.image-net.org/
  10. Tensorflow https://www.tensorflow.org/
  11. David G. Lowe Distinctive Image Features from Scale-Invariant Keypoints https://www.cs.ubc.ca/~lowe/papers/ijcv04.pdf
  12. Jie Hu, Li Shen, Samuel Albanie, Gang Sun, Enhua Wu Squeeze-and Excitation Networks https://arxiv.org/abs/1709.01507
  13. Lisa Torrey and Jude Shavlik Transfer Learning, Handbook of Research on Machine Learning Applications, 2009. http://ftp.cs.wisc.edu/machine-learning/shavlik-group/torrey.handbook09.pdf
  14. Django https://www.djangoproject.com/
  15. Amazon Web Services https://aws.amazon.com/ec2/
  16. Artifical Neural Network https://en.wikipedia.org/wiki/Artificial neural network.
  17. Fjoder Van Veen The Neural Network Zoo https://www.asimovinstitute.org/neural-network-zoo/
  18. Google Cloud https://cloud.google.com/tpu/docs/inception-v3-advanced
Index Terms

Computer Science
Information Sciences

Keywords

Image Processing Intelligent Character Recognition Optical Character Recognition Optical Mark Recognition Form Handling.