
Text Extraction Techniques

IJCA Proceedings on National Seminar on Future Trends and Innovations in Computer Engineering
© 2016 by IJCA Journal
NSFTICE 2015 - Number 1
Year of Publication: 2016
Yash Gupta
Shivani Sharma
Tushina Bedwal

Yash Gupta, Shivani Sharma and Tushina Bedwal. Article: Text Extraction Techniques. IJCA Proceedings on National Seminar on Future Trends and Innovations in Computer Engineering NSFTICE 2015(1):10-12, August 2016. Full text available.

BibTeX

	author = {Yash Gupta and Shivani Sharma and Tushina Bedwal},
	title = {Article: Text Extraction Techniques},
	journal = {IJCA Proceedings on National Seminar on Future Trends and Innovations in Computer Engineering},
	year = {2016},
	volume = {NSFTICE 2015},
	number = {1},
	pages = {10-12},
	month = {August},
	note = {Full text available}


As technology advances, innovative effort is needed to move the field of computer science forward. Text extraction is one of the recently growing techniques that merits further enhancement. It is the process of extracting, evaluating, and analyzing the text present in images. The process involves steps such as detection, localization, binarization, extraction, enhancement, and recognition. Text extraction remains a difficult task because it must handle variations in font, size, orientation, and the text itself. Many text extraction techniques have been developed, based on connected component analysis, edge detection, morphological operators, wavelet transforms, neural networks, texture features, and so on. In this paper we survey some of these techniques and compare region-based, texture-based, and hybrid techniques.
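To make two of the named steps concrete, the following is a minimal sketch of binarization followed by connected component analysis, the core of a simple region-based approach. It is an illustration only and not the method of any paper surveyed here. The function names, the fixed threshold of 128, and the size and aspect-ratio filter are all assumptions chosen for the example; practical systems typically use adaptive thresholds and richer features.

```python
from collections import deque


def binarize(gray, threshold=128):
    """Turn a grayscale image (rows of 0-255 ints) into 0/1 pixels.

    Assumption: text is darker than the background, so pixels below
    the threshold become foreground (1).
    """
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def connected_components(binary):
    """Return bounding boxes (top, left, bottom, right) of the
    4-connected foreground regions, in row-major scan order."""
    height = len(binary)
    width = len(binary[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for r in range(height):
        for c in range(width):
            if binary[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill from this unvisited foreground pixel.
            queue = deque([(r, c)])
            seen[r][c] = True
            top, left, bottom, right = r, c, r, c
            while queue:
                y, x = queue.popleft()
                top, bottom = min(top, y), max(bottom, y)
                left, right = min(left, x), max(right, x)
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and binary[ny][nx] == 1 and not seen[ny][nx]):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((top, left, bottom, right))
    return boxes


def filter_text_candidates(boxes, min_size=2, max_aspect=5.0):
    """Keep boxes whose size and shape are plausible for a character.

    The limits are hypothetical defaults, not values from the literature.
    """
    kept = []
    for top, left, bottom, right in boxes:
        h = bottom - top + 1
        w = right - left + 1
        if max(h, w) >= min_size and max(h, w) / min(h, w) <= max_aspect:
            kept.append((top, left, bottom, right))
    return kept
```

Calling `filter_text_candidates(connected_components(binarize(image)))` on a small grayscale array returns the bounding boxes of candidate character regions, with isolated single-pixel noise discarded by the size filter. Texture-based and hybrid techniques replace or augment the geometric filter with texture features computed over each region.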

