CFP last date
22 April 2024
Reseach Article

Text Extraction Techniques

Published on August 2016 by Yash Gupta, Shivani Sharma, Tushina Bedwal
National Seminar on Future Trends and Innovations in Computer Engineering
Foundation of Computer Science USA
NSFTICE2015 - Number 1
August 2016
Authors: Yash Gupta, Shivani Sharma, Tushina Bedwal

Yash Gupta, Shivani Sharma, Tushina Bedwal . Text Extraction Techniques. National Seminar on Future Trends and Innovations in Computer Engineering. NSFTICE2015, 1 (August 2016), 10-12.

author = { Yash Gupta, Shivani Sharma, Tushina Bedwal },
title = { Text Extraction Techniques },
journal = { National Seminar on Future Trends and Innovations in Computer Engineering },
issue_date = { August 2016 },
volume = { NSFTICE2015 },
number = { 1 },
month = { August },
year = { 2016 },
issn = 0975-8887,
pages = { 10-12 },
numpages = 3,
url = { /proceedings/nsftice2015/number1/25608-1523/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
%0 Proceeding Article
%1 National Seminar on Future Trends and Innovations in Computer Engineering
%A Yash Gupta
%A Shivani Sharma
%A Tushina Bedwal
%T Text Extraction Techniques
%J National Seminar on Future Trends and Innovations in Computer Engineering
%@ 0975-8887
%N 1
%P 10-12
%D 2016
%I International Journal of Computer Applications

As the growth of technology is emerging, It is beneficial for us to take put some innovative efforts for pulling this computer science field at a higher level. Text extraction is one of the recent growing technique to be enhanced further. Text Extraction is the process of extracting, evaluating and analyzing images. Detection, Localization, Binarization, Extraction, Enhancement, and Recognition are some of the steps to be involved in the process of text extraction. In today's challenging world this technique is a very cumbersome task to be performed because it indulges various activities like changes in fonts,size,orientation,text. There are many text extraction techniques that are based on connected component analysis, edge detection, morphological operators, wavelet transform, neural network, texture features etc. have been developed. In this paper we are providing some of the study of the techniques and comparison between various techniques such as region based technique, texture based technique and hybrid technique.

  1. Anhar Risnumawan, Palaiahankote Shivakumara, Chee Seng Chan and Chew Lim Tan, "A Robust Arbitrary Text Detection System For Natural Scene Images", Expert System with Application 41(2014) 8027-8048.
  2. Xu-Cheng Yin, Xuwang Yin, Kaizhu Huang, and Hong-Wei Hao, "Robust Text Detection in Natural Scene Images", IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 36, no. 5, May 2014.
  3. H. K. Kim, Efficient Automatic Text Location Method and Content-Based Indexing and Structuring Of Video Database, Journal of Visual Communication and Image Representation vol. 7, no. 4 ,1996, pp. 336–344.
  4. C. Y. Suen, L. Lam, D. Guillevic, N. W. Strathy, M. Cheriet, J. N. Said, and R. Fan, Bank Check Processing System, International Journal of Imaging Systems and Technology, vol. 7, No. 4 1996, pp. 392–403.
  5. D. S. Kim, S. I. Chien, Automatic Car License Plate Extraction using Modified Generalized Symmetry Transform and Image Warping, Proceedings of International Symposium on Industrial Electronics, Vol. 3, 2001, pp. 2022–2027.
  6. A. K. Jain, Y. Zhong, Page Segmentation using Texture Analysis, Pattern Recognition, Vol. 29, No. 5, Elsevier, 1996, pp. 743–770.
  7. T. N. Dinh, J. Park and G. S. Lee, Low-Complexity Text Extraction in Korean Signboards for Mobile Applications, IEEE International Conference on Computer and Information Technology, 2008, pp. 333-337.
  8. Q. Ye, Q. Huang, W. Gao, D. Zhao, Fast and Robust Text Detection in Images and Video Frames, Image and Vision Computing, Vol. 23, No. 6, Elsevier, 2005, pp. 565–576.
  9. Hassanzadeh, H. Pourghassem, Fast Logo Detection Based on Morphological Features in Document Image, 2011 IEEE 7th International Colloquium on Signal Processing and its Applications, 2011, pp. 283-286.
  10. Y. Song, A. Liu, L. Pang, S. Lin, Y. Zhang, S. Tang, A Novel Image Text Extraction Method Based on K-means Clustering, Seventh IEEE/ACIS International Conference on Computer and Information Science, 2008, pp. 185-190.
  11. W. Fan, J. Sun, Y. Katsuyama, Y. Hotta, S. Naoi, Text Detection in Images Based on Grayscale Decomposition and Stroke Extraction, Chinese Conference on Pattern Recognition, IEEE, 2009, pp. 1-4.
  12. N. Anupama, C. Rupa, E. S. Reddy, Character Segmentation for Telugu Image Document using Multiple Histogram Projections, Global Journal of Computer Science and Technology, Vol. 13, 2013, pp. 11-16.
Index Terms

Computer Science
Information Sciences


Text Extraction Detection Binarization. Edge Connected Component.