Call for Paper - March 2023 Edition
IJCA solicits original research papers for the March 2023 Edition. Last date of manuscript submission is February 20, 2023. Read More

A Study of different Text Line Extraction Techniques for Multi-font and Multi-size Printed Kannada Documents

Print
PDF
International Journal of Computer Applications
© 2015 by IJCA Journal
Volume 119 - Number 11
Year of Publication: 2015
Authors:
R Prajna
Ramya V R
Mamatha H. R
10.5120/21113-3923

R Prajna, Ramya V R and Mamatha H.r. Article: A Study of different Text Line Extraction Techniques for Multi-font and Multi-size Printed Kannada Documents. International Journal of Computer Applications 119(11):32-38, June 2015. Full text available. BibTeX

@article{key:article,
	author = {R Prajna and Ramya V R and Mamatha H.r},
	title = {Article: A Study of different Text Line Extraction Techniques for Multi-font and Multi-size Printed Kannada Documents},
	journal = {International Journal of Computer Applications},
	year = {2015},
	volume = {119},
	number = {11},
	pages = {32-38},
	month = {June},
	note = {Full text available}
}

Abstract

Line and word segmentation is one of the important step of OCR systems. For the identification of printed characters of non-Indian languages like English, Japanese, Chinese Optical Character Recognition (OCR) systems have been effectively developed. For Indian languages, efforts are on the way for the development of efficient OCR systems, mainly for Kannada, one of the popular South Indian language . In this paper we have proposed a robust method for extraction of individual text lines for printed kannada documents based on the efficient segmentation methodologies such as morphology operations based projection profile,horizontal projection profile and bounding box.

References

  • Nallapareddy Priyanka, Srikanta Pal, Ranju Mandal "Line and Word Segmentation Approach for Printed Documents", IJCA Special Issue on Recent Trends in Image Processing and Pattern Recognition-RTIPPR,2010, pp 30-36
  • Sunanda dixit, Suresh Hosahalli Narayana, Mahesh Belur "Kannada text line extraction based on energy minimization and skew correction". IEEE International Advance Computing Conference (IACC) ,2014.
  • B. Gangamma, Srikanta Murthy K, Riddhi J. Shah, Swati D V "Text Line Extraction from Palm Script Documents Using Morphological Approach", International Conference on Computer Engineering and Applications Dubai,2012,1452-1455.
  • Vikas J Dongre , Vijay H Mankar "Devnagari document segmentation using histogram approach". International Journal of Computer Science, Engineering and Information Technology (IJCSEIT), Vol. 1, No. 3, August 2011,46-53.
  • Alireza Alaei, P. Nagabhushan, Umapada Pal "A Benchmark Kannada Handwritten Document Dataset and its Segmentation", International Conference on Document Analysis and Recognition,2011.
  • U. Pal and B. B. Chaudhuri "Script Line Separation From Indian Multi-Script Documents". In Proc. 4thICDAR,1999.
  • R. Sanjeev Kunte, R. D. Sudhaker Samuel "An OCR system for printed Kannada text using Two-stage Multi-network classification approach employing Wavelet features", International Conference on Computational Intelligence and Multimedia Applications 2007.
  • Mamatha Hosalli Ramappa and Srikantamurthy Krishnamurthy "Skew Detection, Correction and Segmentation of Handwritten Kannada Document", International Journal of Advanced Science and Technology Vol. 48, November, 2012.