CFP last date
20 May 2024
Reseach Article

Method for Line Segmentation in Handwritten Documents with Touching and Broken Parts in Devanagari Script

by Shafali Goyal, Ashok Kumar Bathla
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 102 - Number 12
Year of Publication: 2014
Authors: Shafali Goyal, Ashok Kumar Bathla
10.5120/17868-8794

Shafali Goyal, Ashok Kumar Bathla . Method for Line Segmentation in Handwritten Documents with Touching and Broken Parts in Devanagari Script. International Journal of Computer Applications. 102, 12 ( September 2014), 22-27. DOI=10.5120/17868-8794

@article{ 10.5120/17868-8794,
author = { Shafali Goyal, Ashok Kumar Bathla },
title = { Method for Line Segmentation in Handwritten Documents with Touching and Broken Parts in Devanagari Script },
journal = { International Journal of Computer Applications },
issue_date = { September 2014 },
volume = { 102 },
number = { 12 },
month = { September },
year = { 2014 },
issn = { 0975-8887 },
pages = { 22-27 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume102/number12/17868-8794/ },
doi = { 10.5120/17868-8794 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:32:57.169902+05:30
%A Shafali Goyal
%A Ashok Kumar Bathla
%T Method for Line Segmentation in Handwritten Documents with Touching and Broken Parts in Devanagari Script
%J International Journal of Computer Applications
%@ 0975-8887
%V 102
%N 12
%P 22-27
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Now days, a vast research is going in Optical Character Recognition (OCR) of handwritten Documents in Indian scripts. A lot of handwritten data is existed in Devanagari script which is still to be recognized. Segmentation is the key step of OCR process. Segmentation is the process of extracting the valuable segments from the text document which are used in the process of recognition of characters. Line segmentation is the process of segmenting the text document into lines. Afterwards, word segmentation and character segmentation is carried out. This paper only deals with the Line segmentation of handwritten documents in Hindi. Devanagari script is the basic script to write Hindi, Marathi, Sanskrit and Nepali languages. In this paper the brief introduction of various existing techniques for segmentation of handwritten text is discussed. Also, develops an algorithm for segmentation of skewed lines, touching lines present in the text document and broken parts in upper modifiers or space present between the upper modifiers. This algorithm is implemented on large database collected from various writers. The proposed algorithm integrated the Projection based method, gap detection between text lines and neighbor pixel analysis method.

References
  1. U. Pal and B. Chaudhuri, "Indian Script Character Recognition: a survey", Computer Vision and Pattern Recognition Unit, Vol. 37, pp. 1887-1899, 2004.
  2. Vijay Kumar and Pankaj K. Sengar, "Segmentation of Printed Text in Devanagari Script and Gurumukhi Script" International Journal of Computer Applications, Vol. 3, No. 8, June 2010.
  3. M. K. Jindal, R. K. Sharma and G. S. Lehal "Segmentation of Horizontally Overlapping Lines in Printed Indian Scripts" International Journal of Computational Intelligence Research, Vol. 3, No. 4, pp. 277–286,2007.
  4. Naresh Kumar Garg, Lakhwinder Kaur and M. k Jindal "The Hazards in Segmentation of Handwritten Hindi Text" International Journal of Computer Applications, Vol. 29, No. 2, Sept. 2011.
  5. Ashwin S Ramteke and Milind E Rane, "Offline Handwritten Devanagari Script Segmentation" International Journal of Scientific & Technology Research Volume 1, Issue 4, MAY 2012.
  6. Bidyut B. Chaudhuri and Sumedha Bera "Handwritten Text Line Identification In Indian Scripts" 10th International Conference on Document Analysis and Recognition, IEEE, 2009
  7. Naresh Kumar Garg, Lakhwinder Kaur and M. k Jindal "Segmentation of Handwritten Hindi Text" International Journal of Computer Applications, Vol. 1, No. 4, 2010.
  8. Naresh Kumar Garg, Lakhwinder Kaur and M. k Jindal "A New Method for Line Segmentation of Handwritten Hindi Text", IEEE, 2010.
  9. Miss Vandana M. Ladwani and Mrs. Latesh Malik, "Novel Approach to Segmentation of Handwritten Devnagari Word", Third International Conference on Emerging Trends in Engineering and Technology, IEEE, 2010.
  10. N. Tripathy and U. pal, "Handwritten Segmentation of Unconstrained Oriya text", International Workshop on Fronteirs in Handwriting Recognition, pp. 306-311, 2004.
  11. Satadal Saha, Subhadip Basu, Mita Nasipuri and Dipak Kr. Basu, "A Hough Transform based Technique for Text Segmentation", Journal of Computing, Vol. 2, ISSUE 2, Feb 2010.
  12. Partha Pritam Roy, Umapada Pal and Josep Llados, "Morphology Based Handwritten Line Segmentation Using Foreground and Background Information", ICFHR, 2008.
  13. Vasant Manohar, Shiv N. Vitaladevuni, Huaigu Cao, Rohit Prasad and Prem Natarajan, "Graph Clustering-based Ensemble Method for Handwritten Text Line Segmentation" International Conference on Document Analysis and Recognition, IEEE 2011.
  14. Dharam Veer Sharma and Gurpreet Singh Lehal "An Iterative Algorithm for Segmentation of Isolated Handwritten Words in Gurmukhi Script" The 18th International Conference on Pattern Recognition, IEEE, 2006.
  15. Rajiv Kumar and Amardeep Singh "Detection and Segmentation of Lines and Words in Gurmukhi Handwritten Text", IEEE, 2010.
  16. Namisha Modi and Khushneet Jindal, "Text Line Detection and Segmentation in Handwritten Gurumukhi Scripts", IJARCSSE, vol. 3, issue-5, May 2013.
  17. U. Pal and Sagarika Datta, "Segmentation of Bangla Unconstrained Handwritten Text", Proceedings of the Seventh International Conference on Document Analysis and Recognition, 2003.
  18. Saiprakash Palakollu, Renu Dhir and Rajneesh Rani, "Handwritten Hindi Text Segmentation Techniques for Lines and Characters" Proceedings of the World Congress on Engineering and Computer Science, San Francisco, USA, Vol. I Oct. 2012.
  19. Vikas J Dongre and Vijay H Mankar, "Devanagari Document Segmentation Using Histogram Approach" International Journal of Computer Science, Engineering and Information Technology, Vol. 1, No. 3, August 2011.
  20. Mamatha H R and Srikantamurthy K, "Morphological Operations and Projection Profiles based Segmentation of Handwritten Kannada Document" International Journal of Applied Information Systems, Vol. 4, No. 5, Oct. 2012.
Index Terms

Computer Science
Information Sciences

Keywords

Modifiers Devanagari OCR Line Segmentation Word segmentation Character Segmentation and Recognition