CFP last date
22 April 2024
Reseach Article

A Novel Approach for Word Segmentation in Correlation based OCR System

by Sonam Jain, Harwinder Singh Sohal
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 99 - Number 18
Year of Publication: 2014
Authors: Sonam Jain, Harwinder Singh Sohal
10.5120/17471-8325

Sonam Jain, Harwinder Singh Sohal . A Novel Approach for Word Segmentation in Correlation based OCR System. International Journal of Computer Applications. 99, 18 ( August 2014), 12-20. DOI=10.5120/17471-8325

@article{ 10.5120/17471-8325,
author = { Sonam Jain, Harwinder Singh Sohal },
title = { A Novel Approach for Word Segmentation in Correlation based OCR System },
journal = { International Journal of Computer Applications },
issue_date = { August 2014 },
volume = { 99 },
number = { 18 },
month = { August },
year = { 2014 },
issn = { 0975-8887 },
pages = { 12-20 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume99/number18/17471-8325/ },
doi = { 10.5120/17471-8325 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:28:31.197918+05:30
%A Sonam Jain
%A Harwinder Singh Sohal
%T A Novel Approach for Word Segmentation in Correlation based OCR System
%J International Journal of Computer Applications
%@ 0975-8887
%V 99
%N 18
%P 12-20
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper introduces a novel approach for word segmentation in OCR system. Segmentation is one of the substantial sub-processes of the OCR system. The meaning of the word can be changed if segmented word is not correct. An approach of segmentation is formulated in which textual area of image is crimped as one large window . Then large window is divided into small windows of different lines and words are segmented out of each line as sub windows to each small window. Then characters are segmented from sub-windows for recognition. The proposed word segmentation technique works efficiently for variable word spaces.

References
  1. Casey, R. G. and Lecolinet, E. , "A Survey of Methods and Strategies in Character Segmentation", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 18, no. 8, pp. 690-706, 1996.
  2. Issam Bazzi, Richard Schwartz and John Makhoul "An Omnifont Open-Vocabulary OCR Systemfor English and Arabic" IEEE Transactions On Pattern Analysis And Machine Intelligence, Vol. 21, No. 6, 1999.
  3. Rejean Plamondon, Sargur N. Srihari "On-Line and Off-Line Handwriting Recognition:A Comprehensive Survey" 1EEE Transactions On Pattern Analysis And Machine Intelligence. Vol. 22 , No. 1,2000.
  4. A Cheung, M. Bennamoun, N. W. Bergmann "An Arabic Optical Character Recognition system using recognition-based segmentation" Pattern Recognition Society. Published by Elsevier Science Ltd. , pp 215-233, 2001.
  5. N. Arica and Fatos T. Yarman-Vural "An Overview of Character Recognition Focused on Off-Line Handwriting" IEEE Transactions On Systems, Man And Cybernetics—Part C: Applications And Reviews, Vol. 31, No. 2,2001.
  6. Rajiv Kumar, Amardeep Singh "Detection and Segmentation of Lines and Words in Gurmukhi Handwritten Text" IEEE, 2nd International Advance Computing Conference, pp 353-356, 2010.
  7. Nikos Nikolaou,Michael Makridis,Basilis Gatos, Nikolaos Stamatopoulos , NikosPapamarkos "Segmetat-ion of historical machine-printed documents using Adaptive Run Length Smoothing and skeleton segmentation paths"ELSEVIER, Image and Vision Computing 28 2010) 590–604.
  8. Pranob K Charles, V. Harish, M. Swathi, CH. Deepthi "A Review on the Various Techniques used for Optical Character Recognition" International Journal of Engineering Research and Applications (IJERA) , Vol. 2, Issue 1 , pp. 659-662 659, 2012.
  9. Safwa Taha, Yusra Babiker and Mohamed Abbas "Optical Character Recognition of Arabic Printed Text" IEEE Student Conference On Research And Development,pp 235-240,2012
  10. Gaurav Singla, Dr. Parmod Kumar "Extract the Punjabi Word from Machine Printed Document Images" Int. Journal of Engineering Research and Application Vol. 3, Issue 5, pp. 343-348,2013.
  11. Ravina Mithe, Supriya Indalkar, Nilam Divekar "Optical Character Recognition" International Journal of RecentTechnology and Engineering (IJRTE), Vol-2, Issue-1, 2013.
  12. Nafiz Arica and Fatos T. Yarman-Vura (2001) "An Overview of Character Recognition Focused on Off-Line Handwriting" IEEE Transactions On Systems, Man And Cybernetics—Part C: Applications And Reviews, Vol. 31, No. 2.
  13. G S Lehal and Chandan Singh (2002), "A post-processor for Gurmukhi OCR" Sadhana, vol. 27, pp. 99–111.
Index Terms

Computer Science
Information Sciences

Keywords

Word Segmentation OCR Recognition.