Call for Paper - September 2022 Edition
IJCA solicits original research papers for the September 2022 Edition. Last date of manuscript submission is August 22, 2022. Read More

Handwritten Document Retrieval System for Tamil Language

International Journal of Computer Applications
© 2011 by IJCA Journal
Number 1 - Article 1
Year of Publication: 2011
AN. Sigappi
S. Palanivel
V. Ramalingam

AN. Sigappi, S Palanivel and V Ramalingam. Article:Handwritten Document Retrieval System for Tamil Language. International Journal of Computer Applications 31(4):42-47, October 2011. Full text available. BibTeX

	author = {AN. Sigappi and S. Palanivel and V. Ramalingam},
	title = {Article:Handwritten Document Retrieval System for Tamil Language},
	journal = {International Journal of Computer Applications},
	year = {2011},
	volume = {31},
	number = {4},
	pages = {42-47},
	month = {October},
	note = {Full text available}


The paper attempts to create a handwritten document retrieval system suitable for Tamil language, with a view to record traditional literature content for future reference. It projects a search mechanism to access the query word images using a statistical model based methodology. The scheme revolves around a well defined procedure which results in word models from where the search word can be recognised and the relevant documents retrieved. The approach involves the use of hidden Markov models (HMM) to characterize the features of the dynamically varying strokes of handwritten characters. The strategy is investigated for a sample document set over a commonly used literature. The results reveal that the system yields a reasonable performance with considerable accuracy. The highlight of this procedure is that it can effectively segment differently written words from text lines in a document and imbibes in it a flexibility to cover a wide range of tilts in the strokes that are attached to the different words.


  • Konstantinos Zagoris, Kavallieratou Ergina, Nikos Papamarkos. 2010. A document image retrieval system. Engineering Applications of Artificial Intelligence. Vol. 23, No.6, 872-879 .
  • Huaigu Cao, Anurag Bhardwaj, Venu Govindaraju. 2009. A probabilistic method for keyword retrieval in handwritten document images. Pattern Recognition. Vol. 42, No.12, 3374-3382.
  • G. Louloudis, B. Gatos, I. Pratikakis, C. Halatsis. 2009. Text line and word segmentation of handwritten documents. Pattern Recognition. Vol. 42, No.12, 3169-3183.
  • Kaustubh Bhattacharyya, Kandarpa Kumar Sarma. 2009. ANN-based innovative segmentation method for handwritten test in Assamese. Intl. Journal of Computer Science Issues. Vol. 5, 9-16.
  • Million Meshesha, C. V. Jawahar. 2008. Matching word images for content based retrieval from printed document images. International Journal on Document Analysis and Recognition. Vol. 11, No.1, 29-38.
  • Jose A. Rodriguez, Florent Perronnin, Gemma Sanchez, Josep Llados. 2008. Unsupervised writer style adaptation for hand written word spotting. 19th International Conference on Pattern Recognition. 1-4.
  • Rafael C. Gonzalez, Richard E. Woods. 2008. Digital Image Processing. Prentice Hall.
  • Richard O. Duda, Peter E. Hart, David G. Stork. 2007. Pattern Classification. Wiley-India.
  • Gregory R. Ball, Sargur N. Srihari, Harish Srinivasan. 2006. Segmentation-based and Segmentation-free methods for spotting handwritten Arabic words. 10th International Workshop on Frontiers in Handwriting Recognition. 53-58.
  • Sargur Srihari, Chen Huang, Harish Srinivasan. 2005. A search engine for handwritten documents. In Proc. of Document Recognition and Retrieval. 66-75.
  • Toni M. Rath, R. Manmatha. 2003. Features for word spotting in historical manuscripts. ICDAR. 218-222.
  • O Due Trier, Anil K. Jain, Torfin Taxt. 1996. Feature extraction methods for character recognition: A Survey. Pattern Recognition. Vol. 29, No.4, 641-662.
  • Lawrence R. Rabiner. 1989. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE. Vol. 77, No.2, 257-285.