Recognition of Similar appearing Gujarati Characters using Fuzzy-KNN Algorithm

International Journal of Computer Applications
© 2012 by IJCA Journal
Volume 55 - Number 6
Year of Publication: 2012
Amit H. Choksi
Shital P. Thakkar

Amit H Choksi and Shital P Thakkar. Article: Recognition of Similar appearing Gujarati Characters using Fuzzy-KNN Algorithm. International Journal of Computer Applications 55(6):12-17, October 2012.

BibTeX

	author = {Amit H. Choksi and Shital P. Thakkar},
	title = {Article: Recognition of Similar appearing Gujarati Characters using Fuzzy-KNN Algorithm},
	journal = {International Journal of Computer Applications},
	year = {2012},
	volume = {55},
	number = {6},
	pages = {12-17},
	month = {October},
	note = {Full text available}


This paper describes optical character recognition (OCR) of similar-appearing characters in Gujarati, one of the languages of India. Recognition accuracy for Gujarati script suffers because many of its characters are very similar in shape. To handle this problem, a Fuzzy KNN classifier is paired with two different feature sets: geometric features and wavelet features. Fuzzy KNN not only labels the class of the pattern to be identified, it also assigns the strength of that pattern's membership in the class, which makes it suitable for imprecise class boundaries. The test data for similar-appearing characters were collected from various sources, such as scanned pages of Gujarati textbooks and newspapers. The training data set was prepared by typing Gujarati characters in different font types and sizes and then scanning them.
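The abstract does not give the classifier's details, so the following is only a minimal sketch of how a fuzzy KNN can return a membership strength per class instead of a single label. It assumes the common inverse-distance weighting with a fuzzifier `m`, crisp training labels, and Euclidean distance. The function name `fuzzy_knn`, the parameters `k`, `m`, and `eps`, and the two-dimensional toy vectors are all hypothetical; they stand in for the geometric or wavelet features and are not taken from the paper.

```python
import math
from collections import defaultdict


def fuzzy_knn(train, query, k=3, m=2.0, eps=1e-9):
    """Return {label: membership} for `query` (hypothetical helper).

    train: list of (feature_vector, label) pairs with crisp labels.
    m:     fuzzifier (> 1); m = 2 weights a neighbour by 1 / distance**2.
    eps:   small constant that avoids division by zero on exact matches.
    """
    # Keep the k training samples closest to the query (Euclidean distance).
    neighbours = sorted(
        ((math.dist(vec, query), label) for vec, label in train),
        key=lambda pair: pair[0],
    )[:k]

    # Accumulate an inverse-distance weight for each neighbour's class.
    weights = defaultdict(float)
    total = 0.0
    for dist, label in neighbours:
        w = 1.0 / (dist ** (2.0 / (m - 1.0)) + eps)
        weights[label] += w
        total += w

    # Normalise so the memberships over the neighbouring classes sum to 1.
    return {label: w / total for label, w in weights.items()}


# Toy data: two made-up classes of visually similar characters.
train = [
    ((0.0, 0.0), "class_a"),
    ((0.0, 1.0), "class_a"),
    ((4.0, 0.0), "class_b"),
    ((5.0, 0.0), "class_b"),
]
memberships = fuzzy_knn(train, (1.0, 0.0), k=3)
best = max(memberships, key=memberships.get)
print(best, memberships)
```

A membership close to 1 indicates a confident assignment, while two classes with comparable memberships flag a pattern that lies near an imprecise class boundary, which is the situation similar-appearing characters tend to produce.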

