CFP last date
20 May 2024
Reseach Article

Preferred Computational Approaches for the Recognition of different Classes of Printed Malayalam Characters using Hierarchical SVM Classifiers

by Bindu Philip, R. D. Sudhaker Samuel
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 1 - Number 16
Year of Publication: 2010
Authors: Bindu Philip, R. D. Sudhaker Samuel
10.5120/350-530

Bindu Philip, R. D. Sudhaker Samuel . Preferred Computational Approaches for the Recognition of different Classes of Printed Malayalam Characters using Hierarchical SVM Classifiers. International Journal of Computer Applications. 1, 16 ( February 2010), 5-10. DOI=10.5120/350-530

@article{ 10.5120/350-530,
author = { Bindu Philip, R. D. Sudhaker Samuel },
title = { Preferred Computational Approaches for the Recognition of different Classes of Printed Malayalam Characters using Hierarchical SVM Classifiers },
journal = { International Journal of Computer Applications },
issue_date = { February 2010 },
volume = { 1 },
number = { 16 },
month = { February },
year = { 2010 },
issn = { 0975-8887 },
pages = { 5-10 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume1/number16/350-530/ },
doi = { 10.5120/350-530 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T19:42:36.855145+05:30
%A Bindu Philip
%A R. D. Sudhaker Samuel
%T Preferred Computational Approaches for the Recognition of different Classes of Printed Malayalam Characters using Hierarchical SVM Classifiers
%J International Journal of Computer Applications
%@ 0975-8887
%V 1
%N 16
%P 5-10
%D 2010
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Characterization of matrices for efficient classification has several options. There are various alternatives depending on the structure of the matrix. Different features can be adapted in different situations. Image recognition and in particular character recognition is an excellent example where large number of image matrices need to be stored and retrieved often at high speed, at the same time performing computational tasks, resulting in requirements of huge memory and computation time. Near 100% character segmentation accuracy is achieved based on a novel segmentation technique. Here feature extraction is based on the distinctive structural features of machine-printed text lines in these scripts. The final recognition is achieved through Support Vector Machine (SVM) classifiers. The proposed algorithms have been tested on a variety of printed Malayalam documents. Recognition rates between 97.72% and 98.78% have resulted.

References
  1. Golub G.H. and Loan V.C.F. Matrix Computations. The John Hopkins Press, 1989.
  2. Eldén L Numerical linear algebra in data mining. Acta Numerica 15, 327-384, Cambridge University Press, 5/2006
  3. Intel Open Source Computer Vision Library http://www. intel.com/technology/computing/opencv/index.htm
  4. Aparna K G, Ramakrishnan A G, "A complete Tamil Optical Character Recognition System",5th International Workshop on Document Analysis Systems DAS 2002, Princeton, NJ, USA,2002, pp. 53-57.
  5. Ashwin T V, P S Sastry "A font and size independent OCR system for printed Kannada documents using support vector machines", Sadhana, Vol. 27, Part 1, February 2002, pp. 35-58.
  6. Burges C. J. C, .A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 1998, pp 121-167.
  7. Jain A.K and Taxt T, ''Feature extraction methods for character recognition-A Survey'', Pattern Recognition, vol. 29, no. 4, pp. 641-662., 1996,
  8. Pal.U, Chaudhuri B. B, ''Indian Script Character recognition: A survey'', Pattern Recognition, vol. 37, pp. 1887-1899, 2004.
  9. Negi Atul, Chakravarthy Bhagvati and Krishna B 2001 An OCR system for Telugu. Proc. Of 6th Int. Conf. on Document Analysis and Recognition IEEE Comp. Soc. Press, USA,. pp. 1110-1114.
  10. Seethalakshmi R., Sreeranjani T.R., Balachandar T., Singh A, Singh M, Ratan R, Kumar S, 2005 "Optical Character Recognition for printed Tamil text using Unicode" Journal of Zhejiang University SCI 6A(11) pp.1297-1305
  11. Shivsubramani K, Loganathan R, Srinivasan CJ, Ajay V, Soman KP "Multiclass Hierarchical SVM for Recognition of Printed Tamil Characters" twentieth international conference on artificial intelligence, Hyderabad, India, January 2007, pp 93-97.
  12. Jawahar C. V., Pavan K M. N. S. S. K, and S. S. Kiran R, "Recognition of Indian Language Characters using Support Vectors Machines," Technical Report TR-CVIT-22, International Institute of Information Technology, Hyderabad, 2002.
  13. John R, Raju G and Guru D. S, "1D Wavelet Transform of Projection Profiles for Isolated Handwritten Malayalam Character Recognition", Proc. of International Conference on Computational Intelligence and Multimedia Applications 2007, Sivakashi, IEEE computer society Press, 2007, pp 481-485,.
  14. Lajish V. L, Suneesh T.K.K. and Narayanan N.K., "Recognition of Isolated Handwritten Character Images using Kolmogrov-Smirnov Statistical Classifier and k-nearest Neighbour classifier", Proc. Of the International Conference on Cognition and Recognition ICCR-05, Mandya, Karnataka, December, 2005
  15. Janardhanan P. S. Issues in the development of OCR systems for Dravidian languages - proceedings of Akshara 94, BPB Publications, New Delhi, India 1994.
  16. Malayalam standardization report May 2001.
  17. Joachims T 1999 SVMlight. http://www-ai.informatik.uni-dortmund.de/forschung/verfahren/svmlight/svmlight.eng.html
Index Terms

Computer Science
Information Sciences

Keywords

Pattern recognition character classification segmentation Optical character recognition Singular value decomposition marginal frequency Support vector machine classifier Malayalam OCR