CFP last date
20 May 2024
Call for Paper
June Edition
IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2024

Submit your paper
Know more
Reseach Article

A Multiple Feature based Novel Approach for Identification of Printed Indian Scripts at Word Level

by Gopal Prasad, Atul Kumar Singh, Pawan Kumar
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 85 - Number 15
Year of Publication: 2014
Authors: Gopal Prasad, Atul Kumar Singh, Pawan Kumar
10.5120/14916-3462

Gopal Prasad, Atul Kumar Singh, Pawan Kumar . A Multiple Feature based Novel Approach for Identification of Printed Indian Scripts at Word Level. International Journal of Computer Applications. 85, 15 ( January 2014), 8-13. DOI=10.5120/14916-3462

@article{ 10.5120/14916-3462,
author = { Gopal Prasad, Atul Kumar Singh, Pawan Kumar },
title = { A Multiple Feature based Novel Approach for Identification of Printed Indian Scripts at Word Level },
journal = { International Journal of Computer Applications },
issue_date = { January 2014 },
volume = { 85 },
number = { 15 },
month = { January },
year = { 2014 },
issn = { 0975-8887 },
pages = { 8-13 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume85/number15/14916-3462/ },
doi = { 10.5120/14916-3462 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:02:31.809062+05:30
%A Gopal Prasad
%A Atul Kumar Singh
%A Pawan Kumar
%T A Multiple Feature based Novel Approach for Identification of Printed Indian Scripts at Word Level
%J International Journal of Computer Applications
%@ 0975-8887
%V 85
%N 15
%P 8-13
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In a country like India where different scripts are in use, automatic identification of printed script facilitates many important applications such as automatic transcription of multilingual documents and for the selection of script specific OCR in a multilingual environment. In this paper a novel method to identify the script type of the collection of documents printed in seven Indian languages at word level is proposed. These languages are Bangla, Hindi, English, Malayalam, Oriya, Tamil and Kannada. The recognition is based upon multiple features extracted using Discrete Cosine Transform (DCT) and Discrete Wavelet Transform (DWT). Script classification performance is analyzed using the K-nearest neighbor classifier by comparing the majority of voting's between the outputs of DCT and DWT based methods. The proposed scheme utilizes the strength of both the DCT and DWT based features. The results of experimentation found the overall accuracy to be 98. 11 % which show the superiority of the proposed multiple features based scheme over several existing schemes of script identification.

References
  1. J. S. Bridle, "Probabilistic Interpretation of Feedforward Classification Network Outputs, with Relationships to Statistical Pattern Recognition," Neurocomputing—Algorithms, Architectures and Applications, F. Fogelman-Soulie and J. Herault, eds. , NATO ASI Series F68, Berlin: Springer-Verlag, pp. 227-236, 1989.
  2. W. -K. Chen, Linear Networks and Systems. Belmont, Calif. : Wadsworth, pp. 123-135, 1993.
  3. H. Poor, "A Hypertext History of Multiuser Dimensions," MUD History, http://www. ccs. neu. edu/home/pb/mud-history. html. 1986.
  4. K. Elissa, "An Overview of Decision Theory," unpublished.
  5. R. Nicole, "The Last Word on Decision Theory," J. Computer Vision, submitted for publication.
  6. C. J. Kaufman, Rocky Mountain Research Laboratories, Boulder, Colo. , personal communication, 1992.
  7. D. S. Coming and O. G. Staadt, "Velocity-Aligned Discrete Oriented Polytopes for Dynamic Collision Detection," IEEE Trans. Visualization and Computer Graphics, vol. 14, no. 1, pp. 1-12, Jan/Feb 2008, doi:10. 1109/TVCG. 2007. 70405.
  8. S. P. Bingulac, "On the Compatibility of Adaptive Controllers," Proc. Fourth Ann. Allerton Conf. Circuits and Systems Theory, pp. 8-16, 1994.
  9. H. Goto, Y. Hasegawa, and M. Tanaka, "Efficient Scheduling Focusing on the Duality of MPL Representation," Proc. IEEE Symp. Computational Intelligence in Scheduling (SCIS '07), pp. 57-64, Apr. 2007, doi:10. 1109/SCIS. 2007. 367670.
  10. J. Williams, "Narrow-Band Analyzer," PhD dissertation, Dept. of Electrical Eng. , Harvard Univ. , Cambridge, Mass. , 1993.
  11. E. E. Reber, R. L. Michell, and C. J. Carter, "Oxygen Absorption in the Earth's Atmosphere," Technical Report TR-0200 (420-46)-3, Aerospace Corp. , Los Angeles, Calif. , Nov. 1988.
  12. L. Hubert and P. Arabie, "Comparing Partitions," J. Classification, vol. 2, no. 4, pp. 193-218, Apr. 1985.
  13. R. J. Vidmar, "On the Use of Atmospheric Plasmas as Electromagnetic Reflectors," IEEE Trans. Plasma Science, vol. 21, no. 3, pp. 876-880, available at http://www. halcyon. com/pub/journals/21ps03-vidmar, Aug. 1992.
  14. J. M. P. Martinez, R. B. Llavori, M. J. A. Cabo, and T. B. Pedersen, "Integrating Data Warehouses with Web Data: A Survey," IEEE Trans. Knowledge and Data Eng. , preprint, 21 Dec. 2007, doi:10. 1109/TKDE. 2007. 190746.
Index Terms

Computer Science
Information Sciences

Keywords

Multilingual Document Images Multi-scripts Images Discrete Cosine Transform (DCT) Discrete Wavelet Transform (DWT) Standard Deviation K-NN classifier.