CFP last date
22 April 2024
Reseach Article

A Novel Transcription Cataloging technique based on Combining Approach of Established Algorithms and Methodology for Deriving Unambiguous Results

Published on November 2011 by Harish V. Gorewar, Dr. M. Kumar
2nd National Conference on Information and Communication Technology
Foundation of Computer Science USA
NCICT - Number 5
November 2011
Authors: Harish V. Gorewar, Dr. M. Kumar
185d437d-fc10-4073-9682-1ee295c6f9df

Harish V. Gorewar, Dr. M. Kumar . A Novel Transcription Cataloging technique based on Combining Approach of Established Algorithms and Methodology for Deriving Unambiguous Results. 2nd National Conference on Information and Communication Technology. NCICT, 5 (November 2011), 12-15.

@article{
author = { Harish V. Gorewar, Dr. M. Kumar },
title = { A Novel Transcription Cataloging technique based on Combining Approach of Established Algorithms and Methodology for Deriving Unambiguous Results },
journal = { 2nd National Conference on Information and Communication Technology },
issue_date = { November 2011 },
volume = { NCICT },
number = { 5 },
month = { November },
year = { 2011 },
issn = 0975-8887,
pages = { 12-15 },
numpages = 4,
url = { /proceedings/ncict/number5/4215-ncict036/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 2nd National Conference on Information and Communication Technology
%A Harish V. Gorewar
%A Dr. M. Kumar
%T A Novel Transcription Cataloging technique based on Combining Approach of Established Algorithms and Methodology for Deriving Unambiguous Results
%J 2nd National Conference on Information and Communication Technology
%@ 0975-8887
%V NCICT
%N 5
%P 12-15
%D 2011
%I International Journal of Computer Applications
Abstract

This paper discusses a new approach for Transcription Cataloging (TC) based on combining efficient algorithms. The important aspect of automatically sorting and classifying a set of documents into any category by incorporating a predefined set is Transcription Cataloging. Automated Transcription Cataloging is gaining notability since it frees organizations from the hectic and time consuming need of manually organizing documents, which can be too expensive, or simply not feasible given the time constraints of the application or the number of documents involved. In terms of accuracy, modern Transcription Cataloging systems proves better than that of trained human professionals, which is made possible by a combination of information retrieval technology and machine learning technology in Transcription Cataloging approach. There are numerable useful applications of this approach spanning various scientific and general fields of work. This paper deals in depth the feasibility of Transcription Cataloging pertaining to various domains along with making substantial use of techniques like document indexing, text filtering and classifier learning technique. Also the approaches of standard input and tokenization are considered for a better output which shall be devoid of any complexity for Transcription Cataloging.

References
  1. D.D. Lewis, “Featurc Sclection and Featurc Extraction fur Tcxt Categorization,” Proc. Speech nnd Nniurnl LniipngP Workshop, pp.212-217, Arden House, 1992.
  2. Yang Y (1994), “Expert Network: Effective and efficient learning from human decisions in text categorization and retrieval,” Proc. of the seventeenth Int’l ACM SIGIR Conf. on Research and Devp. in information Retrieval, Dublin, pp.13-22.
  3. Salton. G., Wong. A. and Yang (1975),“A Vector Space Model for Automatic Indexing”, Communications of ACM, vol 18, No. 11, pp. 613–620.
  4. Cohen W, Singer Y. (1996), “Context-sensitive learning methods for text categorization,” Proc. of the 19th Int’l ACM SIGIR Conf. on Research and Development in Information Retrieval, Zurich, pp. 307~315.
  5. Yu Wang; Zheng-Ou Wang;( Aug. 2007) , "A Fast KNN Algorithm for Text Categorization," Machine Learning and Cybernetics, 2007 International Conference on , vol.6, no., pp.3436-3441, 19-22.
  6. T. Joachims (1998), “Text categorization with Support Vector Machines: Learning with many relevant features”, European Conference on Machine Learning (ECML ’98) Berlin, pp: 137–142.
  7. Sun Jian, Wang Wei, Zhong Yi-xin (2001), “Automatic Text Categorization Based on K Nearest Neighbor,”Journal of Beijing University of Posts and Telecommunications, vol. 24(1), pp.42-46.
  8. D. Koller and M. Sahami (1997), “Hierarchically Classifying Documents Using Very Few Words”, International Conference on Machine Learning (ICML’97) Nashville, vol- 6, pp.170–178.
  9. Thorsten Joachims (1997), “A probabilistic analysis of the Rocchio algorithm with TFIDF for text categorization”, ICML-97.
  10. Apte C. Damerau F, Weiss S (1994), “Automated learning of decision rules for text categorization,” ACM Transactions on information System, 12(3), pp. 233~251.
  11. L. Breiman, J. Friedman, R. Olshen, and C. Stone (1984), “Classification and Regression Trees”, Wadsworth, Monterrey, Ca.
  12. J. He and A.H. Tan and C.L. Tan (2003), “Machine Learning Methods for Document Categorization”, Journal of Applied Intelligence, vol. 18, pp. 613- 6 17.
  13. Kamal Nigam,Andrew McCallumzy, “A Comparison of Event Models for Naive Bayes Text Classification”, School of Computer Science Carnegie Mellon University, Pittsburgh, PA 15213.
Index Terms

Computer Science
Information Sciences

Keywords

Transcription cataloging Fast-KNN & Naïve Bayesian algorithms pair-relation deductive inference