CFP last date
20 May 2024
Call for Paper
June Edition
IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2024

Submit your paper
Know more
Reseach Article

Analysis of BMW Model for Title Word Selection on Indic Script

by P. Vijayapal Reddy, B. Vishnu Vardhan, A. Govardhan
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 18 - Number 8
Year of Publication: 2011
Authors: P. Vijayapal Reddy, B. Vishnu Vardhan, A. Govardhan
10.5120/2304-2915

P. Vijayapal Reddy, B. Vishnu Vardhan, A. Govardhan . Analysis of BMW Model for Title Word Selection on Indic Script. International Journal of Computer Applications. 18, 8 ( March 2011), 21-25. DOI=10.5120/2304-2915

@article{ 10.5120/2304-2915,
author = { P. Vijayapal Reddy, B. Vishnu Vardhan, A. Govardhan },
title = { Analysis of BMW Model for Title Word Selection on Indic Script },
journal = { International Journal of Computer Applications },
issue_date = { March 2011 },
volume = { 18 },
number = { 8 },
month = { March },
year = { 2011 },
issn = { 0975-8887 },
pages = { 21-25 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume18/number8/2304-2915/ },
doi = { 10.5120/2304-2915 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:05:43.732147+05:30
%A P. Vijayapal Reddy
%A B. Vishnu Vardhan
%A A. Govardhan
%T Analysis of BMW Model for Title Word Selection on Indic Script
%J International Journal of Computer Applications
%@ 0975-8887
%V 18
%N 8
%P 21-25
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

A title is a short summary that represents document’s main theme. Title can help the reader to have the main idea without reading the entire document. To generate a title for a document, we have to select appropriate words as title words and put them in sequence. The process of generating title for a given document by using machine, can be done by using summarization approaches or by using Statistical approaches or by combing both. For a given document, selecting appropriate words for generating a title by using any available approach mainly depends on the characteristics of the language. In this paper ,we have examined the influence of the language characteristics in the process of title word selection by using the Naïve Bayes probabilistic approach ( called BMW Model ) on the documents which are available in the language ' Telugu '. And also we have investigated the influence of word weight for the selection of title words in BMW Model. By using F1 metric, we have evaluated the title word selection process.

References
  1. Ultra-Summarization: A Statistical Approach to Generating Highly Condensed Non-Extractive Summaries. Michael Witbrock and Vibhu Mittal, Just Research. In Proceedings of SIGIR 99, Berkeley, CA, August 199
  2. Rong Jin and Alexander G. Hauptmann. Title generation using a training corpus.In CICLing ’01: Proceedings of the Second International Conference on Computational Linguistics and Intelligent Text Processing, pages 208–215, London, UK,2001. Springer-Verlag
  3. Term-weighting appraoches in automatic text retrieval ,Salton and Buckley Information Processing & Management Vol. 24, No. 5, pp. 513-523,printed in Great Britain. 988
  4. E. Firmin & M.J. Chrzanowski (1999). An evaluation of automatic text summarization. In I. Mani and M. Maybury, editors. Advances in Automatic Text Summarization. MIT Press, Cambridge, Massachusetts, 1999
  5. C. H. Leung & W.K. Kan (1997). A statistical learning approach to automatic indexing of controlled index terms. Journal of the American Society for Information Science, 48 (1), 55-66, 1997.
  6. P.D. Turney (2000). Learning algorithms for keyphrase extraction. Information Retrieval, 2(4): 303-336, 2000
  7. I. Mani & M. Maybury (1999). Advances in Automated Text Summarization.Cambridge, MA: MIT Press, 1999
  8. K. S. Jones & P. Willett (1997). Reading in Information Retrieval. Morgan Kaufmann Publishers, 1997
  9. MUC-6 (1995), Proceeding of The Sixth Message Understanding Conference, 1995
  10. Padmaja Rani B., Vishnu Vardhan B., Kanaka Durga A., Govardhan A., Pratap Reddy L., and Vinaya Babu A. Telugu Document Classification using Baye’s Probabilistic Model Technology spectrum, Journal of JNTU, vol.2 No.1, 2008, pp.26- 30
  11. M. Banko, V. Mittal, and M. Witbrock. Headline generation based on statistical translation. In the Proceedings of Association for Computational Linguistics, 2000.
  12. V. Rjiesbergen (1979). Information Retrieval. Chapter 7. Butterworths, London, 1979.
  13. Statistical Approaches toward title generation by Rong Jin , 2003, Ph.D Thesis
Index Terms

Computer Science
Information Sciences

Keywords

BMW Model Indic Script Title Word Selection F1 measure Statistical Approach