CFP last date
20 May 2024
Reseach Article

Keyword based Automatic Summarization of HTML Documents

by Shivangi Gupta, Mukesh Rawat
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 127 - Number 8
Year of Publication: 2015
Authors: Shivangi Gupta, Mukesh Rawat
10.5120/ijca2015906421

Shivangi Gupta, Mukesh Rawat . Keyword based Automatic Summarization of HTML Documents. International Journal of Computer Applications. 127, 8 ( October 2015), 24-29. DOI=10.5120/ijca2015906421

@article{ 10.5120/ijca2015906421,
author = { Shivangi Gupta, Mukesh Rawat },
title = { Keyword based Automatic Summarization of HTML Documents },
journal = { International Journal of Computer Applications },
issue_date = { October 2015 },
volume = { 127 },
number = { 8 },
month = { October },
year = { 2015 },
issn = { 0975-8887 },
pages = { 24-29 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume127/number8/22750-2015906421/ },
doi = { 10.5120/ijca2015906421 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:19:20.473766+05:30
%A Shivangi Gupta
%A Mukesh Rawat
%T Keyword based Automatic Summarization of HTML Documents
%J International Journal of Computer Applications
%@ 0975-8887
%V 127
%N 8
%P 24-29
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Automatic summarization [5] can be defined as the procedure to create a short version of a text by a computer program. Its product still contains the most important points of the existing text. Multi-document summarization [6] can be defined as an automatic procedure which extracts information from multiple texts that is written about the same topic. Resulting summary report allows individual users or professional information consumers, to quickly familiarize themselves with information that is contained in a large cluster of documents. Multi-document summarization creates information reports that are both concise and comprehensive.

References
  1. James Allan, Jaime Carbonell, George Doddington,Jonathan Yamron, and Yiming Yang. 1998. Topic detection and tracking pilot study: Final report. In Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop.
  2. Chinatsu Aone, M. E. Okurowski, J. Gorlinsky, and B. Larsen. 1997. A scalable summarization system using robust NLP. In Proceedings of the ACL'97/EACL'97 Workshop on Intelligent Scalable Text Summarization, pages 66-73, Madrid, Spain.
  3. Breck Baldwin and Thomas S. Morton. 1998. Dynamic coreference-based summarization. In Proceedings of the Third Conference on Empirical Methods in Natural Language Processing (EMNLP-3), Granada, Spain, June.
  4. Regina Barzilay and Michael Elhadad. 1997. Using lexical chains for text summarization. In Proceedings of the CL'97/EACL'97 Workshop on Intelligent Scalable Text Summarization, pages 10-17, Madrid, Spain.
  5. A. Siddharthan, A. Nenkova, and K. McKeown. Syntactic simplification for improving content selection in multi-document summarization. In Proc.of COLING, 2004.
  6. L. Vanderwende, H. Suzuki, and C. Brockett. Microsoft Research at DUC2006: Taskfocused summarization with sentence simplification and lexical expansion. In Proc. of DUC, 2006.
  7. X. Wan and J. Yang. Improved affinity graph based multi-document summarization. In Proceedings of HLT-NAACL, Companion Volume: Short Papers, pages 181–184, 2006.
  8. D.Zajic, B. Dorr, andR. Schwartz.Automatic headline generation for newspaper stories. In Proc. of DUC, 2002.
  9. D. Zajic, B. Dorr, J. Lin, C. Monz, and R. Schwartz. A sentence-trimming approach to multidocument summarization. In Proc. of DUC, 2005.
  10. Satoshi Sekine and C Nobata. Sentence Extraction with Information Extraction Technique. In Workshop on Text Summarization, 2001.
  11. Christopher D. Manning, Prabhakar Raghavan,” An Introduction to information retrieval”,Cambridge University Press Cambridge, England, Online edition (c) 2009.
Index Terms

Computer Science
Information Sciences

Keywords

Automatic summarization multi-document summarization multiple texts pre- processing of text.