CFP last date
20 May 2024
Reseach Article

Using Latent Dirichlet Allocation to Incorporate Domain Knowledge with Concept based Approach for Automatic Topic Detection

by A. Mekala, C. Chandra Sekar
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 59 - Number 19
Year of Publication: 2012
Authors: A. Mekala, C. Chandra Sekar
10.5120/9801-4376

A. Mekala, C. Chandra Sekar . Using Latent Dirichlet Allocation to Incorporate Domain Knowledge with Concept based Approach for Automatic Topic Detection. International Journal of Computer Applications. 59, 19 ( December 2012), 28-32. DOI=10.5120/9801-4376

@article{ 10.5120/9801-4376,
author = { A. Mekala, C. Chandra Sekar },
title = { Using Latent Dirichlet Allocation to Incorporate Domain Knowledge with Concept based Approach for Automatic Topic Detection },
journal = { International Journal of Computer Applications },
issue_date = { December 2012 },
volume = { 59 },
number = { 19 },
month = { December },
year = { 2012 },
issn = { 0975-8887 },
pages = { 28-32 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume59/number19/9801-4376/ },
doi = { 10.5120/9801-4376 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:04:39.542913+05:30
%A A. Mekala
%A C. Chandra Sekar
%T Using Latent Dirichlet Allocation to Incorporate Domain Knowledge with Concept based Approach for Automatic Topic Detection
%J International Journal of Computer Applications
%@ 0975-8887
%V 59
%N 19
%P 28-32
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In the past couple of years multi-topic summarization is a research investigation that has expanded much attention. There has been a variety of effort on generating natural language summaries for variety of topics, but this is feasible only for a very small number of topics. In this research paper the method trying to provide automatic detection of topics to be summarized that is can determine how many topics should be chosen for automatic summarization. This is effectively done through the combined efficient framework of ontology based and non ontology based systems will be optimistic for the excellence of topic summary. To achieve this propose to apply latent Dirichlet allocation (LDA) model for capturing the semantic information on topic transcription. LDA is a generative model and defines a probabilistic method for generating a new document. The LDA model is utilized for estimating topic sharing in queries and word recorded topic documents, and the identical is performed at the topic level. Concept based topic matching between query words and topic documents are performed using ontology based and non ontology based matching algorithm. The results of topic level matching methods are evaluated between the automatic topic detection method and predefined topic detection mechanism along with the experimental results shown to be complementary.

References
  1. James Allan, Carbonell, George Doddington, Jonathan Yamron and Yiming Yang, "Topic Detection and Tracking Pilot Study: Final Report", In Proceedings of the Broadcast News Understanding and Transcription Workshop, 1998.
  2. James Allan, Victor Lavrenko, Hubert Jin, First story detection in TDT is hard, In CIKM'00.
  3. T. Brants, F. Chen and A. Farahat, "A system for New Event Detection", In SIGIR'03.
  4. Sudipto Guha, Adam Meyerson, Nina Mishra, Rajeev Motwani and Liadan O'Callaghan, "Clustering data streams: Theory and practice", J. TKDE 2003.
  5. Qi He, Kuiyu Chang and Ee-Peng Lim, "A Model for Anticipatory Event Detection", In ER'06.
  6. G. Kumaran and J. Allan, "Text classification and named entities for new event detection", In SIGIR'04.
  7. Nicola Stokes and Joe Carthy, "Combining semantic and syntactic document classifiers to improve first story detection", In SIGIR'01.
  8. Alexey Tsymbal, "The problem of concept drift: Definitions and related work", Technical report, Department of Computer Science, Trinity College, 2004.
  9. Y. Yang, T. Pierce and J. Carbonell, "A Study of Retrospective and On-Line Event Detection", In SIGIR'98.
  10. Y. Yang, J. Zhang, J. Carbonell and C. Jin, "Topic-conditioned Novelty Detection", In SIGKDD'02.
Index Terms

Computer Science
Information Sciences

Keywords

Latent Dirichlet allocation Ontology based matching algorithm Non ontology based matching algorithm Topic summarization and Automatic topic detection