A Frequent Concepts Based Document Clustering Algorithm

Dr.Renu Dhir; Rekha Baghel

Call for Paper

July Edition

IJCA solicits high quality original research papers for the upcoming July edition of the journal. The last date of research paper submission is 22 June 2026

Submit your paper

Know more

The week's pick

CAD-Genesis: An Open-Source AI-Powered Add-in for Natural Language-Driven Parametric CAD Modeling and Cross-Platform Integration in SolidWorks and Fusion 360

Anil Mandloi Prakhi Mandloi

Random Articles

Multiport Memory Design in Quantum Dot Cellular Automata Platform

Oct

2019

SysRisk ñA Decisional Framework to Measure System Dimensions of Legacy Application for Rejuvenation through Reengineering

February

2011

Open Reviewing for Imparted Information to Effective Client Denial in the Cloud

June

2015

Enabling Effective Personalized Learning: Determinants for Knowledge based Web Information Retrieval Systems

April

2015

Reseach Article

A Frequent Concepts Based Document Clustering Algorithm

by Dr.Renu Dhir, Rekha Baghel

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 4 - Number 5

Year of Publication: 2010

Authors: Dr.Renu Dhir, Rekha Baghel

10.5120/826-1171

Dr.Renu Dhir, Rekha Baghel . A Frequent Concepts Based Document Clustering Algorithm. International Journal of Computer Applications. 4, 5 ( July 2010), 6-12. DOI=10.5120/826-1171

@article{ 10.5120/826-1171,

author = { Dr.Renu Dhir, Rekha Baghel },

title = { A Frequent Concepts Based Document Clustering Algorithm },

journal = { International Journal of Computer Applications },

issue_date = { July 2010 },

volume = { 4 },

number = { 5 },

month = { July },

year = { 2010 },

issn = { 0975-8887 },

pages = { 6-12 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume4/number5/826-1171/ },

doi = { 10.5120/826-1171 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T19:52:15.416420+05:30

%A Dr.Renu Dhir

%A Rekha Baghel

%T A Frequent Concepts Based Document Clustering Algorithm

%J International Journal of Computer Applications

%@ 0975-8887

%V 4

%N 5

%P 6-12

%D 2010

%I Foundation of Computer Science (FCS), NY, USA

Abstract

This paper presents a novel technique of document clustering based on frequent concepts. The proposed technique, FCDC (Frequent Concepts based document clustering), a clustering algorithm works with frequent concepts rather than frequent items used in traditional text mining techniques. Many well known clustering algorithms deal with documents as bag of words and ignore the important relationships between words like synonyms. the proposed FCDC algorithm utilizes the semantic relationship between words to create concepts. It exploits the WordNet ontology in turn to create low dimensional feature vector which allows us to develop a efficient clustering algorithm. It uses a hierarchical approach to cluster text documents having common concepts. FCDC found more accurate, scalable and effective when compared with existing clustering algorithms like Bisecting K-means , UPGMA and FIHC.

References

Index Terms

Computer Science

Information Sciences

Keywords

Document clustering Clustering algorithm Frequent Concepts based Clustering WordNet