CFP last date
22 April 2024
Reseach Article

A Novel Ensemble based Cluster Analysis using Similarity Matrices and Clustering Algorithm (SMCA)

by Mayank Gupta, Dhanraj Varma
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 100 - Number 10
Year of Publication: 2014
Authors: Mayank Gupta, Dhanraj Varma
10.5120/17558-8171

Mayank Gupta, Dhanraj Varma . A Novel Ensemble based Cluster Analysis using Similarity Matrices and Clustering Algorithm (SMCA). International Journal of Computer Applications. 100, 10 ( August 2014), 1-6. DOI=10.5120/17558-8171

@article{ 10.5120/17558-8171,
author = { Mayank Gupta, Dhanraj Varma },
title = { A Novel Ensemble based Cluster Analysis using Similarity Matrices and Clustering Algorithm (SMCA) },
journal = { International Journal of Computer Applications },
issue_date = { August 2014 },
volume = { 100 },
number = { 10 },
month = { August },
year = { 2014 },
issn = { 0975-8887 },
pages = { 1-6 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume100/number10/17558-8171/ },
doi = { 10.5120/17558-8171 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:29:35.337674+05:30
%A Mayank Gupta
%A Dhanraj Varma
%T A Novel Ensemble based Cluster Analysis using Similarity Matrices and Clustering Algorithm (SMCA)
%J International Journal of Computer Applications
%@ 0975-8887
%V 100
%N 10
%P 1-6
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In today's world data analytics is gaining popularity due to user's motivation towards online data storage. This storage is not organized because of content types and data handling schemes complexity. User aims to retrieve data in lesser time with logical outcomes as desired can be achieved by applying data mining. Clustering in data mining is one of the known categorization approach used for formation of groups of similar elements having certain properties in common with other elements. This formation sometime creates noisy result in terms of formatted clusters. It depends on various factors such as distance measures, proximity values, objective functions, categorical or numerical attribute types etc. Over the last few years various schemes are suggested by different authors for improving the performance of tradition clustering algorithms. Among them, one is ensemble based clustering. Ensemble uses the mechanism for criteria selection from newly formed clusters with a defined portioning and joining methods to generate a single result instead of multiple solutions. The generation results are affected by various environmental parameters such as number of cluster, partitioning types, proximity values, objective function etc. This paper propose a novel SMCA based ensemble clustering algorithm for improvements over the existing issues defined in the paper. At the primary level of work and analytical evaluations, it shows the promising results in near future.

References
  1. Mahmood Hossain1, Susan M. Bridges2, Yong Wang2, and Julia E. Hodges2, "An Effective Ensemble Method for Hierarchical Clustering" in ACM at ISBN 978-1-4503-1084-0/12/06, 2012.
  2. Ayan Acharya, Eduardo R. Hruschka, Joydeep Ghosh & Sreangsu Acharyya, "Transfer Learning with Cluster Ensembles" in JMLR: Workshop and Conference Proceedings 27:123{133, 2012.
  3. Hanan G. Ayad and Mohamed S. Kamel, "Cluster-Based Cumulative Ensembles" in Pattern Analysis and Machine Intelligence Lab, 2009.
  4. Li Zheng & Tao Li, "Hierarchical Ensemble Clustering" in School of Computing and Information Sciences, 2010.
  5. Abdolreza Mirzaei & Mohammad Rahmati, "A Novel Hierarchical-Clustering-Combination Scheme Based on Fuzzy-Similarity Relations" in IEEE Transaction of fuzzy system, Vol 18, No 1, Feb 2010.
  6. Kurt Hornik , "A CLUE for CLUster Ensembles" in Journal of Statistical Software, Volume 14, Issue 12. September 2005.
  7. Dong Nguyen & Djoerd Hiemstra, "Ensemble Clustering for Result Diversification in Human Media Interaction" at http://mirex. sourceforge. net.
  8. Marcin Pelka, "Ensemble method for clustering of interval valued symbolic data" in Statistics in Transition Vol. 13, No. 2, pp. 335—342, June 2012.
  9. Prachi Joshi1 and Dr. Parag Kulkarni, "Incremental Learning: Areas and Methods –A Survey" in IJDKP Vol. 2, No. 5, September 2012.
  10. Harun Pirim, Dilip Gautam, Tanmay Bhowmik, Andy D. Perkins & Burak Ek?ioglu, " Performance of an ensemble clustering algorithm on biological data set" in Mathematical and Computational Applications, Vol. 16, No. 1, pp. 87-96, 2011.
  11. L. I. Kuncheva, S. T. Hadjitodorov & L. P. Todorova, "Experimental Comparison of Cluster Ensemble Methods" in CLBME.
  12. Proling Dechang Chen, Zhe Zhang, Zhenqiu Liu and Xiuzhen Cheng, "An Ensemble Method of Discovering Sample Classes Using Gene Expression" in Uniformed Services University of the Health Sciences, 2010.
  13. Bryan Orme & Rich Johnson, "Improving K-Means Cluster Analysis: Ensemble Analysis Instead of Highest Reproducibility Replicates" in Sawtooth Software, Inc Research Series, 2008.
  14. Robert Neumayer, "Clustering Based Ensemble Classification for Spam Filtering" in Vienna University of Technology, 2011.
  15. Edgar Gonz`alez & Jordi Turmo, "Non-Parametric Document Clustering by Ensemble Methods" in TALP Research Center ISSN 1135-5948, 2008.
Index Terms

Computer Science
Information Sciences

Keywords

Data Mining Clustering Ensemble Consensus Partitioning and Joining Criteria Proximity Value Similarity Metrics and Clustering Algorithm (SMCA)