CFP last date
20 May 2024
Reseach Article

Analysis of SimpleKMeans with Multiple Dimensions using WEKA

by Rupali Patil, Shyam Deshmukh, K Rajeswari
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 110 - Number 1
Year of Publication: 2015
Authors: Rupali Patil, Shyam Deshmukh, K Rajeswari
10.5120/19280-0694

Rupali Patil, Shyam Deshmukh, K Rajeswari . Analysis of SimpleKMeans with Multiple Dimensions using WEKA. International Journal of Computer Applications. 110, 1 ( January 2015), 14-17. DOI=10.5120/19280-0694

@article{ 10.5120/19280-0694,
author = { Rupali Patil, Shyam Deshmukh, K Rajeswari },
title = { Analysis of SimpleKMeans with Multiple Dimensions using WEKA },
journal = { International Journal of Computer Applications },
issue_date = { January 2015 },
volume = { 110 },
number = { 1 },
month = { January },
year = { 2015 },
issn = { 0975-8887 },
pages = { 14-17 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume110/number1/19280-0694/ },
doi = { 10.5120/19280-0694 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:45:14.744617+05:30
%A Rupali Patil
%A Shyam Deshmukh
%A K Rajeswari
%T Analysis of SimpleKMeans with Multiple Dimensions using WEKA
%J International Journal of Computer Applications
%@ 0975-8887
%V 110
%N 1
%P 14-17
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Clustering techniques have more importance in data mining especially when the data size is very large. It is widely used in the fields including pattern recognition system, machine learning algorithms, analysis of images, information retrieval and bio-informatics. Different clustering algorithms are available such as Expectation Maximization (EM), Cobweb, FarthestFirst, OPTICS, SimpleKMeans etc. SimpleKMeans clustering is a simple clustering algorithm. It partitions n data tuples into k groups such that each entity in the cluster has nearest mean. This paper is about the implementation of the clustering techniques using WEKA interface. This paper includes a detailed analysis of various clustering techniques with the different standard online data sets. Analysis is based on the multiple dimensions which include time to build the model, number of attributes, number of iterations, number of clusters and error rate.

References
  1. K-Means clustering using Weka Interface- By Sapna Jain, M Afshar Aalam and M. N Doja, Jamia Hamdard University, New Delhi, Proceedings of the 4th National Conference, INDIA Com-2010 Computing for Nation Development, February 25-26, 2010 Bharati Vidyapeeth's Institute of Computer Applications and Management, New Delhi.
  2. Bamshad Mobasher, School of CTI, DePaul University, Bank data set, http://facweb. cs. depaul. edu/mobasher/classes/ect584/weka/k-means. html.
  3. National Informatics Centre (NIC), Irrigation census data of water lifts in all villages of country, http://data. gov. in/.
  4. K-Means Clustering in Spacial Data Mining using Weka Interface- By Ritu Sharma (Sachdeva), M. Afshar Alam, Anita Rani, Department of Computer Science Jamia Hamdard University New Delhi, International Conference on Advances in Communication and Computing Technologies (ICACACT) 2012, Proceedings published by International Journal of Computer Applications (IJCA).
  5. The University of Waikato. Weka 3 – Machine Learning software in Java, http://www. cs. waikato. ac. nz/ml/weka.
  6. S. Celis and D. R. Musicant, Weka-parallel: Machine Learning in parallel, Technical report, Carleton College, CS TR, 2002.
  7. K-Means clustering Tutorial- By Kardi Teknomo, Ph. D.
  8. Privacy-Preserving K-Means clustering over vertically Partitioned Data-By Jaideep Vaidya and Chris Clifton, Dept. of Computer Sciences, Purdue University, 2050 N University St, West Lafayette, IN 47907-2066.
  9. Application of special data mining for Agriculture- By D. Rajesh, AP-SITE, VIT University, Vellore-14, International Journal of Computer Applications (0975-8887), volume 15 – No. 2, February 2011.
  10. The university of WAIKATO, WEKA Manual for version 3-6-8, http://www. nilc. icmc. usp. br/elc-ebralc2012/minicursos/WekaManual-3-6-8. pdf.
  11. Eibe Frank, Mark Hall, Geoffey Holmes, Richard Kirkby, Bernhard Pfahringer, lan H. Witten, Len Trigg, "Weka-A Machine Learning Workbench for Data Mining", Data mining and Knowledge Discovery Handbook, pp. 1269-1277, 2010.
  12. Analysis of Different Data Mining Tools using Classification, Clustering and Association Rule Mining – By Pritam Patil, Suvarna Thube, Bhakti Ratnaparkhi, K. Rajesweri, International Journal of Computer Applications (0975-8887), volume 93- No. 8, pp. 35-39, May 2014.
Index Terms

Computer Science
Information Sciences

Keywords

Data mining SimpleKMeans Clustering WEKA.