CFP last date
20 May 2024
Reseach Article

Comparative Analysis of Classification Algorithms on Different Datasets using WEKA

by Rohit Arora, Suman
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 54 - Number 13
Year of Publication: 2012
Authors: Rohit Arora, Suman
10.5120/8626-2492

Rohit Arora, Suman . Comparative Analysis of Classification Algorithms on Different Datasets using WEKA. International Journal of Computer Applications. 54, 13 ( September 2012), 21-25. DOI=10.5120/8626-2492

@article{ 10.5120/8626-2492,
author = { Rohit Arora, Suman },
title = { Comparative Analysis of Classification Algorithms on Different Datasets using WEKA },
journal = { International Journal of Computer Applications },
issue_date = { September 2012 },
volume = { 54 },
number = { 13 },
month = { September },
year = { 2012 },
issn = { 0975-8887 },
pages = { 21-25 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume54/number13/8626-2492/ },
doi = { 10.5120/8626-2492 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:55:34.621987+05:30
%A Rohit Arora
%A Suman
%T Comparative Analysis of Classification Algorithms on Different Datasets using WEKA
%J International Journal of Computer Applications
%@ 0975-8887
%V 54
%N 13
%P 21-25
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Data mining is the upcoming research area to solve various problems and classification is one of main problem in the field of data mining. In this paper, we use two classification algorithms J48 (which is java implementation of C4. 5 algorithm) and multilayer perceptron alias MLP (which is a modification of the standard linear perceptron) of the Weka interface. It can be used for testing several datasets. The performance of J48 and Multilayer Perceptron have been analysed so as to choose the better algorithm based on the conditions of the datasets. The datasets have been chosen from UCI Machine Learning Repository. Algorithm J48 is based on C4. 5 decision based learning and algorithm Multilayer Perceptron uses the multilayer feed forward neural network approach for classification of datasets. When comparing the performance of both algorithms we found Multilayer Perceptron is better algorithm in most of the cases.

References
  1. Z. Haiyang, "A Short Introduction to Data Mining and Its Applications", IEEE, 2011
  2. J. Han and M. Kamber, "Data Mining: Concepts and Techniques", Morgan Kaufmann, 2nd , 2006
  3. R. Agrawal, T. Imielinski, and A. N. Swami, "Database Mining: A Performance Perspective," IEEE Trans. Knowledge and Data Engineering, vol. 5, no. 6, pp. 914-925, Dec. 1993.
  4. J. R. Quinlan, "Induction of Decision Trees," Machine Learning, vol. 1, no. 1, pp. 81-106, 1986.
  5. J. R. Quinlan, C4. 5: Programs for Machine Learning. Morgan Kaufmann, 1993.
  6. Y. Bengio, J. M. Buhmann, M. Embrechts, and J. M. Zurada, "Introduction to the special issue on neural networks for data mining and knowledge discovery," IEEE Trans. Neural Networks, vol. 11, pp. 545-549, 2000.
  7. D. Michie, D. J. Spiegelhalter, and C. C. Taylor, "Machine Learning, Neural and Statistical Classification", Ellis Horwood Series in Artificial Intelligence, 1994.
  8. J. R. Quinlan, "Comparing Connectionist and Symbolic Learning Methods," S. J. Hanson, G. A. Drastall, and R. L. Rivest, eds. , Computational Learning Theory and Natural Learning Systems, vol. 1, pp. 445-456. A Bradford Book, MIT Press, 1994.
  9. J. W. Shavlik, R. J. Mooney, and G. G. Towell, "Symbolic and Neural Learning Algorithms: An Experimental Comparison," Machine Learning, vol. 6, no. 2, pp. 111-143, 1991.
  10. P. Clark and T. Niblett, "The CN2 induction algorithm. Machine learning", 3(4):261-283, 1989.
  11. Y. Freund and L. Mason. The alternating decision tree algorithm. In Proceedings of the 16th International Conference on Machine Learning, pages 124-133, 1999.
  12. UCI Machine Learning Repository: http://archive. ics. uci. edu/ml/datasets. html
  13. Weka: http://www. cs. waikato. ac. nz/~ml/weka/
  14. I. H. Witten, E. Frank, and M. A. Hall, Data Mining: Practical Machine Learning Tools and Techniques, 3rd ed. Morgan Kaufmann, 2011
  15. P. J. Werbos, "Backpropagation Through Time: What It Does and How to Do It", IEEE, 1990
  16. H. Lu, R. Setiono, and H. Liu, "Effective Data Mining Using Neural Networks", IEEE, 1996
Index Terms

Computer Science
Information Sciences

Keywords

Classification Data Mining Techniques Decision Tree Multilayer Perceptron