Call for Paper - September 2020 Edition
IJCA solicits original research papers for the September 2020 Edition. Last date of manuscript submission is August 20, 2020. Read More

The Analytical Comparison of ID3 and C4.5 using WEKA

Print
PDF
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Year of Publication: 2017
Authors:
Vani Kapoor Nijhawan, Mamta Madan, Meenu Dave
10.5120/ijca2017914286

Vani Kapoor Nijhawan, Mamta Madan and Meenu Dave. The Analytical Comparison of ID3 and C4.5 using WEKA. International Journal of Computer Applications 167(11):1-4, June 2017. BibTeX

@article{10.5120/ijca2017914286,
	author = {Vani Kapoor Nijhawan and Mamta Madan and Meenu Dave},
	title = {The Analytical Comparison of ID3 and C4.5 using WEKA},
	journal = {International Journal of Computer Applications},
	issue_date = {June 2017},
	volume = {167},
	number = {11},
	month = {Jun},
	year = {2017},
	issn = {0975-8887},
	pages = {1-4},
	numpages = {4},
	url = {http://www.ijcaonline.org/archives/volume167/number11/27812-2017914286},
	doi = {10.5120/ijca2017914286},
	publisher = {Foundation of Computer Science (FCS), NY, USA},
	address = {New York, USA}
}

Abstract

Data mining means to find out some useful information from a big warehouse of data and the process is aimed at unfolding old records and identifying novel patterns from the data. Data mining is used for classification and prediction. Many techniques and algorithms are available for mining the data. Out of many techniques, the decision tree is the simplest. This paper focuses on comparing the performance accuracy of ID3 and C4.5 techniques of the decision tree for predicting customer churn using WEKA. The data used for this research work has been collected by designing a survey form and getting it filled by around 150 mobile phone users belonging to a different gender, age groups and having different types of connection providers. For the data analysis in WEKA, the cross-validation method is used where a number of folds n (10 as standard as per the software) is used. From the results, it is observed that C4.5 algorithm exhibits better performance than ID3.

References

  1. WEKA Manual for Version 3-7-8 Remco R. Bouckaert Eibe Frank Mark Hall Richard Kirkby Peter Reutemann Alex Seewald David Scuse, 2013. Available at: http://statweb.stanford.edu/~lpekelis/13_datafest_cart/WekaManual-3-7-8.pdf as on 10-12-2016.
  2. Surjeet K. Y., Saurabh P., (2012).Data Mining: A Prediction for Performance Improvement of Engineering Students using Classification” WCSIT, ISSN: 2221-0741 Vol. 2, No. 2, 51-56.
  3. Han J.and Kamber M., (2011). Data Mining: Concepts and Techniques, Morgan Kaufmann Publish.
  4. Shafer, J., Agrawal, R., Mehta, M. Fast serial and parallel classification of very large databases. In Proc. of the 22nd Int’l Conference on Very Large Databases. 1996.
  5. T.Miranda Lakshmi, A.Martin, R.Mumtaj Begum, Dr.V.Prasanna Venkatesan, “An Analysis on Performance of Decision Tree Algorithms using Student’s Qualitative Data”, I.J.Modern Education and Computer Science, 2013. Published Online June 2013 in MECS (http://www.mecs-press.org/) DOI: 10.5815/ijmecs.2013.05.03
  6. O..O. Adeyemo, T. .O Adeyeye, D. Ogunbiyi (2015). Comparative Study of ID3/C4.5 Decision tree and Multilayer Perceptron Algorithms for the Prediction of Typhoid Fever, IEEE African Journal of Computing & ICT ISSN: 2006-1781 Vol 8. No. 1.
  7. J. Ross Quinlan. (1993). C4.5: Programs for Machine Learning. Morgan Kaufman.
  8. Pallavi Mude, Rahila Sheikh, “Study of Decision Tree Classification Algorithms using Matrimonial System”, International Journal of Computer & Organization Trends,Volume 18,No. 1, March 2015.
  9. Badr HSSINA, Abdelkarim MERBOUHA,Hanane EZZIKOURI,Mohammed ERRITALI, “A comparative study of decision tree ID3 and C4.5, ” International Journal of Advanced Computer Science and Applications (IJACSA), Special Issue on Advances in Vehicular Ad Hoc Networking and Applications, 2014.
  10. Jaimin N. Undavia, Dr. P.M.Dolia and Dr. AtulPatel, “ Comparison of Decision Tree Classification Algorithm to Predict Student's Post Graduation Degree in Weka Environment”, International Journal of Innovative and Emerging Research in Engineering,Vol 1, Issue 2, 2014.
  11. Dr. Mamta Madan Dr. Meenu Dave Vani Kapoor Nijhawan, “ A Review on: Data Mining for Telecom Customer Churn Management”, International Journal of Advanced Research in Computer Science and Software Engineering, Vol 5, Issue 9, 2015.
  12. “Machine Learning with WEKA” WEKA Explorer Tutorial for WEKA Version 3.4.3, Svetlana S. Aksenova, 2004. Available at: https://www.scribd.com/document/247244990/WEKA-Tutorial-Presentation as on 10-12-2016.

Keywords

Data mining, Decision tree, ID3, C4.5