CFP last date
20 May 2024
Reseach Article

Prediction of Credit-Card Defaulters: A Comparative Study on Performance of Classifiers

by Ajay, Ajay Venkatesh, Shomona Gracia Jacob
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 145 - Number 7
Year of Publication: 2016
Authors: Ajay, Ajay Venkatesh, Shomona Gracia Jacob
10.5120/ijca2016910702

Ajay, Ajay Venkatesh, Shomona Gracia Jacob . Prediction of Credit-Card Defaulters: A Comparative Study on Performance of Classifiers. International Journal of Computer Applications. 145, 7 ( Jul 2016), 36-41. DOI=10.5120/ijca2016910702

@article{ 10.5120/ijca2016910702,
author = { Ajay, Ajay Venkatesh, Shomona Gracia Jacob },
title = { Prediction of Credit-Card Defaulters: A Comparative Study on Performance of Classifiers },
journal = { International Journal of Computer Applications },
issue_date = { Jul 2016 },
volume = { 145 },
number = { 7 },
month = { Jul },
year = { 2016 },
issn = { 0975-8887 },
pages = { 36-41 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume145/number7/25293-2016910702/ },
doi = { 10.5120/ijca2016910702 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:48:11.786067+05:30
%A Ajay
%A Ajay Venkatesh
%A Shomona Gracia Jacob
%T Prediction of Credit-Card Defaulters: A Comparative Study on Performance of Classifiers
%J International Journal of Computer Applications
%@ 0975-8887
%V 145
%N 7
%P 36-41
%D 2016
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Data mining is an emerging area of research that aims at extracting meaningful patterns from available data. This paper highlights the significance of classification in predicting new trends from voluminous data. Performance analysis of various data mining algorithm viz. BayesNet, Meta-Stacking, Naïve Bayes, Random Forest, SMO and ZeroR in predicting credit-card defaulters is discussed in this paper. Dataset from the UCI machine learning repository comprising of 25 attributes and 30000 instances have been employed to analyze the performance of algorithms. Moreover, the effect of feature selection has also been identified with respect to each classification algorithm. It has been concluded from the experimental results that both Correlation Feature Subset and Information Gain feature selection methods yield the most useful features for prediction and the accuracy of Random Forest Ensemble method is highest in predicting credit card defaulters.

References
  1. Han, Jiawei, and Micheline Kamber. "Data mining: concepts and techniques (the Morgan Kaufmann Series in data management systems)." (2000).
  2. Cios, Krzysztof J., Witold Pedrycz, and Roman W. Swiniarski. Data mining methods for knowledge discovery. Vol. 458. Springer Science & Business Media, 2012.
  3. Dataset from University of California,Irvine available in their online repository http://archive.ics.uci.edu/ml/index.html.
  4. Abbas Kerama and Niloofar Yousefi - A Proposed Classification of Data Mining Techniques in Credit Scoring. Proceedings of the 2011,International conference on industrial engineering and Operations management Kuala Lumpur, Malaysia.
  5. Tudor, Adela Ioana, Adela Bara, and Elena Andrei. "Clustering analysis for credit default probabilities in a retail bank portfolio." Bucharest Acad Econ Stud, Database Syst J 3 (2012): 23-30.
  6. Tudor, Adela Ioana, Adela Bâra, and Simona Vasilica Oprea. "Comparative analysis of data mining methods for predicting credit default probabilities in a retail bank portfolio." WSEAS International Conference. Proceedings. Recent Advances in Computer Engineering Series. No. 7. WSEAS, 2012.
  7. Investopedia is an online encyclopedia relating to investments. http://www.investopedia.com/terms/c/credit.asp
  8. Kim, Yong, W. Nick Street, and Filippo Menczer. "Feature selection in data mining." Data mining: opportunities and challenges 3.9 (2003): 80-105.
  9. Guan, Hu, Jingyu Zhou, and Minyi Guo. "A class-feature-centroid classifier for text categorization." Proceedings of the 18th international conference on World wide web. ACM, 2009.
  10. Roobaert, Danny, Grigoris Karakoulas, and Nitesh V. Chawla. "Information gain, correlation and support vector machines." Feature Extraction. Springer Berlin Heidelberg, 2006. 463-470.
  11. Jacob, Shomona Gracia, and R. Geetha Ramani. "Discovery of knowledge patterns in clinical data through data mining algorithms: multi-class categorization of breast tissue data." International Journal of Computer Applications (IJCA) 32.7 (2011): 46-53.
  12. Genuer, Robin, et al. "Random Forests based feature selection for decoding fMRI data." Proceedings Compstat. No. 267. 2010.
  13. Nasa, Chitra. "Evaluation of different classification techniques for web data."International Journal of Computer Applications 52.9 (2012).
  14. Cheng, Jie, and Russell Greiner. "Comparing Bayesian network classifiers."Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence. Morgan Kaufmann Publishers Inc., 1999.
  15. WEKA- Weka is a collection of machine learning algorithms for data mining tasks.http://www.cs.waikato.ac.nz/ml/weka/
  16. Watanabe, Carolina YV, et al. "SACMiner: A new classification method based on statistical association rules to mine medical images." Enterprise Information Systems. Springer Berlin Heidelberg, 2010. 249-263.
Index Terms

Computer Science
Information Sciences

Keywords

Credit card defaulter datamining patterns Knowledge patterns.