CFP last date
20 May 2024
Reseach Article

Application of Feature Selection Methods in Educational Data Mining

by Anal Acharya, Devadatta Sinha
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 103 - Number 2
Year of Publication: 2014
Authors: Anal Acharya, Devadatta Sinha
10.5120/18048-8951

Anal Acharya, Devadatta Sinha . Application of Feature Selection Methods in Educational Data Mining. International Journal of Computer Applications. 103, 2 ( October 2014), 34-38. DOI=10.5120/18048-8951

@article{ 10.5120/18048-8951,
author = { Anal Acharya, Devadatta Sinha },
title = { Application of Feature Selection Methods in Educational Data Mining },
journal = { International Journal of Computer Applications },
issue_date = { October 2014 },
volume = { 103 },
number = { 2 },
month = { October },
year = { 2014 },
issn = { 0975-8887 },
pages = { 34-38 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume103/number2/18048-8951/ },
doi = { 10.5120/18048-8951 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:33:32.330940+05:30
%A Anal Acharya
%A Devadatta Sinha
%T Application of Feature Selection Methods in Educational Data Mining
%J International Journal of Computer Applications
%@ 0975-8887
%V 103
%N 2
%P 34-38
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In the recent years, web based learning has emerged as a new field of research due to growth of network and communication technology. These learning systems generate a large volume of student data. Data mining algorithms may be applied on this data set to study interesting patterns. As an example, student enrollment data and his past examination records could be used to predict his grades in the term end examination. However this prediction could mean examining a lot of features of the student data resulting in creation of a model with high computational complexity. In this context this work first defines a student data set with 309 records and 14 features collected by a survey from various graduation level students majoring in Computer Science under University of Calcutta. Different feature selection algorithms are applied on this data set. The best results are obtained by Correlation Based Feature Selection algorithm with 8 features. Subsequently classification algorithms may be applied on this feature subset for predicting student grades.

References
  1. Castro, F. , Vellido, A. , Nebot A. , and Francisco M,. Applying Data Mining Techniques to e-Learning Problems.
  2. Minei-Bidgoli, B. , Kashy, D. A. ,,Kortemeyer, G, Punch, W F. , 2003, Predicting Student Performance: An Application of Data Mining Methods with the Educational Web-Based System LON-CAPA. 33rd ASEE/IEEE Frontiers in Education Conference
  3. Romero, C. , Ventura, S. ,2007, Educational data mining: A survey from 1995 to 2005. Expert Systems with Applications
  4. Romero, C. , Ventura. S. , Educational Data Mining: A Review of the State-of-the-Art. IEEE Transactions on Systems, Man, and Cybernatics—PART C: Applications and Reviews
  5. Kira, K. , Rendell, L. A. , 1992. A practical approach to feature selection. Proceedings of the Ninth International Conference on Machine Learning. pp. 249–256.
  6. Selwyn, Piramuthu. ,2004, Evaluating feature selection methods for learning in data mining applications. European Journal of Operational Research 156 p 483–494.
  7. Mitra, P. , Murthy, C. A. , Pal, S K. ,2002, Unsupervised Feature Selection using Feature Similarity. IEEE Transactions on Pattern Analysis and Machine Intelligence", Vol 24, No 4.
  8. Dash, M. , Liu, H. ,1997, Feature Selection for Classification. Intelligent Data Analysis.
  9. Stefanowski, J,. An Experimental Study of Methods Combining Multiple Classifiers - Diversified both by Feature Selection and Bootstrap Sampling.
  10. Kotsiantis, S. , Piarrekeas, C. , Pintelas P, 2004, Predicting Students' performance in Distance Learning using Machine Learning Techniques, Applied Artificial Intelligence, Volume 18, pp 411-426.
  11. Ramaswami, M. , Bhaskaran,, R. ,2010, A CHAID Based Performance Prediction Model in Educational Data Mining. International Journal of Computer Science Issues, Vol. 7, Issue 1, No. 1.
  12. Nguyen, N. N. , Janeck, P. , Haddawy, P. A ,2007, Comparative Analysis of Techniques for Predicting Academic Performance. 37th ASEE/IEEE Frontiers in Education Conference.
  13. Kovacic, Z. J. ,2010, Early Prediction of Student Success: Mining Students Enrolment Data. Informing Science & IT Education Conference.
  14. Oladukun, V. O. , Adebanjo, A. T. , Charles-Obawa, O. E. , Predicting Students' Academic Performance using Artificial Neural Network: A Case Study of an Engineering Course.
  15. Zhao, Y. and Zhang Y. , Comparison of decision tree methods for finding active objects.
  16. Vafaie, H. and Imam, I. F,. Feature Selection Methods:Genetic Algorithms vs. Greedy-like Search.
  17. WEKA Manual for Version 3-6-10.
  18. Haan, J. , Kamber, M,. 2011, Data Mining-Concepts and Techniques. Third Edition, Elsivier.
Index Terms

Computer Science
Information Sciences

Keywords

Educational Data Mining (EDM) Kappa Statistic F-measure Prediction Accuracy College Education WEKA.