CFP last date
20 May 2024
Reseach Article

Enhancing Classifier Performance via Hybrid Feature Selection and Numeric Class Handling- A Comparative Study

by S. Vijayasankari, K. Ramar
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 41 - Number 17
Year of Publication: 2012
Authors: S. Vijayasankari, K. Ramar
10.5120/5634-8003

S. Vijayasankari, K. Ramar . Enhancing Classifier Performance via Hybrid Feature Selection and Numeric Class Handling- A Comparative Study. International Journal of Computer Applications. 41, 17 ( March 2012), 30-36. DOI=10.5120/5634-8003

@article{ 10.5120/5634-8003,
author = { S. Vijayasankari, K. Ramar },
title = { Enhancing Classifier Performance via Hybrid Feature Selection and Numeric Class Handling- A Comparative Study },
journal = { International Journal of Computer Applications },
issue_date = { March 2012 },
volume = { 41 },
number = { 17 },
month = { March },
year = { 2012 },
issn = { 0975-8887 },
pages = { 30-36 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume41/number17/5634-8003/ },
doi = { 10.5120/5634-8003 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:29:51.402602+05:30
%A S. Vijayasankari
%A K. Ramar
%T Enhancing Classifier Performance via Hybrid Feature Selection and Numeric Class Handling- A Comparative Study
%J International Journal of Computer Applications
%@ 0975-8887
%V 41
%N 17
%P 30-36
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Classification is a supervised machine learning procedure in which the effective model is constructed for prediction. The accuracy of classification mainly depends on the type of features and the characteristics of the dataset. Feature selection is an efficient approach in searching the most descriptive features which would contribute to the increase in the performance of the inductive algorithm by reducing dimensionality and processing time. In the present work a hybrid embedded feature selection algorithm with class label refining and handled numeric class problem in classifier are implemented. A novel feature selection algorithm based on ranker search optimization method and ensemble genetic search for selecting the appropriate features and class label refining for correcting misclassified instances from the dataset have been done. By modelling this approach, it reaches a near global optimal solution set of features. Hybrid feature selection with enhanced C4. 5 and naïve bayes classification can handle numeric class to achieve better performance. The efficiency of this method is demonstrated by comparing with the other existing methods in terms of accuracy, number of features selected and ability to handle numerical class values. Experimental results on datasets reveals that the proposed algorithm increases the classifier accuracy with less error rate and the quality of results are comparable.

References
  1. Appavu S and Rajaram R. 2009. Knowledge - based system for text classification using ID6NB algorithm, Knowledge-Based Systems, Vol. 22. pp. 1–7.
  2. Blake C. L. and Merz C. J. UCI Repository of Machine Learning Databases,
  3. http://www. ics. uci. edu/~mlearn/MLRepository. html.
  4. Ong Y. S. and Keane A. J. 2004, Meta-Lamarckian in Memetic Algorithm, IEEE Trans. Evolutionary Computation, Vol. 8, pp. 99 -110.
  5. Renpu Li, Zheng - ou Wanga. 2004, Computing, Artificial Intelligence and Information Technology Mining classification rules using rough sets and neural networks, European Journal of operational Research, Vol. 157, pp. 439 - 448.
  6. Umamaheswari G, Ramar K, Manimegalai D and Gomathi V. 2011, An adaptive region based color texture segmentation using fuzzified distance metric, Applied soft computing, Vol. 11, pp. 2916 – 2924.
  7. Waikato Environment for Knowledge analysis (WEKA), Machine learning algorithms in java.
  8. Chien-Pang Lee and Yungho Leu. 2011, A novel hybrid feature selection method for micro array data analysis, Applied Soft Computing, Vol. 11, pp. 208-213.
  9. Shih-Chieh Chen, Shih-Wei Lin and Shuo-Yan Chou. 2011, Enhancing the classification accuracy by scatter-search-based ensemble approach, Applied Soft Computing, Vol. 11 pp. 1021-1028.
  10. Yue Huang, Paul McCullagh, Norman Black and Roy Harper, 2007, Feature selection and classification model construction on type 2 diabetic patients' data, Artificial Intelligence in Medicine, Vol. 41, pp. 251-262.
  11. Hassan H. Malik, John R. Kender • Dmitriy Fradkin, and Fabian Moerchen, 2010, Hierarchical document clustering using local patterns, Data Mining and Knowledge Discovery, Vol. 21, pp. 153-185.
  12. Srilatha Chebrolu, Ajith Abraham and Johnson P Thomas, 2005, Feature deduction and ensemble design of intrusion detection system, Computers and Security, Vol. 24, pp. 295-307
  13. Shailendra Singh and Sanjay Silakari, 2009, Generalized Discriminant Analysis algorithm for feature reduction in Cyber Attack Detection System', International Journal of Computer Science and Information Security, Vol. 6, pp - 173-180.
  14. Pratik Neelakantan N and Nagesh C, 2011, Role of Feature Selection in Intrusion Detection Systems for 802. 11 Networks, International Journal of Smart Sensors and Ad Hoc Networks, Vol. 1, pp: 98-101.
  15. Nallusamy R, Jayarajan K, and Duraiswamy K. 2009, Intrusion Detection In Mobile Ad Hoc Networks Using GA Based Feature Selection, Computer Science and Telecommunications, Vol. 5, pp-28-35.
Index Terms

Computer Science
Information Sciences

Keywords

Data Mining Hybrid Feature Selection Classification Decision Tree Accuracy