CFP last date
22 April 2024
Reseach Article

Comparative Study among Data Reduction Techniques over Classification Accuracy

by Ibrahim M. El-hasnony, Hazem M. El Bakry, Ahmed A. Saleh
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 122 - Number 2
Year of Publication: 2015
Authors: Ibrahim M. El-hasnony, Hazem M. El Bakry, Ahmed A. Saleh
10.5120/21671-4752

Ibrahim M. El-hasnony, Hazem M. El Bakry, Ahmed A. Saleh . Comparative Study among Data Reduction Techniques over Classification Accuracy. International Journal of Computer Applications. 122, 2 ( July 2015), 9-15. DOI=10.5120/21671-4752

@article{ 10.5120/21671-4752,
author = { Ibrahim M. El-hasnony, Hazem M. El Bakry, Ahmed A. Saleh },
title = { Comparative Study among Data Reduction Techniques over Classification Accuracy },
journal = { International Journal of Computer Applications },
issue_date = { July 2015 },
volume = { 122 },
number = { 2 },
month = { July },
year = { 2015 },
issn = { 0975-8887 },
pages = { 9-15 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume122/number2/21671-4752/ },
doi = { 10.5120/21671-4752 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:09:30.739828+05:30
%A Ibrahim M. El-hasnony
%A Hazem M. El Bakry
%A Ahmed A. Saleh
%T Comparative Study among Data Reduction Techniques over Classification Accuracy
%J International Journal of Computer Applications
%@ 0975-8887
%V 122
%N 2
%P 9-15
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Nowadays, Healthcare is one of the most critical issues that need efficient and effective analysis. Data mining provides many techniques and tools that help in getting a good analysis for healthcare data. Data classification is a form of data analysis for deducting models. Mining on a reduced version of data or a lower number of attributes increases the efficiency of system providing almost the same results. In this paper, a comparative study between different data reduction techniques is introduced. Such comparison is tested against classification algorithms accuracy. The results showed that fuzzy rough feature selection outperforms rough set attribute selection, gain ratio, correlation feature selection and principal components analysis.

References
  1. Ahmed E. Youssef," A framework for secure healthcare systems based on big data analytics in mobile cloud computing environments", International Journal of Ambient Systems and Applications (IJASA), Vol. 2, No. 2, (2014), pp. : 1-11.
  2. Mohini D Patil, Dr. Shirish S. Sane," Effective Classification after Dimension Reduction: A Comparative Study", International Journal of Scientific and Research Publications, Vol. 4, No7, (2014), pp. : 1-4.
  3. Han, J. , M. Kamber, and J. Pei. , "Data Mining, third Edition: Concepts and Techniques ",The Morgan Kaufmann Series in Data Management Systems. ISBN-13: 978-0-12-381479-1 (2012). ?
  4. J. Alamelu Mangai, V. Santhosh Kumar, S. Appavu alias Balamurugan, "A Novel Feature Selection Framework for Automatic Web Page Classification", International Journal of Automation and Computing,Vol. 9,No. 4, (2012), pp. :442-448.
  5. N. R. Sakthivel a, Binoy B. Nairb, M. Elangovana, V. Sugumaranc and S. Saravanmurugan , "Comparison of dimensionality reduction techniques for the fault diagnosis of mono block centrifugal pump using vibration signals", Engineering Science and Technology, an International Journal,Vol. 17 ,(2014),pp. :30-38.
  6. Asha Gowda Karegowda, A. S. Manjunath& M. A. Jayaram introduced ," comparative study of attribute selection using gain ratio and correlation based feature selection", International Journal of Information Technology and Knowledge Management, Vol. 2, No. 2,(2010), pp. : 271-277.
  7. Jianhua Dai, Qing Xu," Attribute selection based on information gain ratio in fuzzy rough set theory with application to tumor classification", Applied Soft Computing,Vol. 13 ,(2013),pp. :211–221.
  8. Porkodi, R. ,"comparison of filter based feature selection algorithms: An overview", international journal of innovative research in technology& science, VOl. 2, No. 2, (2014), pp. :108-113.
  9. Alexander Ilin, Tapani Raiko," Practical Approaches to Principal Component Analysis in the Presence of Missing Values", Journal of Machine Learning Research,Vol. 11,(2010),pp. :1957-2000.
  10. Hall, Mark A. , and Lloyd A. Smith. , "Feature Selection for Machine Learning: Comparing a Correlation-Based Filter Approach to the Wrapper. " In FLAIRS conference, (1999), pp. : 235-239.
  11. Jensen, Richard, and Qiang Shen. ," Fuzzy–rough attribute reduction with application to web categorization ", fuzzy sets and systems Vol. 141, no. 3,( 2004),pp. :469-485.
  12. Jensen, Richard, and Qiang Shen. "New approaches to fuzzy-rough feature selection. ", Fuzzy Systems, IEEE Transactions ,Vol. 17, No. 4,(2009),pp. : 824-838.
  13. Kumar, R. Kavitha, and R. M. Chadrasekaran. "Attribute correction–data cleaning using association rule and clustering methods. " International Journal of Data Mining & Knowledge Management Process (IJDKP) . Vol. 1, no. 2, (2011), pp. : 22-32. ?
  14. Ics. uci. edu, (2015). Donald Bren School of Information and Computer Sciences @ University of California, Irvine. [online] Available at: http://www. ics. uci. edu [Accessed 29 May 2015].
  15. Mona Gamal,Ahmed Abou El-Fetouh, Shereef Barakat . "A Fuzzy Rough Rule Based System Enhanced By Fuzzy Cellular Automata". (IJACSA) International Journal of Advanced Computer Science and Applications, Vol. 4, no. 5, (2013), pp. :1-11.
Index Terms

Computer Science
Information Sciences

Keywords

Fuzzy rough feature selection rough set attribute reduction principal component analysis correlation feature selection gain ratio