CFP last date
20 May 2024
Reseach Article

Dimension Reduction: A Review

by Mohini D Patil, Shirish S. Sane
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 92 - Number 16
Year of Publication: 2014
Authors: Mohini D Patil, Shirish S. Sane
10.5120/16094-5390

Mohini D Patil, Shirish S. Sane . Dimension Reduction: A Review. International Journal of Computer Applications. 92, 16 ( April 2014), 23-29. DOI=10.5120/16094-5390

@article{ 10.5120/16094-5390,
author = { Mohini D Patil, Shirish S. Sane },
title = { Dimension Reduction: A Review },
journal = { International Journal of Computer Applications },
issue_date = { April 2014 },
volume = { 92 },
number = { 16 },
month = { April },
year = { 2014 },
issn = { 0975-8887 },
pages = { 23-29 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume92/number16/16094-5390/ },
doi = { 10.5120/16094-5390 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:14:29.898843+05:30
%A Mohini D Patil
%A Shirish S. Sane
%T Dimension Reduction: A Review
%J International Journal of Computer Applications
%@ 0975-8887
%V 92
%N 16
%P 23-29
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Dimension reduction is the process of keeping only those dimensions in a dataset which are important from the point of view of problem at hand and discarding of the others. This helps to design easily computable algorithms and to increase the performance of classifiers. It has gained importance as a preprocessing step in knowledge discovery and data mining especially in the fields of pattern matching, machine learning, bioinformatics and genetics which involve datasets having large number of dimensions. There are two basic strategies used for reduction of the dimensions; feature selection and feature extraction. Feature selection techniques focus on selecting some of the important features from all the features while feature extraction techniques are based on generating new features by making use of entire information present in the original dataset. Recently some work has also focused upon combining both the strategies to club their advantages. This paper studies the basic strategies for dimension reduction, the different techniques proposed in literature for reducing the dimensions and also about the measures used by them. Finally it also discusses about the combined approach and concluding remarks are given.

References
  1. Jiawei Han and Micheline Kamber, "Data Mining: Concepts and Technique"s, 2nd ed,The Morgan Kaufmann Series in Data Management Systems, Jim Gray, Series Editor Morgan Kaufmann Publishers, March 2006. ISBN 1-55860-901-6.
  2. www. nptel. ac. in
  3. P. Maji and S. Paul, "Rough set based maximum relevance maximum significance criterion and gene selection from microarray data" ,Int. J. Approx. Reason. , vol. 52, no. 3, pp. 408426, Mar. 2011.
  4. A. Chouchoulas and Q. Shen, "Rough set-aided keyword reduction for text categorization" ,Appl. Artif. Intell. ,vol. 15, no. 9, mpp. 843873,Oct. 01
  5. R. Jensen and Q. Shen, "Semantics-preserving dimensionality reduction:Rough and fuzzy rough-based approach" ,IEEE Trans. Knowl. Data Eng. , vol. 16, no. 12, pp. 14571471, Dec. 2004.
  6. Q. Hu, D. Yu, J. Liu, and C. Wu, "Neighborhood rough set based heterogeneous feature subset selection",Inf. Sci. , vol. 178, no. 18, pp. 3577-3594, Sep. 2008.
  7. H. Peng, F. Long, and C. Ding, "Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy",IEEE Trans. Pattern Anal. Mach. Intell. , vol. 27, no. 8, pp. 1226-1238,Aug. 2005
  8. R. O. Duda, P. E. Hart and D. G. Stork, Pattern "Classification and Scene Analysis" ,Hoboken,Wiley,2000.
  9. P. Maji and P. Garai,Fuzzy Rough Simultaneous Attribute Selection and Feature Extraction Algorithm,IEEE Transactions on Cybernetics, VOL. 43, NO. 4,AUGUST 2013
  10. http://www. cs. waikato. ac. nz/ml/weka/
Index Terms

Computer Science
Information Sciences

Keywords

Dimension reduction Feature Selection Feature Extraction.