CFP last date
20 May 2024
Reseach Article

A Comparative Performance Survey on Microarray Data Analysis Techniques for Colon Cancer Classification

by Kshipra Chitode, Meghana Nagori
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 93 - Number 17
Year of Publication: 2014
Authors: Kshipra Chitode, Meghana Nagori
10.5120/16430-6190

Kshipra Chitode, Meghana Nagori . A Comparative Performance Survey on Microarray Data Analysis Techniques for Colon Cancer Classification. International Journal of Computer Applications. 93, 17 ( May 2014), 28-34. DOI=10.5120/16430-6190

@article{ 10.5120/16430-6190,
author = { Kshipra Chitode, Meghana Nagori },
title = { A Comparative Performance Survey on Microarray Data Analysis Techniques for Colon Cancer Classification },
journal = { International Journal of Computer Applications },
issue_date = { May 2014 },
volume = { 93 },
number = { 17 },
month = { May },
year = { 2014 },
issn = { 0975-8887 },
pages = { 28-34 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume93/number17/16430-6190/ },
doi = { 10.5120/16430-6190 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:16:01.621409+05:30
%A Kshipra Chitode
%A Meghana Nagori
%T A Comparative Performance Survey on Microarray Data Analysis Techniques for Colon Cancer Classification
%J International Journal of Computer Applications
%@ 0975-8887
%V 93
%N 17
%P 28-34
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The genetic information of any human beings is very helpful in cancer diagnosis. DNA microarray technology has enabled us to handle thousands of genes simultaneously. cDNA and Affymetrix microarray are the microarray technologies. The microarray data analysis can be done in supervised or unsupervised learning methods. Hierarchical clustering, k-means algorithms are widely used for clustering. As Curse of dimensionality is main challenge for microarray, Feature selection techniques are used. The classification accuracy depends on the feature selection technique used. In proposed work, feature selection techniques implemented are Signal-to-Noise ratio, Information Gain and Fishers criteria. SVM and KNN classifiers are built. The comparative results of performance accuracies are generated. The SVM classifier outperforms with fishers criteria and KNN outperforms with SNR.

References
  1. Alireza Osareh and Bita Shadgar "Microarray Data Analysis for Cancer classification" IEEE Antalya, Turkey 2009, pp. 125-132.
  2. Jun S. Liu Department of Statistics Harvard University "Bioinformatics: Microarrays Analyses and Beyond".
  3. Ahmed Fadiel, Frederick Naftolin "Microarray applications and challenges: a vast array of possibilities" Int Arch Biosci 2003-02-02, pp. 1111-1121.
  4. Yves Moreau, Frank De Smet, Gert Thijs, Kathleen Marchal, and Bart De Moor, "Functional Bioinformatics of Microarray Data: From Expression to Regulation" Proceedings of the IEEE, Vol. 90, No. 11, November 2002.
  5. Bettina Harr and Christian Schlotterer "Comparison of algorithms for the analysis of Affymetrix microarray data as evaluated by co-expression of genes in known operons" Nucleic Acids Research, 2006, Vol. 34, No. 2.
  6. Shang Gao, Omar Addam and colleagues, "Robust Integrated Framework for Effective Feature Selection and Sample Classification and Its Application to Gene Expression Data Analysis" IEEE 2012 pp. 112-119.
  7. Yvan Saeys, Inaki Inza and Pedro Larranaga "A review of feature selection techniques in bioinformatics" 2005 pp 1–10.
  8. Tanya Barrett and colleagues "NCBI GEO: mining tens of millions of expression profiles—database and tools update" Nucleic Acids Research, 2007, Vol. 35, Database issue, pp. D760-D765.
  9. Jasmina Novakovi?, Perica Strbac, Dusan Bulatovic, "Toward optimal feature selection using ranking methods and classification algorithms" Yugoslav Journal of Operations Research 21 (2011), Number 1, 119-135.
  10. Sri Harsha Vege "Ensemble of Feature Selection Techniques for High Dimensional Data" Western Kentucky University 2012.
  11. Azadeh Mohammadi, Mohammad H Saraee, Mansoor Salehi, "Identification of disease-causing genes using microarray data mining and Gene Ontology" Mohammadi et al. BMC Medical Genomics 2011.
  12. "Data Mining: Concepts and Techniques", second edition by Jaiwei Han and Micheline Kamber Chapter 6.
  13. Seyyid Ahmed Medjahed , Tamazouzt Ait Saadi, Abdelkader Benyettou, "Breast Cancer Diagnosis by using k-Nearest Neighbor with Different Distances and Classification Rules", International Journal of Computer Applications (0975 - 8887)Volume 62 - No. 1, January 2013.
  14. Sampath Deegalla, Henrik Bostrom, "Classification of Microarrays with kNN: Comparison of Dimensionality Reduction Methods" H. Yin et al. (Eds. ): IDEAL 2007, LNCS 4881, pp. 800–809, 2007.
Index Terms

Computer Science
Information Sciences

Keywords

Microarray cancer genes feature selection classification