CFP last date
20 May 2024
Reseach Article

Incremental Missing Value Replacement Techniques for Stream Data

by Kinnari Patel, R G Mehta, M M Raghuvanshi, N N Vadnere
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 122 - Number 17
Year of Publication: 2015
Authors: Kinnari Patel, R G Mehta, M M Raghuvanshi, N N Vadnere
10.5120/21791-5129

Kinnari Patel, R G Mehta, M M Raghuvanshi, N N Vadnere . Incremental Missing Value Replacement Techniques for Stream Data. International Journal of Computer Applications. 122, 17 ( July 2015), 9-13. DOI=10.5120/21791-5129

@article{ 10.5120/21791-5129,
author = { Kinnari Patel, R G Mehta, M M Raghuvanshi, N N Vadnere },
title = { Incremental Missing Value Replacement Techniques for Stream Data },
journal = { International Journal of Computer Applications },
issue_date = { July 2015 },
volume = { 122 },
number = { 17 },
month = { July },
year = { 2015 },
issn = { 0975-8887 },
pages = { 9-13 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume122/number17/21791-5129/ },
doi = { 10.5120/21791-5129 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:10:48.180181+05:30
%A Kinnari Patel
%A R G Mehta
%A M M Raghuvanshi
%A N N Vadnere
%T Incremental Missing Value Replacement Techniques for Stream Data
%J International Journal of Computer Applications
%@ 0975-8887
%V 122
%N 17
%P 9-13
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Stream data mining is the process of excerpting knowledge structure from large, continuous data. For stream data, various techniques are proposed for preparing the data for data mining task. In recent years stream data have become a growing area for the researcher, but there are many issues occurring in classifying these data due to erroneous and noisy data. Change of trend in the data periodically produces major challenge for data miners. This research concentrates on incremental missing value replacement for stream data. The proposed method generates the value for the missing data considering the data type and data distribution. It also considers the concept drift in the data stream. The method is applied to different datasets and promising results derived.

References
  1. Data preprocessing. CCSU http://www. cs. ccsu. edu/~markov/ccsu_courses/DataMining-3. html
  2. Cw. flek. cvut. cz/lib/exe/fetch. php/cources/ac4m33sad/2_tutorial. pdf.
  3. S. McClean, B. Scotney and M. Shapcott, "Using Background Knowledge with Attribute-Oriented Data Mining" Knowledge Discovery and Data mining (Digest no, 1998/310), IEE colloquiumon, 1998, pp. 1/1-1/4.
  4. J. Shena and M. Chen, "A Recycle Technique of Association Rule for Missing Value Completion" in Proc. AINA'03, 2003, pp. 526-529.
  5. Thomas R. Gabriel and Michael R. Berthold, "Missing Values in Fuzzy Rule Induction", Systems, Man and Cybernetics, 2005 IEEE International Conference on (Volume: 2).
  6. M. Shyu, I. P. Appuhamilage, S. Chen and L. Chang, "Handling Missing Values via Decomposition of the Conditioned Set", IEEE Systems, Man, and cybernetics society, pp. 199-204, 2005.
  7. Olga Troyanskaya, Michael Cantor, Gavin Sherlock, Pat Brown, Trevor Hastie, Robert Tibshirani, David Botstein and Russ B. Altman, "Missing value estimation methods for DNA microarrays", Bioinformatics (2001) 17 (6): 520-525.
  8. Anjana Sharma, Naina Mehta, Iti Sharma, " Reasoning with Missing Values in Multi Attribute Datasets" ,International Journal of Advanced Research in Computer Science and Software Engineering, Volume 3, Issue 5, May 2013 .
  9. R. Malarvizhi, A. Thanamani," K-NN Classifier Performs Better Than K-Means Clustering in Missing Value Imputation", IOSR Journal of Computer Engineering (IOSRJCE), vol. 6, pp. 12-15, Nov. - Dec. 2012.
  10. Phimmarin Keerin and Werasak Kurutach, Tossapon Boongoen, "Cluster-based KNN Missing Value Imputation for DNA Microarray Data", 2012 IEEE International Conference on Systems, Man, and Cybernetics October 14-17, 2012, COEX, Seoul, Korea.
  11. N. Devi, Balamurugan. S, Swathi U. V, "An amalgam KNN to predict Diabetes Mellitus", in proc. ICE-CCN, 2013, pp. 691-695.
  12. T. R. Sivapriy, A. R. Nadira Banu Kamal, V. Thavavel, "Imputation and Classification Of Missing Data Using Least Square Support Vector Machines – A New Approach in Dementia Diagnosis", International Journal of Advanced Research in Artificial Intelligence, Vol. 1, No. 4, 2012.
  13. Prakash Gupta, R. Srinivasan,"Missing Data Prediction and Forecasting for Water Quantity Data", 011 International Conference on Modeling, Simulation and Control "IPCSIT vol. 10 (2011) © (2011) IACSIT Press, Singapore.
  14. Ariel Schlamm, David Messinger, "Improved detection and clustering of hyper spectral image data by preprocessing with a Euclidean distance transformation", WHISPERS 2011, IEEE, Lisbon, Portugal, June (2011)
  15. Tony Finch, "Incremental calculation of weighted mean and variance", February 2009.
  16. Lukasz A. Kurgan,Member, IEEE, and Krzysztof J. Cios, "CAIM Discretization Algorithm", Senior Member, IEEE, IEEE transactions on knowledge and data engineering, vol. 16, no. 2, february 2004
  17. UCI repository dataset, "http://archive. ics. uci. edu/ml/"
  18. Weka tool "http://www. cs. waikato. ac. nz/ml/weka/Vote data", "Contact lance data.
  19. M. R. Lad, R. G. Mehta, D. P. Rana, "A novel tree based classification", international journal of engineering science & advanced technology", ijesat | may-jun 2012.
Index Terms

Computer Science
Information Sciences

Keywords

Skewness Mean Median Standard deviation Discretization.