CFP last date
22 April 2024
Reseach Article

Performance Analysis on Uncertain Data using Decision Tree

by Bhosale J. D., Patil B. M
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 96 - Number 7
Year of Publication: 2014
Authors: Bhosale J. D., Patil B. M
10.5120/16805-6529

Bhosale J. D., Patil B. M . Performance Analysis on Uncertain Data using Decision Tree. International Journal of Computer Applications. 96, 7 ( June 2014), 15-19. DOI=10.5120/16805-6529

@article{ 10.5120/16805-6529,
author = { Bhosale J. D., Patil B. M },
title = { Performance Analysis on Uncertain Data using Decision Tree },
journal = { International Journal of Computer Applications },
issue_date = { June 2014 },
volume = { 96 },
number = { 7 },
month = { June },
year = { 2014 },
issn = { 0975-8887 },
pages = { 15-19 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume96/number7/16805-6529/ },
doi = { 10.5120/16805-6529 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:21:07.137458+05:30
%A Bhosale J. D.
%A Patil B. M
%T Performance Analysis on Uncertain Data using Decision Tree
%J International Journal of Computer Applications
%@ 0975-8887
%V 96
%N 7
%P 15-19
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Data uncertainty is common in emerging applications, such as sensor networks, moving object databases, medical and biological fields. Data uncertainty can be caused by various factors including measurements precision limitation. Data uncertainty is inherited in various applications due to different reasons such as outdated sources or imprecise measurement and transmission problems. Classification is one of the most popular data mining techniques. Lot of people used decision tree for data classification and it widely used on certain or precise data. However in this paper we applied on uncertain data which is taken from UCI machine learning repository. This paper proposes a decision tree based classification method on uncertain data. We construct decision tree algorithms by including entropy and information gain, considering the uncertain data intervals. We use some pruning techniques that can improve efficiency of the decision tree and our experiment show that it significantly reduce the tree-construction time.

References
  1. http://archive. ics. uci. edu/ml/datasets. html.
  2. J. R. Quinlan, "Induction of Decision Trees," Machine Learning, vol. 1, no. 1, pp. 81-106, 1986.
  3. J. R. Quinlan, C4. 5: Programs for Machine Learning. Morgan Kaufmann, 1993.
  4. C. L. Tsien, I. S. Kohane, and N. McIntosh, "Multiple Signal Integration by Decision Tree Induction to Detect Artefacts in the Neonatal Intensive Care Unit," Artificial Intelligence in Medicine, vol. 19, no. 3, pp. 189-202, 2000.
  5. W. Street, W. Wolberg, and O. Mangasarian, "Nuclear Feature "Extraction for Breast Tumor Diagnosis," Proc. SPIE, pp. 861-870, http://citeseer. ist. psu. edu/street93nuclear. html, 1993.
  6. L. Breiman, "Technical Note: Some Properties of Splitting Criteria," Machine Learning, vol. 24, no. 1, pp. 41-47, 1996.
  7. Langley P, Iba W, Thompson K (1992) An analysis of Bayesian classifiers, In: Proceedings of the tenth National Conference on artificial intelligence, pp. 223-228.
  8. Abdelghani Bellaachia, Erhan Guven, "Predicting Breast Cancer Survivability Using Data Mining Techniques", www. siam. org/meetings/sdm06/workproceed/bellaachia. pdf on Dec 06, 2010.
  9. T. Elomaa and J. Rousu, "General and efficient multisplitting of numerical Attributes", Machine learning, vol 36, no 3, pp 201-244, 1999.
  10. U. M. Fayyad and K. B. Irani, "On the handling of continuous-valued attributes in decision tree generation," Machine Learning, 1992.
  11. T. Elomaa and J. Rousu, "Efficient multisplitting revisited: elimination of partition candidates," Data Mining and knowledge Discovery, vol. 8, no. 2, pp. 97–126, 2004.
  12. Hawarah L, Simonet A, Simonet M(2006) Dealing with Missing Values in a Probabilistic Decision Tree during Classification, The Second International Workshop on Mining Complex Data, pp. 325-329.
  13. T. M. Mitchell, Machine Learning. McGraw-Hill, 1997.
Index Terms

Computer Science
Information Sciences

Keywords

Uncertain Data Decision Tree Classification Data Minings