CFP last date
22 April 2024
Reseach Article

Predicting Fault in the Process of Producing Important Android Applications using Data Mining Techniques

by Asrin Hosseini, Amir Sheikh-Ahmadi
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 131 - Number 13
Year of Publication: 2015
Authors: Asrin Hosseini, Amir Sheikh-Ahmadi
10.5120/ijca2015907216

Asrin Hosseini, Amir Sheikh-Ahmadi . Predicting Fault in the Process of Producing Important Android Applications using Data Mining Techniques. International Journal of Computer Applications. 131, 13 ( December 2015), 43-47. DOI=10.5120/ijca2015907216

@article{ 10.5120/ijca2015907216,
author = { Asrin Hosseini, Amir Sheikh-Ahmadi },
title = { Predicting Fault in the Process of Producing Important Android Applications using Data Mining Techniques },
journal = { International Journal of Computer Applications },
issue_date = { December 2015 },
volume = { 131 },
number = { 13 },
month = { December },
year = { 2015 },
issn = { 0975-8887 },
pages = { 43-47 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume131/number13/23698-2015907216/ },
doi = { 10.5120/ijca2015907216 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:27:18.146136+05:30
%A Asrin Hosseini
%A Amir Sheikh-Ahmadi
%T Predicting Fault in the Process of Producing Important Android Applications using Data Mining Techniques
%J International Journal of Computer Applications
%@ 0975-8887
%V 131
%N 13
%P 43-47
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The aim of this research is to predict fault in the process of producing important Android applications using data mining techniques. Predictable models must not only be correct in predicting fault, but also must be understandable, which needs the user to understand the motivation of the model prediction. Unfortunately, understandability of the fault types is ignored in order to achieve the predictable efficiency of the fault prediction models. In order to solve this problem, some trees are extracted from the random forests and support vector machines for the logistic regression and the rule extraction algorithm are used; also, NASA MDP data are used for extracting the model. The method of evaluating the prediction of software fault is the use of ALPA algorithm and extraction of random trees (RF) and (Logistic) regression for weka software. In the method of creating trees, (REPTree and C4.5) and black box model (Logistic, RF) are used. The results show that the trees extracted from the black box models discuss the prediction way of black box which is more understandable as well as more correct than the trees from direct work on the data.

References
  1. Ostrand, T. and E. Weyuker, on the automation of software fault prediction. 2006.
  2. Khoshgoftaar, T. and N. Seliya, Comparative Assessment of Software Quality Classification Techniques: An Empirical Case Study. 2004. 9(3).
  3. Martens, D., et al., Performance of classification models from a user perspective. Decis. 2011. 51(4).
  4. Verbeke, W., et al., New insights into churn prediction in the telecommunication sector: a profit driven data mining approach. 2012. 218(1).
  5. Baesens, B., et al., Using neural network rule extraction and decision tables for credit-risk evaluation. 2003. 49(3).
  6. Moeyersoms, J., et al., Comprehensible software fault and effort prediction: A data mining approach. 2015. 100.
  7. Weben S, H.W., Kimmich J, Systematic testing of data abstractions based on software specifications. J. Software Testing, Verification and Reliability, 1992. 1(4): p. 39-55.
  8. B. S. Gulavani, T.A.H., Y. Kannan, A. V. Nori, and S. K. Rajamani, Synergy: A new algorithm for property checking. In Proceedings of the 14th Annual Symposium on Foundations of Software Engineering (FSE), 2006.
  9. Korel, B., A Dynamic Approach of Test Data Generation. In IEEE Conference on Software Maintenance, 1990: p. 311-317.
  10. Hayes, J.O.a.J., A Semantic Model of Program Faults. In Proceedings of ISSTA’96 (International Symposium on Software Testing and Analysis), 1996: p. 195-200.
  11. Vandecruys, O., et al., Mining software repositories for comprehensible software fault prediction models. 2008. 81.
  12. Binkley, D., et al., Increasing diversity: Natural language measures for software fault prediction. 2009. 82(11).
  13. Erturk, E. and E. Akcapinar Sezar, A comparison of some soft computing methods for software fault prediction. 2015. 42(4).
  14. Junqué de Fortuny, E., Martens, D, Active Learning-Based Pedagogical Rule Extraction. University of Antwerp, 2014.
  15. Shepperd, M., Kadoda, G, Comparing software prediction techniques using simulation. IEEE Trans. Softw. Eng, 2001. 27(11): p. 1014-1022.
  16. Tan, P., Steinbach, M., Kumar, V, Introduction to Data Mining. Pearson Education, Boston, USA, 2006.
  17. Breiman, L., Random forests. Mach. Learn, 2001. 45(1): p. 2-32.
  18. Quinlan, J., C4. 5: Programs for Machine Learning. Morgan Kaufmann Publish-ers Inc., San Francisco, CA, USA, 1993.
  19. Rokach, L., Maimon, O, Data Mining with Decision Trees. World Scientific Publishing Co, 2008.
  20. Huysmans, J., Dejaeger, K., Mues, C., Vanthienen, J., Baesens, B, An empiri-cal evaluation of the comprehensibility of decision table, tree and rule based predictive models. Decis. Support Syst, 2011. 51(1): p. 141–154.
  21. Witten, I., Frank, E., Hall, M, Data Mining: Practical Machine Learning Tools and Techniques. Morgan Kaufmann, 2011.
Index Terms

Computer Science
Information Sciences

Keywords

Fault prediction software effort Android data mining