CFP last date
20 May 2024
Reseach Article

Article:Evaluation of Expectation Maximization Based Clustering Approach for Reusability Prediction of Function based Software systems

by Dr Himani Goel, Er Gurbhej Singh
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 8 - Number 13
Year of Publication: 2010
Authors: Dr Himani Goel, Er Gurbhej Singh
10.5120/1308-1705

Dr Himani Goel, Er Gurbhej Singh . Article:Evaluation of Expectation Maximization Based Clustering Approach for Reusability Prediction of Function based Software systems. International Journal of Computer Applications. 8, 13 ( October 2010), 13-20. DOI=10.5120/1308-1705

@article{ 10.5120/1308-1705,
author = { Dr Himani Goel, Er Gurbhej Singh },
title = { Article:Evaluation of Expectation Maximization Based Clustering Approach for Reusability Prediction of Function based Software systems },
journal = { International Journal of Computer Applications },
issue_date = { October 2010 },
volume = { 8 },
number = { 13 },
month = { October },
year = { 2010 },
issn = { 0975-8887 },
pages = { 13-20 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume8/number13/1308-1705/ },
doi = { 10.5120/1308-1705 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T19:57:16.549442+05:30
%A Dr Himani Goel
%A Er Gurbhej Singh
%T Article:Evaluation of Expectation Maximization Based Clustering Approach for Reusability Prediction of Function based Software systems
%J International Journal of Computer Applications
%@ 0975-8887
%V 8
%N 13
%P 13-20
%D 2010
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In this study Expectation Maximization based Clustering approach is evaluated for Reusability Prediction of Function based Software systems. Here, the metric based approach is used for prediction. The function oriented dataset considered have the output attribute as Reusability value. The Reusability in the dataset is expressed in terms of six numeric labels i.e. 1, 2, 3, 4, 5 and 6. The label 1 represents Nil and the label 6 represents the Excellent Reusability Label. A framework of metrics are used to target those the essential attributes of function oriented features towards measuring the reusability of software modules, so it tried to analyze, refine and use following metrics to explore different structural dimensions of Function oriented components: Cyclometric Complexity Using Mc Cabe’s Measure, Halstead Software Science Indicator, Regularity Metric, Reuse-Frequency Metric and Coupling Metric. The input attributes are expressed in the three linguistic labels i.e. 1, 2, and 3. The label 1 corresponds to the Low value, label 2 corresponds to the Medium value and label 3 corresponds to the high value.Five Input metrics are used as Input and clusters are formed using EM. EM assigns a probability distribution to each instance which indicates the probability of it belonging to each of the clusters.Thereafter 10 fold cross validation performance of the system is recorded. The results are expressed in Precision, Recall and Accuracy values. Precision for a class is the number of true positives (i.e. the number of items correctly labeled as belonging to the positive class) divided by the total number of elements labeled as belonging to the positive class (i.e. the sum of true positives and false positives, which are items incorrectly labeled as belonging to the class). Recall is defined as the number of true positives divided by the total number of elements that actually belong to the positive class (i.e. the sum of true positives and false negatives, which are items which were not labeled as belonging to the positive class but should have been). Hence, Precision can be seen as a measure of exactness or fidelity, whereas Recall is a measure of completeness. Accuracy is the percentage of the predicted values that match with the expected values of the reusability for the given data. As deduced from the results it is clear that Precision and Recall

References
  1. Anderson, J.A (2003) “An Introduction To Neural Networks”, Prentice Hall of India.
  2. Arnold, R.S. (1990) “Heuristics for Salvaging Reusable Parts From Adav Code”, SPC TechnicalReport, ADA_REUSE_HEURISTICS-90011-N, March 1990.
  3. Arnold, R.S. (1990) “Salvaging Reusable Parts From Ada Code: A Progress Report”, SPC Technical Report, SALVAGE_ADA_PARTS_PR-90048-N, September 1990.
  4. Basili, V. R. and Rombach, H. D. (1988) “The TAME Project: Towards Improvement Oriented Software Environments”, IEEE Trans. Software Eng., vol. 14, no. 6, June 1988, pp. 758-771.
  5. Basili, V.R. (1989) “Software Development: A Paradigm for the Future”, Proceedings COMPAC’89, Los Alamitos, California, IEEE CS Press, 1989, pp. 471-485.
  6. Boetticher, G. and Eichmann, D. (1993) “A Neural Network Paradigm for Characterizing Reusable Software”, Proceedings of the Australian Conference on Software Metrics, Australia, July, 1993, pp. 234-237.
  7. Boetticher, G., Srinivas, K. and Eichmann, D. (1990) “A Neural Net-Based Approach to the Software Metrics” Proceedings of the 5th International Conference on Software Engineering and Knowledge Engineering, San Francisco, CA, 14-18 June 1990, pp. 271-274.
  8. Caldiera, G. and Basili, V. R. (1991) “Identifying and Qualifying Reusable Software Components,” IEEE Computer, February 1991.
  9. Chen, Y. F. Nishimoto, M. Y. and Ramamoorty, C. V. “The C Information Abstraction System”, IEEE Trans. on Software Engineering, Vol. 16, No. 3, March 1990.
  10. Dunn, M. F. and Knight, J. C. (1993) “Software reuse in Industrial setting: A Case Study”, Proc. of the 13th International Conference on Software Engineering, Baltimore, MA, 1993. pp. 56-62.
  11. Esteva, J. C. and Reynolds, R. G. (1991) “Identifying Reusable Components using Induction”, International Journal of Software Engineering and Knowledge Engineering, Vol. 1, No. 3 , 1991, pp. 271-292.
  12. Frakes, W.B. and Kyo Kang (2005) “Software Reuse Research: Status and Future”, IEEE Trans. Software Engineering, vol. 31, issue 7, July 2005, pp. 529 - 536.
  13. Jang, J-S. R. and Sun, C.T. (1995) “Neuro-fuzzy Modeling and Control”, Proceeding of IEEE, March 1995, pp. 123-135.
  14. Jerome Feldman (1996) “Neural Networks - A Systematic Introduction” Berlin, New-York, 1996.
  15. Kartalopoulos, S. V. (1996) “Understanding Neural Networks and Fuzzy Logic-Basic Concepts and Applications”, IEEE Press, 1996, pp. 153-160.
  16. Klir, G. J. and Yuan, B. (1995) “Fuzzy Sets and Fuzzy Logic” Prentice-Hall, New Jersey.
  17. Mayobre, G. (1991) “Using Code Reusability Analysis to Identify Reusable Components from Software Related to an Application Domain,” Proceeding of the Fourth Workshop on Software Reuse, Reston. VA, November, 1991, pp. 87-96.
  18. Nguyen, H.T. and Walker, E.A. (1997) “A first course in Fuzzy Logic” Boca Raton FLA. CRC Press, 1997.
  19. Melanie Mitchell (1996) “An Introduction to Genetic Algorithm”, MIT Press, 1996.
  20. Parvinder Singh and Hardeep Singh (2005) “Critical Suggestive Evaluation of CK METRIC”, Proc. of 9th Pacific Asia Conference on Information Technology (PACIS-2005), Bangkok, Thailand, July 7 – 10, 2005, pp 234-241.
  21. Poulin, J. S. (1997) “Measuring Software Reuse–Principles”, Practices and Economic Models, Addison-Wesley, 1997.
  22. Richard, W. S. (2005) “Enabling Reuse-Based Software Development of Large-Scale Systems”, IEEE Trans. on Software Engineering, Vol. 31, No. 6, June 2005 pp. 495-510.
  23. Selby, R. W. (1988) “Empirically Analyzing Software Reuse in a Production Environment”, Software Reuse: Emerging Technology, W. Tracz, ed, IEEE Computer Society Press, 1988.
  24. Stender (1994) “Introduction to genetic algorithms”, IEEE Colloquium on Genetic Algorithms, Volume 2, March 15, 1994 pp. 1-4.
Index Terms

Computer Science
Information Sciences

Keywords

Fault Prediction Cyclometric Complexity volume Regularity Metric Reuse-Frequency Metric Coupling Metric Precision Recall Accuracy confusion matrix expectation maximisation