CFP last date
20 May 2024
Reseach Article

Machine Learning based Vocabulary Management Tool Assessment for the Linked Open Data

by Ahsan Morshed, Ritaban Dutta
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 60 - Number 9
Year of Publication: 2012
Authors: Ahsan Morshed, Ritaban Dutta
10.5120/9724-4197

Ahsan Morshed, Ritaban Dutta . Machine Learning based Vocabulary Management Tool Assessment for the Linked Open Data. International Journal of Computer Applications. 60, 9 ( December 2012), 51-58. DOI=10.5120/9724-4197

@article{ 10.5120/9724-4197,
author = { Ahsan Morshed, Ritaban Dutta },
title = { Machine Learning based Vocabulary Management Tool Assessment for the Linked Open Data },
journal = { International Journal of Computer Applications },
issue_date = { December 2012 },
volume = { 60 },
number = { 9 },
month = { December },
year = { 2012 },
issn = { 0975-8887 },
pages = { 51-58 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume60/number9/9724-4197/ },
doi = { 10.5120/9724-4197 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:06:09.134317+05:30
%A Ahsan Morshed
%A Ritaban Dutta
%T Machine Learning based Vocabulary Management Tool Assessment for the Linked Open Data
%J International Journal of Computer Applications
%@ 0975-8887
%V 60
%N 9
%P 51-58
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Reusing domain vocabularies in the context of developing the knowledge based Linked Open data system is the most important discipline on the web. Many editors are available for developing and managing the vocabularies or Ontologies. However, selecting the most relevant editor is very difficult since each vocabulary construction initiative requires its own budget, time, resources. In this paper a novel unsupervised machine learning based comparative assessment mechanism has been proposed for selecting the most relevant editor. Defined evaluation criterions were functionality, reusability, data storage, complexity, association, maintainability, resilience, reliability, robustness, learnability, availability, flexibility, and visibility. Principal component analysis (PCA) was applied on the feedback data set collected from a survey involving sixty users. Focus was to identify the least correlated features carrying the most independent information variance to optimize the tool selection process. An automatic evaluation method based on Bagging Decision Trees has been used to identify the most suitable editor. Three tools namely Vocbench, TopBraid EVN and Pool Party Thesaurus Manager have been evaluated. Decision tree based analysis recommended the Vocbench and the Pool Party Thesaurus Manager are the better performer than the TopBraid EVN tool with very similar recommendation scores.

References
  1. T. R. Gruber, A translation approach to portable ontologies. Knowledge Acquisition, 5(1993), 199-220.
  2. Y. Sure, Onto-To-Knowledge-Ontology based Knowledge Management Tools and their Application, German Journal Kuentliche Intelligenz, Special Issue on Knowledge Management (01/02), 2002.
  3. Stanford medical informatics Home page, at URL: http://www. smi. stanford. edu/
  4. Ontoprise®GmbH (1999), Onto Edit tutorial Homepage [Online], at URL: http://www. ontoprise. de/documents/tutorial_ontoedit. pdf
  5. Computus (1985) Home page [Online], at URL: www. computus. com
  6. L. Stojanovic and B. Motik, Ontology evolution within ontology editor, 13th International Conference on Knowledge Engineering and Knowledge Management EKAW02, Sigüenza (Spain)
  7. M. Denny, Ontology Building: A survey of Editing Tools, Nov 6, 2002.
  8. Jürgen, York Sure, White paper: Evaluation of ontology-based tool, 13th International Conference on Knowledge Engineering and Knowledge Management EKAW02, Sigüenza (Spain).
  9. P. Lambrix and A. Edberg, Evaluation of ontology merging tools in bioinformatics, Proceedings of the Pacific Symposium on Biocomputing PSB03, 8:589-600, Kauai, Hawaii, USA, 2003.
  10. David Lane's Homepage. http://davidmlane. com/hyperstat/A15885. html
  11. World Wide Web. http://www. w3. org/
  12. Oracle. http://www. oracle. com/index. html
  13. M. A. Musen (2000), the Evolution of Protégé: An Environment for Knowledge-Based Systems Development, Retrieved October 10, 2004 at URL: http://www. smi. stanford. edu/pubs/SMI_Reports/SMI-2002-0943. pdf
  14. N. Noy, R. W. Fergerson, M. A. Musen (2000), The knowledge model of Protégé-2000: combining interoperability and flexibility, Retrieved October 11, 2004 atURL:http://wwwsmi. stanford. edu/pubs/SMI_Reports/SMI-2000-0830. pdf website
  15. Search Engine Yahoo: www. yahoo. com
  16. Amazon. www. amazon. com
  17. A. Morshed and R. Singh, Master thesis (No: 05-x-223) Evaluation and Ranking of Ontology Construction Tools, Royal Institute of Technology, 13th Jan, 2005
  18. Asunción Gómez-Pérez (1999), Evaluation of Taxonomic Knowledge in Ontologies and Knowledge Bases, October 16 Twelfth Workshop on Knowledge Acquisition, Modeling and Management, Voyager Inn, Banff, Alberta, Canada.
  19. Jackson, J. E. , A User's Guide to Principal Components, John Wiley and Sons, 1991, p. 592.
  20. Jolliffe, I. T. , Principal Component Analysis, 2nd edition, Springer, 2002.
  21. Krzanowski, W. J. Principles of Multivariate Analysis: A User's Perspective. New York: Oxford University Press, 1988.
  22. Seber, G. A. F. , Multivariate Observations, Wiley, 1984
  23. Altman, E. , "Financial Ratios, Discriminant Analysis and the Prediction of Corporate Bankruptcy," Journal of Finance, Vol. 23, No. 4, (Sep. , 1968), pp. 589-609.
  24. Basel Committee on Banking Supervision, "Studies on the Validation of Internal Rating Systems," Bank for International Settlements (BIS), Working Papers No. 14, revised version, May 2005. Available at: http://www. bis. org/publ/bcbs_wp14. htm.
  25. Basel Committee on Banking Supervision, "International Convergence of Capital Measurement and Capital Standards: A Revised Framework," Bank for International Settlements (BIS), comprehensive version, June 2006. Available at: http://www. bis. org/publ/bcbsca. htm.
  26. Loeffler, G. , and P. N. Posch, Credit Risk Modeling Using Excel and VBA, West Sussex, England: Wiley Finance, 2007.
  27. Merton, R. , "On the Pricing of Corporate Debt: The Risk Structure of Interest Rates," Journal of Finance, Vol. 29, No. 2, (May, 1974), pp. 449-70.
Index Terms

Computer Science
Information Sciences

Keywords

Principal component analysis Bagging Decision Trees Feature Clustering Decision Making