CFP last date
20 May 2024
Reseach Article

An Ontology based Hybrid Approach to Derive Multidimensional Schema for Data Warehouse

by M. Thenmozhi, K. Vivekanandan
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 54 - Number 8
Year of Publication: 2012
Authors: M. Thenmozhi, K. Vivekanandan
10.5120/8590-2343

M. Thenmozhi, K. Vivekanandan . An Ontology based Hybrid Approach to Derive Multidimensional Schema for Data Warehouse. International Journal of Computer Applications. 54, 8 ( September 2012), 36-42. DOI=10.5120/8590-2343

@article{ 10.5120/8590-2343,
author = { M. Thenmozhi, K. Vivekanandan },
title = { An Ontology based Hybrid Approach to Derive Multidimensional Schema for Data Warehouse },
journal = { International Journal of Computer Applications },
issue_date = { September 2012 },
volume = { 54 },
number = { 8 },
month = { September },
year = { 2012 },
issn = { 0975-8887 },
pages = { 36-42 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume54/number8/8590-2343/ },
doi = { 10.5120/8590-2343 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:55:12.587986+05:30
%A M. Thenmozhi
%A K. Vivekanandan
%T An Ontology based Hybrid Approach to Derive Multidimensional Schema for Data Warehouse
%J International Journal of Computer Applications
%@ 0975-8887
%V 54
%N 8
%P 36-42
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Due to the diversity of data source data integration has become a challenging task. Data warehouse system plays a vital role to integrate the data for making important business decisions. Data within the data warehouse is arranged as multidimensional schema. In past many works exist to carry out the design of the multidimensional schema for data warehouse from either requirements and/or data sources. These approaches are either manual or automated which work with only relational sources. But as today the data warehouse system needs to deal with semi-structured and unstructured sources, the design task becomes much tedious. Recently, ontology has been very useful for different data integration projects. The use of ontology could solve the syntactic and semantic conflicts that arise from heterogeneous sources. It also provides a way for automating the design of multidimensional schema and populating the data warehouse in a more meaningful way. This paper proposes a framework using ontology for the design of multidimensional schema. Our framework uses a hybrid approach where the reconciliation of requirements and data source are done at the early stage of design. We adopt ontology reasoning in order to automatically derive multidimensional elements such as facts and dimensions.

References
  1. Golfarelli M. , Maio D. , Rizzi S. , "The dimensional fact model: a conceptual model for data warehouses", Int. J. Coop. Inf. Syst. 7 (2–3), 1998, 215–247.
  2. M. R. Jensen, T. Holmgren, T. B. Pedersen, Discovering multidimensional structure in relational data, 6th Int. Conf. on Data Warehousing and Knowledge Discovery, LNCS, vol. 3181, Springer, 2004, pp. 138–148.
  3. D. Moody, M. Kortink, From enterprise models to dimensional models: a methodology for data warehouse and data mart design, Proc. of 2nd Int. Workshop on Design and Management of Data Warehouses, CEUR-WS. org, 2000.
  4. Hüsemann B. , Lechtenbörger J. , Vossen G. , "Conceptual data warehouse modeling, Proc. of 2nd Int. Workshop on Design and Management of Data Warehouses",CEUR-WS. org, 2000, p. 6.
  5. R. Winter, B. Strauch, A method for demand-driven information requirements analysis in DW projects, Proc. of 36th Annual Hawaii Int. Conf. on System Sciences, IEEE, 2003, pp. 231–239.
  6. P. Giorgini, S. Rizzi, M. Garzetti, Goal-oriented requirement analysis for data warehouse design, Proc. of 8th Int. Workshop on Data Warehousing and OLAP, ACM Press, 2005, pp. 47–56.
  7. N. Prat, J. Akoka, I. Comyn-Wattiau, A UML-based data warehouse design method, Decision Support Systems 42 (3) (2006) 1449–1473.
  8. Song, I. , Khare, R. , & Dai, B. (2007). SAMSTAR: A Semi-Automated Lexical Method for Generating STAR Schemas from an ER Diagram In I. Song, T. B. Pedersen (Eds. ), Proceedings of ACM 10th International Workshop on Data Warehousing and OLAP; pp 9-16, Lisbon, Portugal: ACM Press.
  9. Phipps C. , & Davis K. C. , "Automating Data Warehouse Conceptual Schema Design and Evaluation". In L. V. S. Lakshmanan (Ed. ), Proceedings of 4th International Workshop on Design and Management of Data Warehouses, 2002, pp 23-32, Toronto, Canada: CEUR-WS. org.
  10. J. -N. Mazon, J. Trujillo, J. Lechtenborger, Reconciling requirement-driven data warehouses with data sources via multidimensional normal forms, Data & Knowledge Engineering 23 (3) (2007) 725–751.
  11. Romero, A. Abelló, Automatic Validation of Requirements to Support Multidimensional Design, Data & Knowledge Engineering 69 (2010) 917–942.
  12. M. Gagnon, Ontology-based Integration of Data Sources, 10th International Conference on Information Fusion, Quebec, Canada, 2007.
  13. Jesús Pardillo, Jose-Norberto Mazón, "Using ontologies for the Design of data warehouses", International Journal of Database Management Systems ( IJDMS ), May 2011.
  14. Yannis Kalfoglou, and Marco Schorlemmer, Ontology mapping: the state of the art, The Knowledge Engineering Review Journal, Vol 18, No. 1, pp. 1-31, 2003.
  15. V. Nebot, R. Berlanga, J. M. Perez, M. J. Aramburu, and T. B. Pedersen. Multidimensional Integrated Ontologies: A Framework for Designing Semantic Data Warehouses. JoDS XIII, 5530:1{35, 2009.
  16. Selma Khouri, Bellatreche Ladje, "A Methodology and Tool for Conceptual Designing a Data Warehouse from Ontology-based Sources", Ecole nationale Supérieure d'Informatique Algiers, Algeria,2010.
  17. O. Romero, Alberto Abelló: A framework for multidimensional design of data warehouses from ontologies. Data Knowl. Eng. 69(11): 1138-1157 (2010)
  18. Lihong Jiang1, Junliang Xu1, Boyi Xu2, Hongming Cai1. An Automatic Method of Data Warehouses Multidimension Modeling for Distributed Information Systems. In Proceedings of 15th International Conference on Computer Supported Cooperative Work in Design (2011) IEEE, 9-16.
  19. Oscar Romero, Alkis Simitsis, and Alberto Abelló, GEM: requirement-driven generation of ETL and multidimensional conceptual designs. In Proceedings of the 13th international conference on Data warehousing and knowledge discovery (DaWaK'11), Alfredo Cuzzocrea and Umeshwar Dayal (Eds. ). Springer-Verlag, Berlin, Heidelberg, (2011) 80-95.
  20. M. Thenmozhi, K. Vivekanandan, A Framework to Derive Multidimensional Schema for Data Warehouse Using Ontology, Proceedings of National Conference on Internet and WebSevice Computing, NCIWSC (2012).
  21. http://sourceforge. net/projects/rdbtoonto/
  22. Toni Rodrigues, Pedro Rosa, Jorge Cardoso. Moving from syntactic to semantic organizations using JXML2OWL. Computers in Industry, 59(8):808-819, 2008.
  23. Paul Buitelaar, Daniel Olejnik, Michael Sintek (2003) OntoLT: A Protege Plug-In for Ontology Extraction from Text Demo Session at the International Semantic Web Conference, Sanibel Island, Florida, USA
  24. Natalya Fridman Noy, Mark A. Musen: PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment. AAAI/IAAI 2000: 450-455
  25. http://www. w3. org/2004/02/skos/
  26. Giunchiglia, Fausto and Shvaiko, Pavel and Yatskevich, Mikalai. Semantic Matching: Algorithms and Implementation. Technical Report DIT-07-001, Department of Information Engineering and Computer Science, University of Trento. In Journal of Data Semantics (JoDS), IX, 2007.
  27. Jérôme Euzenat, Pavel Shvaiko: Ontology matching. Springer-Verlag, Berlin Heidelberg (DE), 2007: 1-333
  28. http://jena. apache. org/
  29. http://org. mindswap. pellet
  30. L. Fr´?as, A. Queralt, and A. Oliv´e. EU-Rent Car Rentals Specification. Technical report, "Dept. de Llenguatges i Sistemes Inform`atics", 2003. www. lsi. upc. edu/dept/techreps/llistat_detallat. php?id=690.
Index Terms

Computer Science
Information Sciences

Keywords

Data Modelling Multidimensional Schema Data warehouse Ontology