CFP last date
20 May 2024
Call for Paper
June Edition
IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2024

Submit your paper
Know more
Reseach Article

Constructing Semantic Web Form from Unstructured Web Page

by Amira AbdEl-atey, Sherif El-etriby, Arabi kishk
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 30 - Number 10
Year of Publication: 2011
Authors: Amira AbdEl-atey, Sherif El-etriby, Arabi kishk
10.5120/3675-5137

Amira AbdEl-atey, Sherif El-etriby, Arabi kishk . Constructing Semantic Web Form from Unstructured Web Page. International Journal of Computer Applications. 30, 10 ( September 2011), 34-41. DOI=10.5120/3675-5137

@article{ 10.5120/3675-5137,
author = { Amira AbdEl-atey, Sherif El-etriby, Arabi kishk },
title = { Constructing Semantic Web Form from Unstructured Web Page },
journal = { International Journal of Computer Applications },
issue_date = { September 2011 },
volume = { 30 },
number = { 10 },
month = { September },
year = { 2011 },
issn = { 0975-8887 },
pages = { 34-41 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume30/number10/3675-5137/ },
doi = { 10.5120/3675-5137 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:16:44.634084+05:30
%A Amira AbdEl-atey
%A Sherif El-etriby
%A Arabi kishk
%T Constructing Semantic Web Form from Unstructured Web Page
%J International Journal of Computer Applications
%@ 0975-8887
%V 30
%N 10
%P 34-41
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Semantic web is a kind of webs that is able to describe things to be understood by computers. Automatically answering any query without human interactions is one of the key challenges in computer science area. Semantics can help in answering such queries. Consequently, extracting information from unstructured documents and transforming them into semantic web form is an important trend. Semantic web mining is a combination of two trends; semantic web and web mining. Our extracting and structuring system clarify the meaning of the web mining. The obtained data converted to the semantic web format. And so, the semantic web mining trend was illustrated. This paper concentrates on extracting data from the web page tables. Data on the Web in the HTML tables are mostly structured. However; we usually do not know the structure in advance. Thus, data of interest cannot be directly queried. Data extraction and structuring system is proposed to put data extracted into the semantic web form. After putting extracted data in the semantic web format, it can be queried using semantic web query language. Experimental results show that the data of interest can be located and build its new structure using semantic web.

References
  1. Antoniou, G. and Harmelen, F. V. 2008. A semantic Web primer - The MIT Press, second edition.
  2. Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Ives, Z., Cyganiak, R. 2007. DBpedia: A Nucleus for a Web of Open Data. 6th International Semantic Web Conference, 2nd Asian Semantic Web Conference.
  3. Baumgartner, R. , Gatterbauer, W. and Gottlob, G. 2009. Web Data Extraction System,” in encyclopedia of database systems.
  4. Berners-Lee, T. 2005. Primer: Getting into RDF & Semantic Web using N3. In WWW. http://www.w3.org/2000/10/swap/Primer.html.
  5. Berners-Lee, T. 1999. Weaving the Web: The Original Design and Ultimate Destiny of the World Wide Web by Its Inventor, ISBN 1402842937.
  6. Bizer, C. 2003. D2R MAP: A database to RDF mapping language. In WWW.
  7. Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R. and Hellmann, S. “DBpedia – A Crystallization Point for the Web of Data”, Journal of Web Semantics: Science, Services and Agents on the World Wide Web, Issue 7, Pages 154–165, 2009.
  8. Embley, D. W., Tao, C. and Liddle, S. W. 2002. Automatically Extracting Ontologically Specified Data from HTML Tables with Unknown Structure. In Proceedings of the 21st International Conference on Conceptual Modeling.
  9. Embley, D. W., Tao, C. and Liddle, S. W. ”Automating the extraction of data from HTML tables with unknown structure,” in Data & Knowledge Engineering, vol.54, issue.1, 2005, pp.3-28.
  10. Embley, D. W., Campbell, D. M., Smith, R. D. and Liddle, S. W. 1998. Ontology-based extraction and structuring of information from data-rich unstructured documents. in Proceeding CIKM '98 Proceedings of the seventh international conference on Information and knowledge management.
  11. Prud'hommeaux, E. and Seaborne, A. January 2008. SPARQL Query Language for RDF. In WWW. http://www.w3.org/TR/rdf-sparql-query.
  12. Shadbolt, N., Hall, W. and Berners-Lee, T. June-2006. The Semantic Web Revisited. IEEE Intelligent Systems.
  13. Stumme, G., Hotho, A. and Berendt, B. “Semantic Web Mining: State of the art and future directions”, Journal of Web Semantics: Science, Services and Agents on the World Wide Web, Volume 4, Issue 2, pages 124-143, 2006.
Index Terms

Computer Science
Information Sciences

Keywords

semantic web information extraction information structuring natural language processing natural language processing wrapper generation semantic web mining extracting and structuring data