We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 November 2024
Reseach Article

Semantic Structure Representation of HTML Document Suitable for Semantic Document Retrieval

by Nidhi Tyagi, Rahul Rishi, R.p. Agarwal
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 46 - Number 13
Year of Publication: 2012
Authors: Nidhi Tyagi, Rahul Rishi, R.p. Agarwal
10.5120/6973-9589

Nidhi Tyagi, Rahul Rishi, R.p. Agarwal . Semantic Structure Representation of HTML Document Suitable for Semantic Document Retrieval. International Journal of Computer Applications. 46, 13 ( May 2012), 39-43. DOI=10.5120/6973-9589

@article{ 10.5120/6973-9589,
author = { Nidhi Tyagi, Rahul Rishi, R.p. Agarwal },
title = { Semantic Structure Representation of HTML Document Suitable for Semantic Document Retrieval },
journal = { International Journal of Computer Applications },
issue_date = { May 2012 },
volume = { 46 },
number = { 13 },
month = { May },
year = { 2012 },
issn = { 0975-8887 },
pages = { 39-43 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume46/number13/6973-9589/ },
doi = { 10.5120/6973-9589 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:39:40.335031+05:30
%A Nidhi Tyagi
%A Rahul Rishi
%A R.p. Agarwal
%T Semantic Structure Representation of HTML Document Suitable for Semantic Document Retrieval
%J International Journal of Computer Applications
%@ 0975-8887
%V 46
%N 13
%P 39-43
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The information on the WWW is available in various formats. The RDF and XML representation provides semantic knowledge about the document where as HTML mark-up only indicates the structure and lay-out of documents, but not the document semantics. The representation of the HTML document to semantic form can facilitate the extraction of knowledge from these documents in a more efficient manner. This paper proposes a technique for providing semantic structure to the HTML documents and stores it in the knowledge base as predicates, which helps in the retrieval of context related documents

References
  1. T. Berners-Lee, J. Hendler, and O. Lasilla, " The SemanticWeb", Scientific American, May 2001.
  2. W3C. Semantic Web activity statement. W3CTechnology & Society Domain Activity Statement,Available: http://www. w3. org/2001/sw/Activity. 2002.
  3. Berners-Lee, T. , Weaving the Web: The original design and ultimate destiny of the WorldWide Web New York, NY: HarperCollins, 2000.
  4. Terje Brasethvik and Jon Atle Gulla ,"A ConceptualModeling Approach to Semantic Document Retrieval", Advanced Information Systems Engineering, 14th International Conference, pp. 167-182, 2002.
  5. Comfort T. Akinribido, Babajide S. Afolabi , Bernard I Akhigbe and Ifiok J. Udo," A Fuzzy-Ontology Based Information Retrieval System for Relevant Feedback", International Journal of Computer Science Issues, Vol. 8, Issue 1, January 2011.
  6. Tim Finin, Li Ding, Rong Pan, Anupam Joshi, Pranam Kolari, Akshay Java and Yun Peng, "Swoogle: Searching for knowledge on the Semantic Web"University of Maryland Baltimore County, Baltimore.
  7. Marut Buranarach ," A framework for the organization and discovery of information resources in a www environment using association, classification and deduction", December 13,2004.
  8. Tool: Light HTML to XML converter.
  9. Sekine Proteus Project - Apple Pie Parser, http://nlp. cs. nyu. edu/app (Corpus based Parser) 2006.
  10. Parul Gupta and A. K. Sharma," Context based Indexing in Search Engines using Ontology", International Journal of Computer Applications,Volume 1 No. 14, pp 49-52, 2010.
  11. Nidhi Tyagi, Rahul Rishi and R. P. Agrawal," Context based Web Indexing for Storage of Relevant Web Pages", International Journal of Computer Applications (0975 – 8887) Volume 40– No. 3, February 2012.
  12. Pell, "POWERSET - Natural Language and the Semantic Web". The 6th International Semantic Web Conference and the 2nd Asian Semantic Web Conference, 2007.
Index Terms

Computer Science
Information Sciences

Keywords

Semantic Representation Contextual Data Html Xml Knowledge Base Predicate