International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 85 - Number 6 |
Year of Publication: 2014 |
Authors: Abhishek Singh Rathore, Devshri Roy |
10.5120/14849-3211 |
Abhishek Singh Rathore, Devshri Roy . Ontology based Web Page Topic Identification. International Journal of Computer Applications. 85, 6 ( January 2014), 35-40. DOI=10.5120/14849-3211
With the emergence of the web, lots of research efforts are made in the area of Web Mining. This paper proposes an automatic approach for automatic topic identification from the web pages. The contribution of this research is in the approach of automatic topic identification of web pages that can provide better results. The topic of the web documents is identified through ontological approach. Keywords are extracted from the basic HTML tags and co-occurrence of words in the text instead of calculating the frequency of each term exits in a web page. Domain ontology is developed to map topics of the documents. Keywords are mapped to the ontology with a Levenshtein Edit Distance to extract topic of the web page. The result could give benefit to the search engines for faster tagging of web pages.