CFP last date
20 December 2024
Reseach Article

An Automatic Annotation Technique for Web Search Results

by Rosamma K S, Jiby J Puthiyidam
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 119 - Number 24
Year of Publication: 2015
Authors: Rosamma K S, Jiby J Puthiyidam
10.5120/21383-4375

Rosamma K S, Jiby J Puthiyidam . An Automatic Annotation Technique for Web Search Results. International Journal of Computer Applications. 119, 24 ( June 2015), 11-16. DOI=10.5120/21383-4375

@article{ 10.5120/21383-4375,
author = { Rosamma K S, Jiby J Puthiyidam },
title = { An Automatic Annotation Technique for Web Search Results },
journal = { International Journal of Computer Applications },
issue_date = { June 2015 },
volume = { 119 },
number = { 24 },
month = { June },
year = { 2015 },
issn = { 0975-8887 },
pages = { 11-16 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume119/number24/21383-4375/ },
doi = { 10.5120/21383-4375 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:04:58.324403+05:30
%A Rosamma K S
%A Jiby J Puthiyidam
%T An Automatic Annotation Technique for Web Search Results
%J International Journal of Computer Applications
%@ 0975-8887
%V 119
%N 24
%P 11-16
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The uses of web search engines are very frequent and common worldwide over the internet by end users for different purposes. A web search engine takes the query request from the end user and executes that query on relational database used to store the information on behalf of that web search engine. Based on input queries the dynamic response is generated by search engine, in the form of HTML based pages. Such pages are supported with the web databases. Every web page generated contains many results to display for particular query, called as Search Result Records (SRRs). Sometimes it becomes troublesome to extract relevant data from diverse sources. The SRRs generated may contain data units that are relevant to one common semantic. These SRRs are further required to be assigned with proper labels. The manual methods for record extraction and labeling have a worse scalability. Thus automatic annotation based method is needed to improve the accuracy as well as scalability of web search engines. This paper presents an automatic annotation technique for web search results. The proposed approach first aligns the data units on a result page into different groups such that the data in the same group have the same semantic. Then, each group is annotated from different aspects and aggregates the different annotations to predict a final annotation label for it. The annotation wrapper generated for the search site is automatically constructed and can be used to annotate new result pages from the same web database. Experiments indicate that the proposed approach is highly effective.

References
  1. Wei Liu, Xiaofeng Meng, Member, IEEE, and Weiyi Meng, Member, IEEE, ViDE: A Vision-Based Approach for Deep Web Data Extraction
  2. D. Raghu, V. Sridhar Reddy, Ch. Raja Jacob, Dynamic Vision-Based Approach in Web Data Extraction
  3. R. Vijay1, Dr. K. Prasadh, A Vision Based Approach for Web Data Extraction using Enhanced Co-citation Algorithm
  4. Arvind Arasu, Hector Garcia-Molina, Extracting Structured Data from Web Pages, Stanford University
  5. Yanhong Zhai, Bing Liu, Department of Computer Science University of Illinois at Chicago, Web Data Extraction Based on Partial Tree Alignment
  6. P. V. Praveen Sundar1 1Research Scholar, Hindusthan College of Arts &Science, Coimbatore, India, Towards Automatic Data Extraction Using Tag and Value Similarity Based on Structural -Semantic Entropy
  7. T. Seeniselvi1, N. Thangamani. 2 1Associate Professor, Hindusthan college of Arts and Science, India, Semantic Similarity Based Data Alignment and Best Feature Extraction using PSO for Annotating Search Results from Web Databases
  8. L. Kaufman and P. Rousseeuw, Finding Groups in Data: An Introduction to Cluster Analysis. John Wiley & Sons, 1990.
  9. Hongjun Lu, Integrating Database and World Wide Web Technologies, National University of Singapore
  10. Jungwha Hong, How to Build a Web Database: A Case Study, The University of Texas at Austin.
  11. Yiyao Lu, Hai He, Hongkun Zhao, Weiyi Meng, Member, IEEE, and Clement Yu, Senior Member, IEEE, Annotating Search Results from Web Databases
Index Terms

Computer Science
Information Sciences

Keywords

Web Database Annotation Data alignment Annotation Wrapper