International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 25 - Number 3
Year of Publication: 2011
Authors: Anuradha, A.K.Sharma |
10.5120/3010-4060
Anuradha, A.K.Sharma. Structure based Data Extraction from Hidden Web Sources: A Review. International Journal of Computer Applications. 25, 3 (July 2011), 32-37. DOI=10.5120/3010-4060
To extract data from the web pages of Hidden Web sources, many semi-automatic and automatic techniques have been proposed that rely on the structure and tags of HTML documents. These techniques include machine learning and schema-matching approaches to the data extraction problem. This paper reviews the research on data extraction from Hidden Web sources and discusses the advantages and disadvantages of the existing techniques.