International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 150 - Number 1
Year of Publication: 2016
Authors: Rashmi K. B., Vijaya Kumar T., H. S. Guruprasad |
10.5120/ijca2016911448 |
Rashmi K. B., Vijaya Kumar T., H. S. Guruprasad. Deep Web Crawler: Exploring and Re-ranking of Web Forms. International Journal of Computer Applications. 150, 1 (Sep 2016), 32-35. DOI=10.5120/ijca2016911448
A large portion of the web, known as the deep web, is accessible through search interfaces to the many databases on the web. Deep web crawling addresses the problem of surfacing the hidden content behind these search interfaces. Because the web is dynamic and data sources change constantly, discovering these resources is crucial. This paper proposes a two-level application, a deep web crawler, for gathering relevant searchable forms. At the first level, the crawler explores forms by reverse searching from a given seed site, ranking sites to prioritize the highly relevant ones, and extracting links to find forms. At the second level, it searches the forms based on preference, and the results are improved by re-ranking based on user feedback.
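The two levels described above can be illustrated with a minimal sketch. It is not the authors' implementation: the term-overlap `relevance` score, the `Form` record, the `weight` parameter, and the function names `rank_sites` and `rerank_forms` are all assumptions made for illustration. The sketch shows only the ordering logic, namely visiting the most relevant sites first and then re-ordering the discovered forms once user feedback is available.

```python
import heapq
from dataclasses import dataclass


def relevance(text: str, topic_terms: set[str]) -> float:
    """Fraction of topic terms found in the text (an assumed stand-in score)."""
    words = set(text.lower().split())
    return len(words & topic_terms) / len(topic_terms) if topic_terms else 0.0


@dataclass
class Form:
    url: str
    score: float       # initial relevance score of the form
    feedback: int = 0  # net user votes: +1 relevant, -1 not relevant


def rank_sites(sites: dict[str, str], topic_terms: set[str]) -> list[str]:
    """Level 1 (sketch): order candidate sites, most relevant first."""
    # heapq is a min-heap, so scores are negated to pop the highest first.
    heap = [(-relevance(desc, topic_terms), url) for url, desc in sites.items()]
    heapq.heapify(heap)
    return [heapq.heappop(heap)[1] for _ in range(len(heap))]


def rerank_forms(forms: list[Form], weight: float = 0.1) -> list[Form]:
    """Level 2 (sketch): combine the initial score with user feedback."""
    return sorted(forms, key=lambda f: f.score + weight * f.feedback, reverse=True)
```

A priority queue keeps the crawl frontier ordered as new sites are discovered, and a simple additive feedback term lets a form that users mark as relevant overtake one with a slightly higher initial score. Both are design choices of this sketch rather than details taken from the paper.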