International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 116 - Number 19
Year of Publication: 2015
Authors: Rishabh Dixit, Shiva Gupta, Rajkumar Singh Rathore, Shivesh Gupta
DOI: 10.5120/20445-2796
Rishabh Dixit, Shiva Gupta, Rajkumar Singh Rathore, Shivesh Gupta. A Novel Approach to Priority based Focused Crawler. International Journal of Computer Applications. 116, 19 (April 2015), 22-25. DOI=10.5120/20445-2796
The web continues to grow at an exponential rate, so fetching relevant information about a specific topic is gaining importance. Web crawlers are programs that traverse the web and fetch web documents in an automated manner. Focused crawlers search for a specific keyword in a web page. Link-based focused crawlers concentrate on the anchor links of a page and seek out the most relevant links without actually downloading the linked web pages. This paper is based on assigning priorities to different links so that the most relevant links are displayed to the user first. Insignificant links are avoided, which leads to significant savings in the computational cost of query processing as well as in network and hardware resources.
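The general idea of ranking anchor links by priority, without downloading the pages they point to, can be illustrated with a minimal sketch. The abstract does not state the paper's scoring scheme, so the code below is an assumption rather than the authors' method: the hypothetical `score_link` function simply counts keyword occurrences in the anchor text and URL, and the hypothetical `min_score` threshold stands in for the step that discards insignificant links.

```python
import heapq
from html.parser import HTMLParser


class AnchorCollector(HTMLParser):
    """Collects (href, anchor text) pairs from an already-fetched page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only accumulate text while inside an <a href=...> element.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def score_link(href, anchor_text, keywords):
    """Hypothetical relevance score (an assumption, not the paper's formula):
    number of keyword occurrences in the anchor text and the URL."""
    haystack = (anchor_text + " " + href).lower()
    return sum(haystack.count(k.lower()) for k in keywords)


def prioritize_links(html, keywords, min_score=1):
    """Return (href, score) pairs, most relevant first.

    Links scoring below min_score are dropped, so they are never queued
    for download. Ties keep their order of appearance on the page.
    """
    parser = AnchorCollector()
    parser.feed(html)

    heap = []
    for order, (href, text) in enumerate(parser.links):
        score = score_link(href, text, keywords)
        if score >= min_score:
            # heapq is a min-heap, so negate the score to pop the best first.
            heapq.heappush(heap, (-score, order, href))

    ranked = []
    while heap:
        neg_score, _, href = heapq.heappop(heap)
        ranked.append((href, -neg_score))
    return ranked


page = (
    '<a href="/crawler">focused crawler</a>'
    '<a href="/about">About us</a>'
    '<a href="/web">web crawler design</a>'
)
ranked = prioritize_links(page, ["crawler"])
```

Here `ranked` holds only the two links that mention the keyword, with `/crawler` ahead of `/web`; the `/about` link is never queued. A real crawler would likely replace the keyword count with a richer relevance measure, but the priority queue structure would stay the same.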