International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 94 - Number 12 |
Year of Publication: 2014 |
Authors: Gaurav Kumar Srivastav, Irphan Ali, Atul Kumar Srivastava |
10.5120/16392-6009 |
Gaurav Kumar Srivastav, Irphan Ali, Atul Kumar Srivastava . A Novel Technique for Spare Web Page Detection in Parallel Web Crawler. International Journal of Computer Applications. 94, 12 ( May 2014), 1-5. DOI=10.5120/16392-6009
The World Wide Web is increasing in the random rate of web pages and all web pages are rapidly updated about the need of user. Web search engine downloads web pages and the user cannot take the relevant update information for World Wide Web within short period of time. In this paper, we represent novel technique which helps in downloading the updated relevant web pages from World Wide Web. We will be implementing a new algorithm which can find out the update web page on World Wide Web. This algorithm compares the Content Weight of old web page content and downloaded update web page content. In this paper, we have also avoid the downloading of spare web pages from World Wide Web . This is a novel techniques improved the downloading rate of web pages and it is decreased the network bandwidth of web crawler by the help of parallel web crawler. This web detection technique will be downloaded the update web pages from World Wide Web and minimize the web browsing period of time.