We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

Timestamp based Recrawling Technique (TSBCT)

by Babita Ahuja, Neelu Chaudhary
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 45 - Number 22
Year of Publication: 2012
Authors: Babita Ahuja, Neelu Chaudhary
10.5120/7081-9533

Babita Ahuja, Neelu Chaudhary . Timestamp based Recrawling Technique (TSBCT). International Journal of Computer Applications. 45, 22 ( May 2012), 23-26. DOI=10.5120/7081-9533

@article{ 10.5120/7081-9533,
author = { Babita Ahuja, Neelu Chaudhary },
title = { Timestamp based Recrawling Technique (TSBCT) },
journal = { International Journal of Computer Applications },
issue_date = { May 2012 },
volume = { 45 },
number = { 22 },
month = { May },
year = { 2012 },
issn = { 0975-8887 },
pages = { 23-26 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume45/number22/7081-9533/ },
doi = { 10.5120/7081-9533 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:38:15.601828+05:30
%A Babita Ahuja
%A Neelu Chaudhary
%T Timestamp based Recrawling Technique (TSBCT)
%J International Journal of Computer Applications
%@ 0975-8887
%V 45
%N 22
%P 23-26
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In this era of digital tsunami of information on the web, everyone is completely dependent on the WWW for information retrieval. Most of the information is hidden behind the query interface. In the query interface the user types the keyword to access the web pages. These pages are known as the Hidden web, Invisible Web or Dark Web. Such kind of web pages cannot be indexed by the Search Engines. As these are not indexed by the search engines these pages cannot be returned and displayed to the users. This paper discusses the various reasons due of which they are not indexed by the search engines and the possible solutions for these reasons.

References
  1. Sriram Raghavan Hector Garcia-Molina Computer Science Department Stanford University Stanford, CA 94305, USA, "Crawling the HiddenWeb"
  2. Rosy Madaan / (IJCSE) International Journal on Computer Science and Engineering Vol. 02, No. 03, 2010, 753-758, "A Framework for Incremental Hidden Web Crawler"
  3. Ping Wu Ji-Rong Wen, Huan Liu, Wei-Ying Ma "Query Selection Techniques for Efficient Crawling of Structured Web Sources"
  4. Jian Qiu, Feng Shao, Misha Zatsman, Jayavel Index Structures for Querying the Deep Web, Workshop on the Web and Databases (WebDB), 2003, 79-86
  5. Ntoulas, A. , Zerfos, P. , Cho, J. Downloading Textual Hidden Web Content Through Keyword Queries. In Proceedings of the 5th ACM/IEEE Joint Conference on Digital Libraries (JCDL05). 2005.
  6. Chang, K; He, B; Zhang, Z. (2005). Toward Large Scale Integration: Building a MetaQuerier over Databases on the Web. CIDR, pp44-55
Index Terms

Computer Science
Information Sciences

Keywords

Hidden Web Search Engine Surface Webquery Interface