Context based Web Indexing for Storage of Relevant Web Pages

Nidhi Tyagi; Rahul Rishi; R. P. Agarwal

Call for Paper

May Edition

IJCA solicits high quality original research papers for the upcoming May edition of the journal. The last date of research paper submission is 20 April 2026

Submit your paper

Know more

The week's pick

Evaluating Text-to-Text Generation from LLMs: A Case Study and Scalable Framework

Ziqiao Ao Juhi Singh Sebastian Antinome

Random Articles

Reseach Article

Context based Web Indexing for Storage of Relevant Web Pages

by Nidhi Tyagi, Rahul Rishi, R. P. Agarwal

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 40 - Number 3

Year of Publication: 2012

Authors: Nidhi Tyagi, Rahul Rishi, R. P. Agarwal

10.5120/5021-7166

Nidhi Tyagi, Rahul Rishi, R. P. Agarwal . Context based Web Indexing for Storage of Relevant Web Pages. International Journal of Computer Applications. 40, 3 ( February 2012), 1-5. DOI=10.5120/5021-7166

@article{ 10.5120/5021-7166,

author = { Nidhi Tyagi, Rahul Rishi, R. P. Agarwal },

title = { Context based Web Indexing for Storage of Relevant Web Pages },

journal = { International Journal of Computer Applications },

issue_date = { February 2012 },

volume = { 40 },

number = { 3 },

month = { February },

year = { 2012 },

issn = { 0975-8887 },

pages = { 1-5 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume40/number3/5021-7166/ },

doi = { 10.5120/5021-7166 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T20:27:04.762842+05:30

%A Nidhi Tyagi

%A Rahul Rishi

%A R. P. Agarwal

%T Context based Web Indexing for Storage of Relevant Web Pages

%J International Journal of Computer Applications

%@ 0975-8887

%V 40

%N 3

%P 1-5

%D 2012

%I Foundation of Computer Science (FCS), NY, USA

Abstract

A focused crawler downloads web pages that are relevant to a user specified topic. The downloaded documents are indexed with a view to optimize speed and performance in finding relevant documents for a search query at the search engine side. However, the information will be more relevant if the context of the topic is also made available to the retrieval system. This paper proposes a technique for indexing the keyword extracted from the web documents along with their contexts wherein it uses a height balanced binary search (AVL) tree, for indexing purpose to enhance the performance of the retrieval system.

References

Diligenti M., Coetzee F.M., Lawrence S., Giles C.L.and Gori M., “Focused Crawling using context graphs”, Proc. International Conference on Very Large Databases (VLDB ’00), pp. 527-534, 2000.
Yang Yongsheng and Wang Hui, “Implementation of Focused Crawler”, COMP630D Course Project Report.
A.K.Sharma, “Data Structures using C”, Pearson publication, 2011.
Fabrizio Silvestri, Raffaele Perego and Salvatore Orlando ”Assigning Document Identifiers to Enhance Compressibility of Web Search Engines Indexes”. Proceedings of SAC, 2004.
Oren Zamir and Oren Etzioni “Web Document Clustering: A feasibility demonstration”. Proceedings of SIGIR, 1998.
Changshang Zhou, Wei Ding and Na Yang, “Double Indexing Mechanism of Search Engine based on Campus Net”, Proceedings of the 2006 IEEE Asia-Pacific Conference on Services Computing (APSCC'06), 2006.
Naresh Chauhan and A. K. Sharma,” Design of an Agent Based Context Driven Focused Crawler”,BVICAM’S International Journal of Information Technology, pp 61-66, 2008.
Parul Gupta and A.K.Sharma,” Context based Indexing in Search Engines using Ontology”, International Journal of Computer Applications, Volume 1 No. 14, pp 49-52, 2010.
Steve Lawrence, “Context in Web Search”, IEEE Data Engineering Bulletin, 2000.
Wang Jicheng, Huang Yuan, Wu Gangshan and Zhang Fuyan, “Web Mining: Knowledge Discovery on the Web”, IEEE International Conference, Tokyo, 1999.
O. Zamir, O. Etzioni, O. Madanim, and R.M. Karp “Fast and Intuitive Clustering of Web Documents,” Proceeding Third International Conference Knowledge Discovery and Data Mining, pp. 287-290, Aug. 1997.

Index Terms

Computer Science

Information Sciences

Keywords

AVL tree contextual repository balance factor