Research Article

A Survey of Link based Algorithms for Ranking and Similarity in Search Engines

Published on August 2011 by Sumit Chhabra, Mini Singh Ahuja, Pritika Mehra
National Technical Symposium on Advancements in Computing Technologies
Foundation of Computer Science USA
NTSACT - Number 3
August 2011
Authors: Sumit Chhabra, Mini Singh Ahuja, Pritika Mehra

Sumit Chhabra, Mini Singh Ahuja, Pritika Mehra. A Survey of Link based Algorithms for Ranking and Similarity in Search Engines. National Technical Symposium on Advancements in Computing Technologies. NTSACT, 3 (August 2011), 15-20.

@article{
author = { Sumit Chhabra and Mini Singh Ahuja and Pritika Mehra },
title = { A Survey of Link based Algorithms for Ranking and Similarity in Search Engines },
journal = { National Technical Symposium on Advancements in Computing Technologies },
issue_date = { August 2011 },
volume = { NTSACT },
number = { 3 },
month = { August },
year = { 2011 },
issn = { 0975-8887 },
pages = { 15-20 },
numpages = 6,
url = { /proceedings/ntsact/number3/3200-ntst021/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 National Technical Symposium on Advancements in Computing Technologies
%A Sumit Chhabra
%A Mini Singh Ahuja
%A Pritika Mehra
%T A Survey of Link based Algorithms for Ranking and Similarity in Search Engines
%J National Technical Symposium on Advancements in Computing Technologies
%@ 0975-8887
%V NTSACT
%N 3
%P 15-20
%D 2011
%I International Journal of Computer Applications
Abstract

The main goal of information retrieval is to find the documents relevant to a user query. Before the Web came into existence, the retrieval algorithms in information retrieval systems were usually based on analysis of the text in the documents, but the Web changed that. With the emergence of the Web came the concepts of hypertext and hyperlinks. Typically, a link between two pages implies either that the content of the linked page is good or that the two pages may be similar. Link analysis therefore plays an important role in search engines. Search engines use it to decide which web pages to add to the collection of documents (i.e., which pages to crawl), to order the documents matching a user query (i.e., how to rank pages), and to find the degree of similarity between web pages, among other tasks. This paper provides a literature survey of existing link based algorithms for ranking web pages and for finding pages similar to a given page in search engines.
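The two uses of links described above, ranking pages and measuring similarity, can be illustrated with a minimal sketch. The abstract itself does not give any algorithm details, so everything below is an assumption for illustration: the toy graph `links`, the function names `pagerank` and `cocitation`, the damping value 0.85, and the fixed iteration count are all hypothetical choices, and the code is a simplified power-iteration form of Page Rank plus a plain co-citation count, not the formulations surveyed in the paper.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified Page Rank by power iteration (illustrative sketch).

    links: dict mapping each page to the list of pages it links to.
    damping and iterations are assumed values, not taken from the paper.
    """
    pages = set(links) | {q for targets in links.values() for q in targets}
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page receives a small baseline share of rank.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for p in pages:
            targets = links.get(p, [])
            if targets:
                # A page splits its rank evenly among its outlinks.
                share = damping * rank[p] / len(targets)
                for q in targets:
                    new_rank[q] += share
            else:
                # A page with no outlinks spreads its rank over all pages.
                share = damping * rank[p] / n
                for q in pages:
                    new_rank[q] += share
        rank = new_rank
    return rank


def cocitation(links, a, b):
    """Count the pages that link to both a and b (co-citation degree)."""
    return sum(1 for targets in links.values() if a in targets and b in targets)


# Hypothetical four-page web graph: page -> pages it links to.
links = {"A": ["B", "C"], "B": ["C"], "C": ["A"], "D": ["B", "C"]}

ranks = pagerank(links)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 3))
print("pages citing both B and C:", cocitation(links, "B", "C"))
```

In this toy graph, the page with the most inlinks ends up ranked highest, and two pages that are linked from the same sources get a nonzero co-citation count, which is the intuition behind treating inlinks as endorsements and shared inlinks as evidence of similarity.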

References
  1. R. Kosala and H. Blockeel, "Web Mining Research: A Survey", ACM SIGKDD Explorations, 2(1), pp. 1–15, 2000.
  2. S. Brin and L. Page, "The Anatomy of a Large-Scale Hypertextual Web Search Engine", Proceedings of the 7th International Conference on World Wide Web (WWW), Brisbane, Australia, pp. 107–117, 1998.
  3. J. Kleinberg, "Authoritative sources in a hyperlinked environment", Journal of the ACM (JACM), pp. 604–632, 1999.
  4. R. Lempel and S. Moran, "The stochastic approach for link-structure analysis (SALSA) and the TKC effect", Proceedings of the 9th International Conference on World Wide Web (WWW), pp. 387–401, 2000.
  5. A. Borodin, G.O. Roberts, J.S. Rosenthal, and P. Tsaparas, "Finding authorities and hubs from link structures on the World Wide Web", Proceedings of the 10th International World Wide Web Conference, Hong Kong, pp. 415–429, ACM, May 2001.
  6. H. Small, "Co-citation in Scientific Literature - New Measure of Relationship Between 2 Documents", Journal of the American Society for Information Science, pp. 265–269, 1973.
  7. M.M. Kessler, "Bibliographic Coupling Between Scientific Papers", American Documentation, pp. 10–25, 1963.
  8. J. Dean and M. Henzinger, "Finding related pages in the world wide web", Computer Networks: International Journal of Computer Telecommunication Network, pp. 1467–1479, 1999.
  9. K. Bharat and M. Henzinger, "Improved algorithms for topic distillation in hyperlinked environments", Proceedings of the 21st International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR'98), pp. 104–111, 1998.
  10. S. Brin and L. Page, "The Anatomy of a Large-Scale Hypertextual Web Search Engine", Computer Networks and ISDN Systems Journal, pp. 107–117, 1998.
Index Terms

Computer Science
Information Sciences

Keywords

Web Mining, Inlink, Outlink, Page Rank, Hub & Authority weights, Citation