We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

Web Search Engines: Mining Right Information

Published on May 2012 by Naveen, Dharmender Kumar
National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011
Foundation of Computer Science USA
RTMC - Number 2
May 2012
Authors: Naveen, Dharmender Kumar
8d7d37db-7981-4363-942b-cc1b51dda0f7

Naveen, Dharmender Kumar . Web Search Engines: Mining Right Information. National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011. RTMC, 2 (May 2012), 25-27.

@article{
author = { Naveen, Dharmender Kumar },
title = { Web Search Engines: Mining Right Information },
journal = { National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011 },
issue_date = { May 2012 },
volume = { RTMC },
number = { 2 },
month = { May },
year = { 2012 },
issn = 0975-8887,
pages = { 25-27 },
numpages = 3,
url = { /proceedings/rtmc/number2/6632-1015/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011
%A Naveen
%A Dharmender Kumar
%T Web Search Engines: Mining Right Information
%J National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011
%@ 0975-8887
%V RTMC
%N 2
%P 25-27
%D 2012
%I International Journal of Computer Applications
Abstract

A Web Search Engine maintains and catalogs the content of Web pages in order to make them easier to find and browse. There are many Search Engines which are similar, differentiates from the other by the methods for scouring, storing, and retrieving information from the Web. Usually Search Engines search through Web pages for specified keywords, in response they return a list of containing specified keywords documents. After finding the list of specified keywords documents, list is sorted by relevance criteria which try to put at the very first positions the documents that best match the user's query. The usefulness of a search engine to most people is based on the relevance of results it retrieves from the web. This paper tries to address some issues regarding some of the major challenges faced by Search Engines, since the size of the Web is rapidly growing.

References
  1. C. J. Van Rijsbergen. Information Retrieval. Butterworths. Available at http://www. dcs. gla. ac. uk/Keith/Preface. html
  2. Oliver A. McBryan. GENVL and WWWW: Tools for taming the Web. In Proceedings of the First International World Wide Web Conference, Geneva, Switzerland, May 1994.
  3. Steve Lawrence and C. Lee Giles. Accessibility of information on the Web Nature, 400:107-109, July 1999.
  4. Roy T. Fielding, Jim Gettys, Je_rey C. Mogul, Henrik Frystyk, L. Masinter, P. Leach, and Tim Berners-Lee. Hypertext Transfer Protocol HTTP/1. 1. RFC 2616, http://ftp. isi. edu/in-notes/rfc2616. txt, June 1999.
  5. BRIN, S. , AND PAGE, L. The anatomy of a large-scale hypertextual web search engine. In Proceedings of WWW7 (Brisbane, Australia, May 1998). http://www7. scu. edu. au/programme/fullpapers/1921/com1921. htm.
  6. HEYDON, A. , AND NAJORK, M. Mercator: A Scalable, Extensible Web Crawler. World Wide Web Journal (December 1999), 219 – 229. http://www. research. digital. com/SRC/mercator/.
  7. CHO, J. , GARC´I A-MOLINA, H. , AND PAGE, L. Efficient crawling through URL ordering. Computer Networks and ISDN Systems 30, 1–7 (1998), 161–172.
  8. WITTEN, I. H. , BELL, T. C. , AND MOFFAT, A. Managing Gigabytes: Compressing and Indexing Documents and Images. John Wiley & Sons, Inc. , 1999.
  9. Zamir, O. , Etzioni, O. 1998. Web document clustering: a feasibility demonstration. Proc. of SIGIR '98, Melbourne, Appendix-Questionnaire, pp. 46-54
Index Terms

Computer Science
Information Sciences

Keywords

Web Search Engine Clustering Crawler Hyper Text Transfer Protocol