Research Article

Integrated Searching Technique for IR from Web Repository

by Yagnesh Dave, Bijendra S. Agrawal
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 126 - Number 12
Year of Publication: 2015
Authors: Yagnesh Dave, Bijendra S. Agrawal
10.5120/ijca2015906274

Yagnesh Dave, Bijendra S. Agrawal. Integrated Searching Technique for IR from Web Repository. International Journal of Computer Applications. 126, 12 (September 2015), 36-42. DOI=10.5120/ijca2015906274

@article{ 10.5120/ijca2015906274,
author = { Yagnesh Dave and Bijendra S. Agrawal },
title = { Integrated Searching Technique for IR from Web Repository },
journal = { International Journal of Computer Applications },
issue_date = { September 2015 },
volume = { 126 },
number = { 12 },
month = { September },
year = { 2015 },
issn = { 0975-8887 },
pages = { 36-42 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume126/number12/22606-2015906274/ },
doi = { 10.5120/ijca2015906274 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:17:17.902608+05:30
%A Yagnesh Dave
%A Bijendra S. Agrawal
%T Integrated Searching Technique for IR from Web Repository
%J International Journal of Computer Applications
%@ 0975-8887
%V 126
%N 12
%P 36-42
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In web data repositories, the volume of unstructured text and hypertext data is increasing exponentially relative to that of well-organized structured data. This growth makes it harder to access data efficiently, both when processing it to produce information and when extracting patterns through web mining. The need for effective and efficient retrieval, and for mining hidden patterns in large volumes of unstructured and hypertext data, has opened a line of research in web data mining. This work combines exploratory research with experimental justification to meet its research targets. It begins with a literature review of web data mining and of information retrieval techniques and their models. It then proposes an integrated searching technique for retrieving information from a web data repository. The proposed model, Amalgamate Web Search Methodology (AWSM), improves Information Retrieval performance by integrating Exact, Relative and Adaptive search.

Index Terms

Computer Science
Information Sciences

Keywords

Web Data Mining, Exact Search, Relative Search, Adaptive Search, HITS, PR, TM, CD, VSM