Research Article

Optimization of User Query for Improving Document Retrieval Performance

by Nidhi Bhandari, Rachna Navalakhe, G.L. Prajapati
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 184 - Number 2
Year of Publication: 2022
Authors: Nidhi Bhandari, Rachna Navalakhe, G.L. Prajapati
10.5120/ijca2022921970

Nidhi Bhandari, Rachna Navalakhe, G.L. Prajapati. Optimization of User Query for Improving Document Retrieval Performance. International Journal of Computer Applications. 184, 2 (Mar 2022), 14-19. DOI=10.5120/ijca2022921970

@article{ 10.5120/ijca2022921970,
author = { Nidhi Bhandari and Rachna Navalakhe and G.L. Prajapati },
title = { Optimization of User Query for Improving Document Retrieval Performance },
journal = { International Journal of Computer Applications },
issue_date = { Mar 2022 },
volume = { 184 },
number = { 2 },
month = { Mar },
year = { 2022 },
issn = { 0975-8887 },
pages = { 14-19 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume184/number2/32304-2022921970/ },
doi = { 10.5120/ijca2022921970 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T01:20:23.943489+05:30
%A Nidhi Bhandari
%A Rachna Navalakhe
%A G.L. Prajapati
%T Optimization of User Query for Improving Document Retrieval Performance
%J International Journal of Computer Applications
%@ 0975-8887
%V 184
%N 2
%P 14-19
%D 2022
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Processing unstructured data and retrieving accurate information with information retrieval (IR) models is a complex task. Classical techniques improve IR models through concepts such as categorization and classification. This paper first reviews different document retrieval techniques and then presents an extension of a previously introduced version. As in the traditional model, the technique first preprocesses the data and extracts features. The extracted features are then organized into tuples, and the fuzzy c-means algorithm clusters these tuples into domains according to their features. This clustering reduces the search time of the proposed model. In addition, to prevent inappropriate query submission, a new query generation and optimization step is proposed. Results on different datasets show that the proposed IR model improves performance in terms of efficiency and accuracy.
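
The pipeline summarized in the abstract (preprocess, extract features into tuples, cluster with fuzzy c-means) can be illustrated with a minimal sketch. The abstract does not specify the feature set or parameters, so everything here is an assumption made for illustration: the term-count features, the helper names `to_feature_tuple` and `fuzzy_c_means`, the fuzzifier `m = 2`, and the toy vocabulary. It is not the authors' implementation.

```python
import math
import random


def to_feature_tuple(text, vocab):
    """Hypothetical feature extractor: lowercase, split, count vocabulary terms."""
    tokens = text.lower().split()
    return tuple(float(tokens.count(term)) for term in vocab)


def fuzzy_c_means(points, c=2, m=2.0, iters=100, tol=1e-6, seed=0):
    """Standard fuzzy c-means on feature tuples; returns (centers, memberships)."""
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])
    # Random initial membership matrix; each row is normalized to sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(c)]
        total = sum(row)
        u.append([v / total for v in row])
    centers = []
    for _ in range(iters):
        # Center update: mean of the points weighted by membership**m.
        centers = []
        for j in range(c):
            w = [u[i][j] ** m for i in range(n)]
            total = sum(w)
            centers.append(tuple(
                sum(w[i] * points[i][d] for i in range(n)) / total
                for d in range(dim)
            ))
        # Membership update from relative distances to every center.
        shift = 0.0
        for i in range(n):
            # Clamp distances so a point sitting on a center does not divide by zero.
            dists = [max(math.dist(points[i], centers[j]), 1e-12) for j in range(c)]
            new_row = [
                1.0 / sum((dists[j] / dists[k]) ** (2.0 / (m - 1.0)) for k in range(c))
                for j in range(c)
            ]
            shift = max(shift, max(abs(a - b) for a, b in zip(new_row, u[i])))
            u[i] = new_row
        if shift < tol:  # stop once memberships have stabilized
            break
    return centers, u


if __name__ == "__main__":
    # Toy vocabulary and documents, invented for this example only.
    vocab = ["heart", "disease", "treatment", "query", "index", "search"]
    docs = [
        "heart disease heart treatment",
        "disease treatment heart",
        "query index search",
        "search index query ranking",
    ]
    tuples = [to_feature_tuple(d, vocab) for d in docs]
    centers, memberships = fuzzy_c_means(tuples, c=2)
    for doc, row in zip(docs, memberships):
        print(doc, [round(v, 3) for v in row])
```

Each document receives a degree of membership in every cluster rather than a single hard label. A search model could use this to restrict a query to the cluster (domain) with the highest membership, which is one plausible reading of how clustering reduces search time.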

Index Terms

Computer Science
Information Sciences

Keywords

Text mining, Query optimization, Semantic knowledge, Information retrieval, c-means clustering