CFP last date
20 January 2025
Reseach Article

Natural Language Processing and Natural Language Understanding Techniques for Intelligent Search

by Shrishti Shiva, Mohamed El-Dosuky, Sherif Kamel
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 11
Year of Publication: 2024
Authors: Shrishti Shiva, Mohamed El-Dosuky, Sherif Kamel
10.5120/ijca2024923468

Shrishti Shiva, Mohamed El-Dosuky, Sherif Kamel . Natural Language Processing and Natural Language Understanding Techniques for Intelligent Search. International Journal of Computer Applications. 186, 11 ( Mar 2024), 39-45. DOI=10.5120/ijca2024923468

@article{ 10.5120/ijca2024923468,
author = { Shrishti Shiva, Mohamed El-Dosuky, Sherif Kamel },
title = { Natural Language Processing and Natural Language Understanding Techniques for Intelligent Search },
journal = { International Journal of Computer Applications },
issue_date = { Mar 2024 },
volume = { 186 },
number = { 11 },
month = { Mar },
year = { 2024 },
issn = { 0975-8887 },
pages = { 39-45 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number11/natural-language-processing-and-natural-language-understanding-techniques-for-intelligent-search/ },
doi = { 10.5120/ijca2024923468 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-03-23T00:18:05.472837+05:30
%A Shrishti Shiva
%A Mohamed El-Dosuky
%A Sherif Kamel
%T Natural Language Processing and Natural Language Understanding Techniques for Intelligent Search
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 11
%P 39-45
%D 2024
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper presents an intelligent text retrieval and ranking system leveraging advanced NLP and NLU techniques, including word embeddings and cosine similarity. The system incorporates an LSTM language model to generate document embeddings from preprocessed text documents, facilitating accurate document-query matching. Experimental evaluation demonstrates the system's efficacy, achieving an average accuracy of 0.75 on the test set. The use of cosine similarity further supports the system's ability to rank documents meaningfully. However, potential overfitting concerns necessitate an exploration of regularization techniques to improve generalization. The proposed intelligent system finds practical applications in search engines and recommendation systems, delivering contextually relevant content to users.

References
  1. W. Yang, H. Zhao, M. Wang and J. Ji, "Design of Intelligent Search Engine Service Performance Evaluation System," 2020 5th Asia-Pacific Conference on Intelligent Robot Systems (ACIRS), Singapore, 2020, pp. 86-91, doi: 10.1109/ACIRS49895.2020.9162611.
  2. X. Wu, Y. Tang, C. Zhou, G. Zhu, J. Song and G. Liu, "An Intelligent Search Engine Based on Knowledge Graph for Power Equipment Management," 2022 5th International Conference on Energy, Electrical and Power Engineering (CEEPE), Chongqing, China, 2022, pp. 370-374, doi: 10.1109/CEEPE55110.2022.9783291.
  3. H. Zhao, D. Wang, M. He, Y. Chen, J. Li and Y. You, "An Intelligent Method For Extracting Hotspot Events in News Bulletin," 2021 7th International Conference on Big Data and Information Analytics (BigDIA), Chongqing, China, 2021, pp. 143-148, doi: 10.1109/BigDIA53151.2021.9619646.
  4. L. Shukla, J. N. Singh, P. Johri and A. Kumar, "Artificial Intelligence in Information Retrieval," 2022 4th International Conference on Advances in Computing, Communication Control and Networking (ICAC3N), Greater Noida, India, 2022, pp. 1-5, doi: 10.1109/ICAC3N56670.2022.10074291.
  5. H. Zhao, D. Wang, M. He, Y. Chen, J. Li and Y. You, "An Intelligent Method For Extracting Hotspot Events in News Bulletin," 2021 7th International Conference on Big Data and Information Analytics (BigDIA), Chongqing, China, 2021, pp. 143-148, doi: 10.1109/BigDIA53151.2021.9619646.
  6. D. Singh, K. R. Suraksha and S. J. Nirmala, "Question Answering Chatbot using Deep Learning with NLP," 2021 IEEE International Conference on Electronics, Computing and Communication Technologies (CONECCT), Bangalore, India, 2021, pp. 1-6, doi: 10.1109/CONECCT52877.2021.9622709.
  7. H. -Y. Lin, T. -S. Moh and B. Westlake, "Gun Violence News Information Retrieval using BERT as Sequence Tagging Task," 2021 IEEE International Conference on Big Data (Big Data), Orlando, FL, USA, 2021, pp. 2525-2531, doi: 10.1109/BigData52589.2021.9671919.
  8. Mikolov, T., Chen, K., Corrado, G., & Dean, J. (2013). Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781.
  9. Miller, G. A. (1995). WordNet: A lexical database for English. Communications of the ACM, 38(11), 39-41.
  10. M. Polignano, F. Narducci, A. Iovine, C. Musto, M. De Gemmis and G. Semeraro, "HealthAssistantBot: A Personal Health Assistant for the Italian Language," in IEEE Access, vol. 8, pp. 107479-107497, 2020, doi: 10.1109/ACCESS.2020.3000815.
  11. R. Ahamad and K. N. Mishra, "Sentiment Analysis of Handwritten and Text Statement for Emotion Classification using Intelligent Techniques: A Novel Approach," 2023 International Conference on Computational Intelligence and Knowledge Economy (ICCIKE), Dubai, United Arab Emirates, 2023, pp. 414-419, doi: 10.1109/ICCIKE58312.2023.10131894.
  12. Kanev, V. Terekhov, M. Kochneva, V. Chernenky and M. Skvortsova, "Hybrid Intelligent System of Crisis Assessment using Natural Language Processing and Metagraph Knowledge Base," 2021 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (ElConRus), St. Petersburg, Moscow, Russia, 2021, pp. 2099-2103, doi: 10.1109/ElConRus51938.2021.9396100.
  13. M. Alargrami and M. M. Eljazzar, "Imam: Word Embedding Model for Islamic Arabic NLP," 2020 2nd Novel Intelligent and Leading Emerging Sciences Conference (NILES), Giza, Egypt, 2020, pp. 520-524, doi: 10.1109/NILES50944.2020.9257931.
  14. K. Zheng, N. Lin and S. Jiang, "Unsupervised Character Embedding Correction and Candidate Word Denoising," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 30, pp. 76-86, 2022, doi: 10.1109/TASLP.2021.3129334.
  15. S. Koepke, A. -M. Oncescu, J. F. Henriques, Z. Akata and S. Albanie, "Audio Retrieval With Natural Language Queries: A Benchmark Study," in IEEE Transactions on Multimedia, vol. 25, pp. 2675-2685, 2023, doi: 10.1109/TMM.2022.3149712.
  16. Y. Yang, X. Liu and R. H. Deng, "Multi-User Multi-Keyword Rank Search Over Encrypted Data in Arbitrary Language," in IEEE Transactions on Dependable and Secure Computing, vol. 17, no. 2, pp. 320-334, 1 March-April 2020, doi: 10.1109/TDSC.2017.2787588.
  17. P. Duraisamy, M. Duraisamy, M. Periyanayaki and Y. Natarajan, "Predicting Disaster Tweets using Enhanced BERT Model," 2023 7th International Conference on Intelligent Computing and Control Systems (ICICCS), Madurai, India, 2023, pp. 1745-1749, doi: 10.1109/ICICCS56967.2023.10142660.
  18. O. -G. Ene, M. -D. Sirbu, M. Dascalu, S. Trausan-Matu and A. C. Nuta, "PIAM - Intelligent Platform for Retrieving Relevant Information on Drugs Marketed in Romania," 2019 22nd International Conference on Control Systems and Computer Science (CSCS), Bucharest, Romania, 2019, pp. 420-425, doi: 10.1109/CSCS.2019.00077
Index Terms

Computer Science
Information Sciences
Natural language processing
Natural language understanding
search

Keywords

Natural language processing Natural language understanding search