CFP last date
21 April 2025
Reseach Article

Novel Processing for Stop Words

by Aeesha S. Shaheen, Hadia Salih, Amera Ismail Melhum
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 186 - Number 72
Year of Publication: 2025
Authors: Aeesha S. Shaheen, Hadia Salih, Amera Ismail Melhum
10.5120/ijca2025924514

Aeesha S. Shaheen, Hadia Salih, Amera Ismail Melhum . Novel Processing for Stop Words. International Journal of Computer Applications. 186, 72 ( Mar 2025), 14-18. DOI=10.5120/ijca2025924514

@article{ 10.5120/ijca2025924514,
author = { Aeesha S. Shaheen, Hadia Salih, Amera Ismail Melhum },
title = { Novel Processing for Stop Words },
journal = { International Journal of Computer Applications },
issue_date = { Mar 2025 },
volume = { 186 },
number = { 72 },
month = { Mar },
year = { 2025 },
issn = { 0975-8887 },
pages = { 14-18 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume186/number72/novel-processing-for-stop-words/ },
doi = { 10.5120/ijca2025924514 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2025-03-25T22:41:27.347691+05:30
%A Aeesha S. Shaheen
%A Hadia Salih
%A Amera Ismail Melhum
%T Novel Processing for Stop Words
%J International Journal of Computer Applications
%@ 0975-8887
%V 186
%N 72
%P 14-18
%D 2025
%I Foundation of Computer Science (FCS), NY, USA
Abstract

One of the most difficult tasks that face scholars in a branch of linguistics known as Natural Language (NLP) is how to deal with stop words – those ubiquitous words which add little to the meaning of a text yet tend to be in it. In the most traditional approaches, these are even deleted in order to facilitate analyses of the contents. Some of these explain how these unwanted ‘filler’ words should be applied using a novel concept of color coding them. Rather than only substituting the ‘cut-out’ stop word with the appropriate symbol, we replace the symbol with a colored one representing that specific stop word. This revolutionary technique, allows improved visual of text and opens new avenues in text condensation, affective evaluation and even keywords selection. Thanks to the standard distribution of color-coded stop words, it is possible to illustrate visually how each stop word is distributed.

References
  1. Bird, S., Klein, E., & Loper, E. (2009). “Natural Language Processing with Python”. O'Reilly Media.
  2. Blanchard A. (2007). “Understanding and customizing stop word lists for enhanced patent mapping”. World Pat Inf.; 29: 308–316. https://doi.org/10.1016/j.wpi.2007.02.002.
  3. Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2019). BERT: “Pre-training of Deep Bidirectional Transformers for Language Understanding”.
  4. Garg, N., Schiebinger, D., Jurafsky, D., & Zou, J. (2019). “Word Embeddings quantify 100 Years of Gender and Ethnic Stereotypes”.
  5. Jurafsky, D., & Martin, J. H. (2019). “Speech and Language Processing” (3rd ed.).
  6. Kirti, A., Kumar, P., & Choudhury, M. (2020). “Comparative Analysis of Stop Word Removal Techniques for Text Classification in Bengali”.
  7. Lee, J., & Kang, J. (2020). “Investigating the Impact of Stop Words on the Performance of Neural Text Generation”.
  8. Luo, X., Zhang, R., Li, L., & Wu, X. (2020). “Revisiting Stop Word Removal for Generic Text Summarization”.
  9. Manning, C. D., Raghavan, P., & Schütze, H. (2008). “Introduction to Information Retrieval”.
  10. Pečar, J., Kukar, M., Lavrač, N., & Juvan, P. (2021).” Stop Words for Text Classification in Slovenian”.
  11. Serhad Sarica, Jianxi Luo, June 2020, “Stop words in Technical Language Processing”.
  12. Senem Kumova Metin, Bahar Karaoğlan, (June 2017). “STOP WORD DETECTION AS A BINARY CLASSIFICATION PROBLEM”.
  13. Yadollahi, M., Azzopardi, L., & Crestani, F. (2021). “Improving Query Translation with the Stop Word Lists”.
  14. Wu, S., Zhang, Y., Yang, F., Li, Z., Wang, Y., & Yu, Y. (2020). “TextRank: Bringing Order into Texts”.
Index Terms

Computer Science
Information Sciences

Keywords

Stop wors natural language processing removal stop words color-coding text retravel