We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 November 2024
Reseach Article

Application of Natural Language Processing Tools in Stemming

by B. P. Pande, Prof. H. S. Dhami
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 27 - Number 6
Year of Publication: 2011
Authors: B. P. Pande, Prof. H. S. Dhami
10.5120/3302-4530

B. P. Pande, Prof. H. S. Dhami . Application of Natural Language Processing Tools in Stemming. International Journal of Computer Applications. 27, 6 ( August 2011), 14-19. DOI=10.5120/3302-4530

@article{ 10.5120/3302-4530,
author = { B. P. Pande, Prof. H. S. Dhami },
title = { Application of Natural Language Processing Tools in Stemming },
journal = { International Journal of Computer Applications },
issue_date = { August 2011 },
volume = { 27 },
number = { 6 },
month = { August },
year = { 2011 },
issn = { 0975-8887 },
pages = { 14-19 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume27/number6/3306-4530/ },
doi = { 10.5120/3302-4530 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:13:34.653438+05:30
%A B. P. Pande
%A Prof. H. S. Dhami
%T Application of Natural Language Processing Tools in Stemming
%J International Journal of Computer Applications
%@ 0975-8887
%V 27
%N 6
%P 14-19
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In the present work an innovative attempt is being made to develop a novel conflation method that exploits the phonetic quality of words and uses some standard NLP tools like LD (Levenshtein Distance) and LCS (Longest Common Subsequence) for Stemming process.

References
  1. Araujo Lourdes, Zaragoza Hugo, Pérez-Agüera Jose R., Pérez-Iglesias Joaquín (2010) Structure of morphologically expanded queries: A genetic algorithm approach, Data & Knowledge Engineering, Volume 69, Issue 3, 279-289
  2. Binstock A. and Rex J. (1995) Practical Algorithms for Programmers, Addison-Wesley, Reading, Mass., 158-160
  3. Cormen T. T. , Leiserson C. E. , Rivest R. L. (1990) Introduction to algorithms, MIT Press, Cambridge, MA
  4. English Joshua S. (2005) English Stemming Algorithm, Pragmatic Solutions, Inc., 1-3
  5. Hafer M., and S. Weiss (1974) Word Segmentation by Letter Successor Varieties, Information Storage and Retrieval, 10, 371-85.
  6. Harman Donna (1991) How effective is suffixing?, Journal of the American Society for Information Science, 42, 7-15
  7. Krovetz Robert (1993) Viewing morphology as an inference process, Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval, 191-202
  8. Lovins, J. B. (1968) Development of a stemming algorithm, Mechanical Translation and Computational Linguistics, 11, 22-31
  9. Mayfield James and McNamee Paul (2003) Single N-gram stemming, Proceedings of the 26th annual international ACM SIGIR conference on Research and development in information retrieval, 415-416
  10. Majumder Prasenjit, Mitra Mandar, Parui Swapan K., Kole Gobinda, Mitra Pabitra and Datta Kalyankumar (2007) YASS: Yet another suffix stripper, ACM Transactions on Information Systems. Volume 25, Issue 4, Article No. 18
  11. Melucci Massimo and Orio Nicola (2003) A novel method for stemmer generation based on hidden Markov models, Proceedings of the twelfth international conference on Information and knowledge management, 131-138
  12. Odell K. M. and Russell R. C., Soundex phonetic comparison system [cf.U.S. Patents 1261167 (1918), 1435663 (1922)].
  13. Paice Chris D (1990) Another stemmer, ACM SIGIR Forum, Volume 24, No. 3. 56-61.
  14. Porter M.F (1980) An algorithm for suffix stripping, Program 14, 130-137
  15. Šnajder J. , Bašić B. Dalbelo, Tadić M. (2008) Automatic acquisition of inflectional lexica for morphological normalization, Information Processing & Management, Volume 44, Issue 5, 1720-1731
  16. Singh Brijesh Shanker (2003) Search Algorithms, DRTC Workshop on Digital Libraries: Theory and Practice, Paper: E
  17. Tamah Eiman , Shammari-Al. (2008) Towards an error free stemming, IADIS European Conference Data Mining, 160-163
  18. UzZaman Naushad and Khan Mumit, (2005) T12: An Advanced Text Input System with Phonetic Support for Mobile Devices, 2nd International Conference on Mobile Technology, Applications and Systems, 1-7.
  19. Xu Jinxi and Croft Bruce W (1998) Corpus-based stemming using co-occurrence of word variants, ACM Transactions on Information Systems. Volume 16 (1)1, 61-81.
Index Terms

Computer Science
Information Sciences

Keywords

Phonetic based stem generation system Natural Language Processing Tools