CFP last date
20 January 2025
Call for Paper
February Edition
IJCA solicits high quality original research papers for the upcoming February edition of the journal. The last date of research paper submission is 20 January 2025

Submit your paper
Know more
Reseach Article

A Survey on Techniques in NLP

by Nihar Ranjan, Kaushal Mundada, Kunal Phaltane, Saim Ahmad
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 134 - Number 8
Year of Publication: 2016
Authors: Nihar Ranjan, Kaushal Mundada, Kunal Phaltane, Saim Ahmad
10.5120/ijca2016907355

Nihar Ranjan, Kaushal Mundada, Kunal Phaltane, Saim Ahmad . A Survey on Techniques in NLP. International Journal of Computer Applications. 134, 8 ( January 2016), 6-9. DOI=10.5120/ijca2016907355

@article{ 10.5120/ijca2016907355,
author = { Nihar Ranjan, Kaushal Mundada, Kunal Phaltane, Saim Ahmad },
title = { A Survey on Techniques in NLP },
journal = { International Journal of Computer Applications },
issue_date = { January 2016 },
volume = { 134 },
number = { 8 },
month = { January },
year = { 2016 },
issn = { 0975-8887 },
pages = { 6-9 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume134/number8/23932-2016907355/ },
doi = { 10.5120/ijca2016907355 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:33:36.086392+05:30
%A Nihar Ranjan
%A Kaushal Mundada
%A Kunal Phaltane
%A Saim Ahmad
%T A Survey on Techniques in NLP
%J International Journal of Computer Applications
%@ 0975-8887
%V 134
%N 8
%P 6-9
%D 2016
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The field of natural language processing (aka NLP) is an intersection of the study of linguistics, computation and statistics. The primary goal of NLP is automated understanding of the semi-structured language that humans use. This study stems application in diverse fields like semantic analysis, summarization, text classification and the like. The domain natural language processing is a fledgling domain with no concrete indication of when it will mature. Compared to well established domains like “Study of Algorithms”, NLP is yet in its emerging period and hence there’s dearth of a concise piece of literature that elaborates on the phases of NLP and lists different techniques that can be adapted. NLP borrows heavily from foundational subjects of study like statistics, probability theory and theory of computation. In this paper, we describe three phases of natural language processing namely, language modelling, parts-of-speech tagging and parsing, outlining the approaches used that can be used.

References
  1. Adwait Ratnaparkhi, A Maximum Entropy Model for Part-Of-Speech Tagging
  2. D Jurafsky, JH Martin, Speech and Language Processing.
  3. Michael Collins, Head-Driven Statistical Models for Natural Language Parsing
  4. Bill Wilson, University of New South Wales.
  5. Roni Rosenfeld, Two decades of statistical language modeling: where do we go from here?
  6. Stanley F. Chen, Joshua Goodman, An Empirical Study of Smoothing Techniques for Language Modeling. Proceedings of the 34th Annual Meeting of the ACL, June 1996
  7. Nidhi Adhvaryu, Prem Balani, Survey: Part-Of-Speech Tagging in NLP, International Journal of Research in Advent Technology (E-ISSN: 2321-9637)
  8. Dinesh Kumar, Gurpreet Singh Josan,” Part of Speech Taggers for Morphologically Rich Indian Languages: A Survey”, International Journal of Computer Applications Volume 6–No.5, September 2010, pp. 1-9
  9. Manish Shrivastava and Pushpak Bhattacharyya, Hindi POS Tagger Using Naive Stemming: Harnessing Morphological Information Without Extensive Linguistic Knowledge, International Conference on NLP (ICON08), Pune, India, December, 2008
  10. PVS Avinesh, G Karthik, ”Part-Of-Speech Tagging and Chunking using Conditional Random Fields and Transformation Based Learning” in the proceedings of NLPAI Contest, 2006
  11. Antony P.J, Santhanu P Mohan, Soman K.P,”SVM Based Part of Speech Tagger for Malayalam”, IEEE International Conference on Recent Trends in Information, Telecommunication and Computing, pp. 339-341, 2010
  12. Agarwal Himashu, Amni Anirudh,” Part of Speech Tagging and Chunking with Conditional Random Fields” in the proceedings of NLPAI Contest, 2006
  13. Brants, TnT – A statistical part-of-speech tagger. In Proc. of the 6th Applied NLP Conference, pp. 224-231, 2000
  14. Cutting, J. Kupiec, J. Pederson and P. Sibun, A practical partof-speech tagger. In Proc. of the 3rd Conference on Applied NLP, pp. 133-140, 1992
  15. Sumam Mary Idicula and Peter S David, A Morphological processor for Malayalam Language, South Asia Research, SAGE Publications, 2007
Index Terms

Computer Science
Information Sciences

Keywords

NLP Language Modelling Parsing POS tagging HMM