International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 19 - Number 9
Year of Publication: 2011
Authors: Sanghamitra Mohanty
DOI: 10.5120/2389-3164
Sanghamitra Mohanty. Phonotactic Model for Spoken Language Identification in Indian Language Perspective. International Journal of Computer Applications. 19, 9 (April 2011), 18-24. DOI=10.5120/2389-3164
Indian languages are either Indo-Aryan, influenced by Sanskrit, or Dravidian, influenced by Tamil; the Dravidian languages also show the influence of Sanskrit. All Indian languages show the influence of Pali, whose graphemes are influenced by Brahmi. All Indian languages are phonetic in nature, and each has its own distinctive phone set. North Indian languages are Indo-Aryan and South Indian languages are Dravidian. Taking into account their respective phonetic properties in speech, we consider the characteristic consonant-vowel (CV) behaviour of each language's syllables and identify the language from it with an SVM classifier, using the limited training data available. In the process we analyse the PPR language modelling concept for four major Indian languages, Hindi, Bengali, Oriya, and Telugu, and the results are encouraging.
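As a rough illustration of the kind of CV-based syllable features the abstract refers to, the following minimal sketch turns a phone sequence into a distribution over syllable shapes. It is not the paper's feature extraction: the vowel inventory `VOWELS`, the romanized phone labels, the vowel-final splitting rule, and the function names are all assumptions made for this example.

```python
from collections import Counter

# Hypothetical vowel inventory over romanized phone labels (an assumption,
# not taken from the paper).
VOWELS = {"a", "aa", "i", "ii", "u", "uu", "e", "o", "ai", "au"}


def cv_pattern(phones):
    """Map each phone label to 'V' (vowel) or 'C' (anything else)."""
    return "".join("V" if p in VOWELS else "C" for p in phones)


def syllabify(pattern):
    """Split a C/V string into syllable-like units that each end at a vowel.

    This is a deliberately crude rule: trailing consonants are attached to
    the last unit, and no language-specific phonotactics are applied.
    """
    units, current = [], ""
    for symbol in pattern:
        current += symbol
        if symbol == "V":
            units.append(current)
            current = ""
    if current:
        if units:
            units[-1] += current
        else:
            units.append(current)
    return units


def cv_features(phones):
    """Relative frequency of each syllable shape, e.g. 'CV' or 'CCV'."""
    counts = Counter(syllabify(cv_pattern(phones)))
    total = sum(counts.values())
    return {shape: n / total for shape, n in counts.items()}


# Toy input, invented for illustration only.
features = cv_features(["n", "a", "m", "a", "s", "t", "e"])
```

Feature dictionaries of this kind, computed per utterance and aligned to a common set of syllable shapes, could serve as input vectors to an SVM classifier of the sort the abstract names.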