International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 85 - Number 5 |
Year of Publication: 2014 |
Authors: Kshirod Sarmah, Utpal Bhattacharjee |
10.5120/14840-3103 |
Kshirod Sarmah, Utpal Bhattacharjee . GMM based Language Identification using MFCC and SDC Features. International Journal of Computer Applications. 85, 5 ( January 2014), 36-42. DOI=10.5120/14840-3103
Language Identification (LID) is one of the most popular areas of research in speech signal processing. Now a day's lots of approaches have been used to improve performance of LID system which includes Parallel Phone Recognition Language Modeling (PPRLM), Support Vector Machine (SVM) and general Gaussian Mixture Model (GMM) etc. The state-of-art LID system has been utilised lots of feature vectors like LPCC, MFCC, SDC and prosodic. Although fusion of prosodic features with MFCC features shows some improvement in the performance of the LID system. But still it is not sufficient. In this paper, a baseline system for the LID system in multilingual environments has been developed using GMM as a classifier and MFCC combined with Shifted-Delta-Cepstral (SDC) as front end processing feature vectors. In this works, we used the Arunachali Language Speech Database (ALS-DB), a multilingual and multichannel speech corpus which was recently collected from the four local languages namely Adi, Apatani, Galo and Nyishi in Arunachal Pradesh including Hindi and English as secondary languages. The performance of the LID system has been improved by combing MFCC and SDC features than its individual performances. The minimum ERR rates for the features MFCC and SDC individually are 19. 70% and 11. 83% respectively while minimum ERR rate for the combined features both MFCC and SDC is 6. 40%. Approximately 15. 00% and 6. 00% of performance of the LID system has been improved while using the combining features of MFCC with SDC over the baseline systems that using MFCC and SDC features in individual respectively.