International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 19 - Number 5 |
Year of Publication: 2011 |
Authors: Cini Kurian, Kannan Balakrishnan |
10.5120/2360-3091 |
Cini Kurian, Kannan Balakrishnan . Automated Transcription System for Malayalam Language. International Journal of Computer Applications. 19, 5 ( April 2011), 5-10. DOI=10.5120/2360-3091
Malayalam is one of the 22 scheduled languages in India with more than 130 million speakers. This paper presents a report on the development of a speaker independent, continuous transcription system for Malayalam. The system employs Hidden Markov Model (HMM) for acoustic modeling and Mel Frequency Cepstral Coefficient (MFCC) for feature extraction. It is trained with 21 male and female speakers in the age group ranging from 20 to 40 years. The system obtained a word recognition accuracy of 87.4% and a sentence recognition accuracy of 84%, when tested with a set of continuous speech data.