National Conference “Electronics, Signals, Communication and Optimization" |
Foundation of Computer Science USA |
NCESCO2015 - Number 1 |
September 2015 |
Authors: Akhila K.S., R. Kumaraswamy |
378a0905-8edc-4038-9623-beed266e8226 |
Akhila K.S., R. Kumaraswamy . Deep Belief Networks for Kannada Phoneme Recognition. National Conference “Electronics, Signals, Communication and Optimization". NCESCO2015, 1 (September 2015), 25-30.
In this paper, a baseline phoneme recognition system for Kannada language is built using MFCC and Deep Belief Networks (DBNs). Phonemes are segmented from continuous Kannada speech and MFCC features are extracted from each speech frame. These features are further used as input to the recognizer. DBNs are probabilistic generative model which are constructed by stacking Restricted Boltzmann machines (RBMs). The learning procedure of DBN undergoes pre-training phase followed by fine-tuning phase. Evaluations are also carried out on conventional speech recognition methods such as Multi-Layer Feed Forward Neural Networks (ML-FFNNs) and Support Vector Machines (SVMs). The Experimental result shows that DBN's performance is superior to the conventional methods for recognition of Kannada phonemes using MFCC features.