International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 149 - Number 9 |
Year of Publication: 2016 |
Authors: Khalil Ahammad, Md. Mahfuzur Rahman |
10.5120/ijca2016911568 |
Khalil Ahammad, Md. Mahfuzur Rahman . Connected Bangla Speech Recognition using Artificial Neural Network. International Journal of Computer Applications. 149, 9 ( Sep 2016), 38-41. DOI=10.5120/ijca2016911568
There are a plenty of research experiments and achieved results in various languages throughout the world regarding speech recognition. But, in Bangla language, early researchers in this field had qualified success, though the scenario is being changed in recent years. This research work aims at developing a neural network based connected digit recognition system in Bangla language. Firstly, a Bangla digit corpus has been developed comprising of male and female speakers. Speech is recorded in connected fashion and words are extracted through automatic segmentation. Then MFCC features of the segmented words are calculated and these feature values are sent as the input to the back-propagation neural network (BPNN). BPNN learning algorithm is used to train the network. The required time to train the network, number of hidden layers, error threshold and number of epochs are considered while training the network to reach the best possible recognition accuracy. This proposed system has been implemented using object oriented programming and the achieved recognition accuracy is very much satisfactory and consistent. The network has been tested for three different setups and the best recognition accuracy achieved for digit dataset is 98.46%.