CFP last date
20 January 2025
Reseach Article

Connected Bangla Speech Recognition using Artificial Neural Network

by Khalil Ahammad, Md. Mahfuzur Rahman
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 149 - Number 9
Year of Publication: 2016
Authors: Khalil Ahammad, Md. Mahfuzur Rahman
10.5120/ijca2016911568

Khalil Ahammad, Md. Mahfuzur Rahman . Connected Bangla Speech Recognition using Artificial Neural Network. International Journal of Computer Applications. 149, 9 ( Sep 2016), 38-41. DOI=10.5120/ijca2016911568

@article{ 10.5120/ijca2016911568,
author = { Khalil Ahammad, Md. Mahfuzur Rahman },
title = { Connected Bangla Speech Recognition using Artificial Neural Network },
journal = { International Journal of Computer Applications },
issue_date = { Sep 2016 },
volume = { 149 },
number = { 9 },
month = { Sep },
year = { 2016 },
issn = { 0975-8887 },
pages = { 38-41 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume149/number9/26027-2016911568/ },
doi = { 10.5120/ijca2016911568 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:54:18.782188+05:30
%A Khalil Ahammad
%A Md. Mahfuzur Rahman
%T Connected Bangla Speech Recognition using Artificial Neural Network
%J International Journal of Computer Applications
%@ 0975-8887
%V 149
%N 9
%P 38-41
%D 2016
%I Foundation of Computer Science (FCS), NY, USA
Abstract

There are a plenty of research experiments and achieved results in various languages throughout the world regarding speech recognition. But, in Bangla language, early researchers in this field had qualified success, though the scenario is being changed in recent years. This research work aims at developing a neural network based connected digit recognition system in Bangla language. Firstly, a Bangla digit corpus has been developed comprising of male and female speakers. Speech is recorded in connected fashion and words are extracted through automatic segmentation. Then MFCC features of the segmented words are calculated and these feature values are sent as the input to the back-propagation neural network (BPNN). BPNN learning algorithm is used to train the network. The required time to train the network, number of hidden layers, error threshold and number of epochs are considered while training the network to reach the best possible recognition accuracy. This proposed system has been implemented using object oriented programming and the achieved recognition accuracy is very much satisfactory and consistent. The network has been tested for three different setups and the best recognition accuracy achieved for digit dataset is 98.46%.

References
  1. Md. Ali Hossain, Md. Mijanur Rahman, Uzzal Kumar Prodhan, Md. Farukuzzaman Khan “Implementation Of Back-Propagation Neural Network For Isolated Bangla Speech Recognition”, IJIST-Vol.3, No.4, July 2013
  2. Pratik K. Kurzekar , Ratnadeep R. Deshmukh, Vishal B. Waghmare, Pukhraj P. Shrishrimal, “Continuous Speech Recognition System: A Review”, Asian Journal of Computer Science And Information Technology 4 : 6 (2014) 62 – 66
  3. S. A. Hossain, M. L. Rahman, F. Ahmed & M. Dewan, "Bangla speech synthesis, analysis, and recognition: an overview”, Proc. NCCPB, Dhaka, Bangladesh-2004.
  4. K. J. Rahman, M. A. Hossain, D. Das, A. Z. M. Touhidul Islam and Dr. M G. Ali, “Continuous Bangla Speech Recognition System”, Proc. 6th Int. Conf. on Computer and Information Technology (ICCIT), Dhaka, 2003.
  5. M. R. Hassan,B. Nath &M. A. Bhuiyan, "Bengali phoneme recognition: a new approach", Proc. 6th International Conference on Computer and Information Technology, Dhaka, Bangladesh-2003.
  6. Md. Abul Hasnat, Jabir Mowla, Mumit Khan, “Isolated and Continuous Bangla Speech Recognition: Implementation, Performance and application perspective”, Conference Papers (Centre for Research on Bangla Language Processing) Department of Computer Science and Engineering, BRAC University, Bangladesh, 2007.
  7. ParneetKaur, Parminder Singh, VidushiGarg, “Speech Recognition System; Challenges and Techniques”, International Journal of Computer Science and Information Technologies, Vol. 3 (3), 2012, 3989-3992.
  8. V.A. Keturi, “Speech Recognition Based on Artificial Neural Networks”, Helsinki University of Technology, Finland, 2004.
  9. Joseph Picone,"Continuous Speech Recognition Using Hidden Markov Models ", IEEEASSP MAGAZINE JULY 1990.
  10. Lawrence Rabiner, Biing Hwang Juang, B. Yegnanarayana,“Fundamentals of Speech Recognition”, Pearson Education India, 2008 -page-60
Index Terms

Computer Science
Information Sciences

Keywords

Connected Speech Recognition Bangla Speech Recognition Bangla Digits Back-Propagation Learning Algorithm MFCC.