CFP last date
20 January 2025
Reseach Article

Hindi Speech Recognition System with Robust Front End-Back End Features

by Atul Gairola, Swapna Baadkar
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 64 - Number 1
Year of Publication: 2013
Authors: Atul Gairola, Swapna Baadkar
10.5120/10601-5305

Atul Gairola, Swapna Baadkar . Hindi Speech Recognition System with Robust Front End-Back End Features. International Journal of Computer Applications. 64, 1 ( February 2013), 42-45. DOI=10.5120/10601-5305

@article{ 10.5120/10601-5305,
author = { Atul Gairola, Swapna Baadkar },
title = { Hindi Speech Recognition System with Robust Front End-Back End Features },
journal = { International Journal of Computer Applications },
issue_date = { February 2013 },
volume = { 64 },
number = { 1 },
month = { February },
year = { 2013 },
issn = { 0975-8887 },
pages = { 42-45 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume64/number1/10601-5305/ },
doi = { 10.5120/10601-5305 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:15:17.759472+05:30
%A Atul Gairola
%A Swapna Baadkar
%T Hindi Speech Recognition System with Robust Front End-Back End Features
%J International Journal of Computer Applications
%@ 0975-8887
%V 64
%N 1
%P 42-45
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The ideal aim of a speech recognition system is efficient and accurate conversion of speech signal into text message without any dependence on device, environment, and speaker. In this paper a system for Hindi speech recognition is discussed employing robust front end- back end techniques. At front end MF-PLP is used for feature extraction while continuous density HMM is used at the back end for evaluation. A comparison of MFCC, PLP & MF-PLP is also presented to show the robust characteristics of MF-PLP.

References
  1. H. Hermansky, "Perceptually predictive (PLP) analysis of speech," Journal of Acoustic Society of America, vol. 87, 1990, pp. 1738-1752.
  2. A. O. Afolabi, A. Williams, and O. Dotun, "Development of a text dependent speaker identification security system", Research Journal of Applied Sciences, 2 (6), pp. 677-684, 2007.
  3. K. Samudravijaya, Barot & Maria, "A Comparison of Public-Domain Software Tools for Speech Recognition", In WSLP, pp. 125-131, 2003.
  4. R. Josef and P. Pollak , "Modified Feature Extraction Methods in Robust Speech Recognition", Radioelektronika, 17th IEEE International Conference, pp. 1-4, (2007).
  5. Andra´s Zolnay , Daniil Kocharov , Ralf Schlüter and Hermann Ney, "Using multiple acoustic feature sets for speech recognition", Science direct, Speech Communication 49 , pp. 514–525, 2007.
  6. study on the effect of additive noise on automatic speech recognition system", Reports of NATO Research Study Group (RSG. 10), 1992.
  7. N. Goel, S. Thomas, M. Agarwal et al. "Approaches to Automatic Lexicon Learning with Limited Training Examples", Proc. of IEEE Conference on Acoustic Speech and Signal Processing, 2010.
  8. S. F. Boll, "Suppression of Acoustic Noise in Speech using Spectral Subtraction", IEEE Transaction of Acoustic, Speech and Signal Processing, Vol. 27, No. 2, 1979, pp. 113-120.
  9. H. Hermansky and N. Morgan, "RASTA Processing of Speech", IEEE Transaction on Speech and Audio Processing, Vol. 2, No. 4, 1994, pp. 578-589.
  10. S. Young, "A Review of Large Vocabulary Continuous Speech Recognition", IEEE Signal Processing Mag. , Vol. 13, 1996, pp. 45-57.
  11. C. H. Lee, J. L. Gauvain, R. Pieraccini, and L. R. Rabiner, "Large Vocabulary Speech Recognition using Subword Units", Speech Communication, Vol. 13, 1993, pp. 263-279.
  12. X. D. Huang, H. W. Hon, M. Y. Hwang, and K. F. Lee, "A comparative study of discrete, semi continuous and continuous hidden Markov models," Computer Speech and Language, vol. 7(4), 1993, pp. 359-368.
Index Terms

Computer Science
Information Sciences

Keywords

Feature Extraction Front End Back End MFCC PLP MF-PLP CDHMM