CFP last date
20 December 2024
Reseach Article

Hindi Speech Recognition and Online Speaker Adaptation

Published on None 2011 by Ganesh Sivaraman, K Samudravijaya
International Conference on Technology Systems and Management
Foundation of Computer Science USA
ICTSM - Number 1
None 2011
Authors: Ganesh Sivaraman, K Samudravijaya
5a0f7ffa-98f6-4123-8146-4e3083eb1e24

Ganesh Sivaraman, K Samudravijaya . Hindi Speech Recognition and Online Speaker Adaptation. International Conference on Technology Systems and Management. ICTSM, 1 (None 2011), 27-30.

@article{
author = { Ganesh Sivaraman, K Samudravijaya },
title = { Hindi Speech Recognition and Online Speaker Adaptation },
journal = { International Conference on Technology Systems and Management },
issue_date = { None 2011 },
volume = { ICTSM },
number = { 1 },
month = { None },
year = { 2011 },
issn = 0975-8887,
pages = { 27-30 },
numpages = 4,
url = { /proceedings/ictsm/number1/2779-42/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 International Conference on Technology Systems and Management
%A Ganesh Sivaraman
%A K Samudravijaya
%T Hindi Speech Recognition and Online Speaker Adaptation
%J International Conference on Technology Systems and Management
%@ 0975-8887
%V ICTSM
%N 1
%P 27-30
%D 2011
%I International Journal of Computer Applications
Abstract

Speaker Adaptation is a technique which is used to improve the recognition accuracy of Automatic Speech Recognition (ASR) systems. Here, we report a study of the impact of online speaker adaptation on the performance of a speaker independent, continuous speech recognition system for Hindi language. The speaker adaptation is performed using the Maximum Likelihood Linear Regression (MLLR) transformation of acoustic models. The ASR system was trained using narrowband speech. The efficacy of the speaker adaptation is studied by using an unrelated speech database as test data. The MLLR transform based speaker adaptation technique is found to improve the accuracy of the Hindi ASR system by 3%. It was also observed that the improvement in accuracy is dependent upon the recognition accuracy of the un-adapted system.

References
  1. Rabiner, L.R. 1989. A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition. In Proceedings of the IEEE 77, 257—286.
  2. Legetter, C.J. 1995. Improved acoustic modeling for HMMs using linear transformations. Ph.D. Thesis. University of Cambridge, (1995)
  3. Leggetter, C.J., Woodland, P.C. 1995. Maximum likelihood linear regression for speaker adaptation of HMMs. Computer Speech & Language 9, 171–185.
  4. Doh, S.-J. 2000. Enhancements to Transformation-Based Speaker Adaptation: Principal Component and Inter-Class Maximum Likelihood Linear Regression, PhD. Thesis, Carnegie Mellon University.
  5. CMU sphinx – Speech Recognition Toolkit, http://www.cmusphinx.sourceforge.net.
  6. Chan, A., et al. 2003. The Hieroglyphs: Building Speech Applications Using CMU Sphinx and Related
  7. Samudravijaya, K. 2000. Hindi Speech Recognition. J. Acoustic Society of India 29(1), 385–393.
  8. Sivaraman, G. et.al. 2011. Higher Accuracy of Hindi Accuracy of Hindi Speech Recognition Due to Online Speaker Adaptation, In Proceedings of ICTSM 2011, CCIS 145, 233 – 238.
Index Terms

Computer Science
Information Sciences

Keywords

Automatic Speech Recognition online speaker adaptation Maximum Likelihood Linear Regression (MLLR) Hindi Speech recognition