We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 November 2024
Reseach Article

MFCC VQ based Speaker Recognition and Its Accuracy Affecting Factors

by Satyanand Singh, Dr. E.G Rajan
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 21 - Number 6
Year of Publication: 2011
Authors: Satyanand Singh, Dr. E.G Rajan
10.5120/2519-3423

Satyanand Singh, Dr. E.G Rajan . MFCC VQ based Speaker Recognition and Its Accuracy Affecting Factors. International Journal of Computer Applications. 21, 6 ( May 2011), 1-6. DOI=10.5120/2519-3423

@article{ 10.5120/2519-3423,
author = { Satyanand Singh, Dr. E.G Rajan },
title = { MFCC VQ based Speaker Recognition and Its Accuracy Affecting Factors },
journal = { International Journal of Computer Applications },
issue_date = { May 2011 },
volume = { 21 },
number = { 6 },
month = { May },
year = { 2011 },
issn = { 0975-8887 },
pages = { 1-6 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume21/number6/2519-3423/ },
doi = { 10.5120/2519-3423 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:07:45.732227+05:30
%A Satyanand Singh
%A Dr. E.G Rajan
%T MFCC VQ based Speaker Recognition and Its Accuracy Affecting Factors
%J International Journal of Computer Applications
%@ 0975-8887
%V 21
%N 6
%P 1-6
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

The present study was conducted to evaluate the accuracy affecting factors of a Mel-Frequency Cepstral Coefficients (MFCC) and Vector Quantization (VQ) based speaker recognition system. This investigation analyses the factors that affecting recognition accuracy using speech signal from day to day life in surrounding environments. It was studied the mismatch affects of text-dependency, voice sample length, speaking language, speaking style, mimicry, the quality of microphone, utterance sample quality and surrounding noise. The corpuses of 10 people of 20 utterance subjects were collected which were indicate that any mismatch degrades recognition accuracy. It was found that most dominating factors that degrades the accuracy of speaker recognition systems were surrounding noise, quality of microphone by which voice sample were collected, disguise, and degrading of the sample rate and quality. Speech-related factors and sample length were less critical.

References
  1. Gatica-Perez, G. Lathoud, J.-M. Odobez and I. Mc Cowan. 2007 Audiovisual probabilistic tracking of multiple speakers in meetings, IEEE Transactions on Speech and Audio Processing, 15(2), pp. 601–616.
  2. J. P. Cambell, Jr. 1997 Speaker Recognition A Tutorial Proceedings of the IEEE, 85(9), pp. 1437-1462.
  3. Faundez-Zanuy M. and Monte-Moreno E. 2005 State-of-the-art in speaker recognition , Aerospace and Electronic Systems Magazine, IEEE, 20(5), pp. 7-12.
  4. K. Saeed and M. K. Nammous. 2005 Heuristic method of Arabic speech recognition, in Proc. IEEE 7th Int. Conf. DSPA, Moscow, Russia, pp. 528–530.
  5. Lamel, L.F. and Gauvain, J.L., 2000. Speaker Verification over the Telephone, Speech Communication, pp. 141–154.
  6. Ortega-Garcia, J., Gonz´alez-Rodriguez, J., et al., May 1998 AHUMADA: A large speech corpus in Spanish for speaker identification and verification, IEEE Intl. Conf. on Acoust. Speech and Signal Proc, pp. 773–776.
  7. Singh Satyanand, Dr. E.G Rajan. March 2011 Vector Quantization Approach for Speaker Recognition Using MFCC and Inverted MFCC, International Journal of Computer Applications,17(1), pp. 1-7 .
  8. Yegnanarayana B., Prasanna S.R.M., Zachariah J.M. and Gupta C. S. 2005 Combining evidence from source suprasegmental and spectral features for a fixed-text speaker verification system , IEEE Trans. Speech and Audio Processing, 13(4), pp. 575-582.
  9. J. Kittler, M. Hatef, R. Duin, J. Mataz. 1998 On combining classifiers, IEEE Trans, Pattern Anal. Mach. Intell, 20(3), pp. 226-239.
  10. He, J., Liu, L., Palm, G. 1999 A Discriminative Training Algorithm for VQ-based Speaker Identification , IEEE Transactions on Speech and Audio Processing, 7(3), pp. 353-356.
  11. Laurent Besacier and Jean-Francois Bonastre. 2000 Subband architecture for automatic speaker recognition, Signal Processing, 80, pp. 1245-1259.
  12. Ganchev, T., Fakotakis, N., and Kokkinakis, G. 2005 Comparative Evaluation of Various MFCC Implementations on the Speaker Verification Task, Proc. of SPECOM Patras, Greece, pp. 1191-194.
  13. Zheng F., Zhang, G. and Song, Z. 2001 Comparison of different implementations of MFCC, J. Computer Science & Technology 16(6), pp. 582-589.
Index Terms

Computer Science
Information Sciences

Keywords

GF Triangular Filter Subbands Correlation MFCC inverted MFCC Vector Quantization