Speaker Recognition using VQ and DTW

Call for Paper

May Edition

IJCA solicits high quality original research papers for the upcoming May edition of the journal. The last date of research paper submission is 20 April 2026

Submit your paper

Know more

The week's pick

A Unified NIST SP 800-90B Validation Framework for CMOS True Random Number Generators and Quantum Random Number Generators

Che-Ping Lin

Random Articles

Reseach Article

Speaker Recognition using VQ and DTW

Published on August 2012 by Maruti Limkar, B. Rama Rao, Vidya Sagvekar

International Conference on Advances in Communication and Computing Technologies 2012

Foundation of Computer Science USA

ICACACT - Number 3

August 2012

Authors: Maruti Limkar, B. Rama Rao, Vidya Sagvekar

Maruti Limkar, B. Rama Rao, Vidya Sagvekar . Speaker Recognition using VQ and DTW. International Conference on Advances in Communication and Computing Technologies 2012. ICACACT, 3 (August 2012), 18-20.

@article{

author = { Maruti Limkar, B. Rama Rao, Vidya Sagvekar },

title = { Speaker Recognition using VQ and DTW },

journal = { International Conference on Advances in Communication and Computing Technologies 2012 },

issue_date = { August 2012 },

volume = { ICACACT },

number = { 3 },

month = { August },

year = { 2012 },

issn = 0975-8887,

pages = { 18-20 },

numpages = 3,

url = { /proceedings/icacact/number3/7982-1018/ },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Proceeding Article

%1 International Conference on Advances in Communication and Computing Technologies 2012

%A Maruti Limkar

%A B. Rama Rao

%A Vidya Sagvekar

%T Speaker Recognition using VQ and DTW

%J International Conference on Advances in Communication and Computing Technologies 2012

%@ 0975-8887

%V ICACACT

%N 3

%P 18-20

%D 2012

%I International Journal of Computer Applications

Abstract

Speaker recognition is a process where a person is recognized on the basis of his/her voice signals. In this paper we provide a brief overview for evolution of pattern classification technique used in speaker recognition. Also discussed propose process to modeling a speaker recognition system, which include pre-processing phase, feature extraction phase and pattern classification phase. Linear Prediction Cepstrum Coefficient (LPCC) and Mel Frequency Cepstrum Coefficient (MFCC) are used as the features for text dependent speaker recognition in this system and the experiments compare the recognition rate of LPCC, MFCC or a combination of LPCC and MFCC through using Vector Quantization (VQ) and Dynamic Time Warping (DTW) to recognize a speaker's identity. It proves that the combination of LPCC and MFCC has a higher recognition rate.

References

Campbell J. P,"Speaker Recogniton :ATutorial", Proc. of the IEEE,vol. 85,no. 9,pp. 1437-1462,sep. 1997.
Sadaoki Furui. , "Recent advances in speaker recognition",Pattern Recognition Letters. 1997,18 (9): 859-72.
ZhiyouMa,"Further Extraction for Speaker Recognition", IEEE International Conference on Systems, Man and Cybernetics,153- 158,2003.
Lawrence R. Rabiner. , "A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition", Proceedings of the IEEE, 77 (2), 1989, p. 257–286.
DengHaojiang,Wangshoujie,XingCangju,LiuQian,"Research of Text-Independent Speaker Recognition Using Clustering Statistic",Jumal of Circuits and Systems,2001.
Reynolds, D. A. and Rose, R. C. "Robust text independent speaker identification using Gaussian mixture speaker model", IEEE Trans. Speech Audio Process,3,1995,pp 72-83.
Sadoki Furui, "Cepstral analysis technique for automatic speaker verification", IEEE Trans. ASSP 29,1981, pages 254-272.
F. K. Soong,A. E. Rosenberg,L. R. Rabiner and B. H. Juang, "A Vector Quantization approach to Speaker Recognition", Florida: ICASSP Vol. 1, 1985, pp. 387-390.

Index Terms

Computer Science

Information Sciences

Keywords

Speaker Recognition Lpcc Mfcc Vq Dtw