CFP last date
20 January 2025
Reseach Article

Comparative Analysis of Automatic Speaker Recognition using Kekreís Fast Codebook Generation Algorithm in Time and Transform Domain

by DR. H. B. Kekre, Ms. Vaishali Kulkarni
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 7 - Number 1
Year of Publication: 2010
Authors: DR. H. B. Kekre, Ms. Vaishali Kulkarni
10.5120/1128-1479

DR. H. B. Kekre, Ms. Vaishali Kulkarni . Comparative Analysis of Automatic Speaker Recognition using Kekreís Fast Codebook Generation Algorithm in Time and Transform Domain. International Journal of Computer Applications. 7, 1 ( September 2010), 37-41. DOI=10.5120/1128-1479

@article{ 10.5120/1128-1479,
author = { DR. H. B. Kekre, Ms. Vaishali Kulkarni },
title = { Comparative Analysis of Automatic Speaker Recognition using Kekreís Fast Codebook Generation Algorithm in Time and Transform Domain },
journal = { International Journal of Computer Applications },
issue_date = { September 2010 },
volume = { 7 },
number = { 1 },
month = { September },
year = { 2010 },
issn = { 0975-8887 },
pages = { 37-41 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume7/number1/1128-1479/ },
doi = { 10.5120/1128-1479 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T19:55:21.349966+05:30
%A DR. H. B. Kekre
%A Ms. Vaishali Kulkarni
%T Comparative Analysis of Automatic Speaker Recognition using Kekreís Fast Codebook Generation Algorithm in Time and Transform Domain
%J International Journal of Computer Applications
%@ 0975-8887
%V 7
%N 1
%P 37-41
%D 2010
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In this paper, an approach based on Kekre’s fast code book Generation (KFCG) Algorithm in the transform domain has been proposed. KFCG is used for feature extraction in both the training and testing phases. Three methods for codebook generation have been used. In the 1st method, codebooks are generated from the speech samples by using Discrete Fourier Transform (DFT). In the 2nd method, the codebooks are generated using Discrete Cosine Transform (DCT). In the 3rd method, the codebooks are generated using the Discrete Sine Transform (DST). For speaker identification, the codebook of the test sample is similarly generated and compared with the codebooks of the reference samples stored in the database. The results obtained for the above methods in the transform domain are compared with the results obtained in the time domain analysis. The results show that KFCG gives better results in transform domain than in time domain. Also the results improve as the vector dimension while generating the codebook is increased.

References
  1. Lawrence Rabiner, Biing-Hwang Juang and B.Yegnanarayana, “Fundamental of Speech Recognition”, Prentice-Hall, Englewood Cliffs, 2009.
  2. S Furui, “50 years of progress in speech and speaker recognition research”, ECTI Transactions on Computer and Information Technology, Vol. 1, No.2, November 2005.
  3. D. A. Reynolds, “An overview of automatic speaker recognition technology”, Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP’02), 2002, pp. IV-4072–IV-4075.
  4. Joseph P. Campbell, Jr., Senior Member, IEEE, “Speaker Recognition: A Tutorial”, Proceedings of the IEEE, vol. 85, no. 9, pp. 1437-1462, September 1997.
  5. F. Bimbot, J.-F. Bonastre, C. Fredouille, G. Gravier, I. Magrin-Chagnolleau, S. Meignier, T. Merlin, J. Ortega-García, D.Petrovska-Delacrétaz, and D. A. Reynolds, “A tutorial on text-independent speaker verification,” EURASIP J. Appl. Signal Process., vol. 2004, no. 1, pp. 430–451, 2004.
  6. D. A. Reynolds, “Experimental evaluation of features for robust speaker identification,” IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 639–643, Oct. 1994.
  7. Tomi Kinnunen, Evgeny Karpov, and Pasi Fr¨anti, “Realtime Speaker Identification”, ICSLP2004.
  8. Marco Grimaldi and Fred Cummins, “Speaker Identification using Instantaneous Frequencies”, IEEE Transactions on Audio, Speech, and Language Processing, vol., 16, no. 6, August 2008.
  9. Zhong-Xuan, Yuan & Bo-Ling, Xu & Chong-Zhi, Yu. (1999). “Binary Quantization of Feature Vectors for Robust Text-Independent Speaker Identification” in IEEE Transactions on Speech and Audio Processing, Vol. 7, No. 1, January 1999. IEEE, New York, NY, U.S.A.
  10. R. M. Gray.: ‘Vector quantization’, IEEE ASSP Marg., pp. 4-29, Apr. 1984.
  11. Y. Linde, A. Buzo, and R. M. Gray.: ‘An algorithm for vector quantizer design,” IEEE Trans. Commun.’, vol. COM-28, no. 1, pp. 84-95, 1980.
  12. A. Gersho, R.M. Gray.: ‘Vector Quantization and Signal Compression’, Kluwer Academic Publishers, Boston, MA, 1991.
  13. F. K. Soong, et. al., “A vector quantization approach to speaker recognition”, At & T Technical Journal, 66, pp. 14-26, 1987.
  14. A. E. Rosenberg and F. K. Soong, “Evaluation of a vector quantization talker recognition system in text independent and text dependent models”, Computer Speech and Language 22, pp. 143-157, 1987.
  15. Jeng-Shyang Pan, Zhe-Ming Lu, and Sheng-He Sun.: ‘An Efficient Encoding Algorithm for Vector Quantization Based on Subvector Technique’, IEEE Transactions on image processing, vol 12 No. 3 March 2003.
  16. F. Soong, E. Rosenberg, B. Juang, and L. Rabiner, "A Vector Quantization Approach to Speaker Recognition", AT&T Technical Journal, vol. 66, March/April 1987, pp. 1426.
  17. Md. Rashidul Hasan, Mustafa Jamil, Md. Golam Rabbani Md. Saifur Rahman , “Speaker Identification using Mel Frequency Cepstral Coefficients”, 3rd International Conference on Electrical & Computer Engineering ICECE held at Dhaka, Bangladesh , 28-30 December 2004.
  18. Poonam Bansal, Amrita Dev, Shail Bala Jain, “Automatic Speaker Identification using Vector Quantization”, Asian Journal of Information Technology 6 (9): 938-942, 2007.
  19. Jyoti Singhai, “Automatic Speaker Recognition :An Approach using DWT based Feature Extraction and Vector Quantization”, IETE Technical Review, vol. 24, No 5, pp 395-402, September-October 2007
  20. H. B. Kekre, Tanuja K. Sarode, “Speech Data Compression using Vector Quantization”, WASET International Journal of Computer and Information Science and Engineering (IJCISE), Fall 2008, Volume 2, Number 4, pp.: 251-254, 2008. http://www.waset.org/ijcise.
  21. H. B. Kekre, Tanuja K. Sarode, “New Fast Improved Codebook Generation Algorithm for Color Images using Vector Quantization,” International Journal of Engineering and Technology, vol.1, No.1, pp. 67-77, September 2008.
  22. H. B. Kekre, Tanuja K. Sarode, “Fast Codebook Generation Algorithm for Color Images using Vector Quantization,” International Journal of Computer Science and Information Technology, Vol. 1, No. 1, pp: 7-12, Jan 2009.
  23. H. B. Kekre, Tanuja K. Sarode, “An Efficient Fast Algorithm to Generate Codebook for Vector Quantization,” First International Conference on Emerging Trends in Engineering and Technology, ICETET-2008, held at Raisoni College of Engineering, Nagpur, India, 16-18 July 2008, Avaliable at online IEEE Xplore.
  24. H B Kekre, Vaishali Kulkarni, “Speaker Identification by using Vector Quantization”, International Journal of Engineering Science and Technology, May 2010 edition.
  25. H B Kekre, Vaishali Kulkarni, “Performance Comparison of Speaker Recognition using Vector Quantization by LBG and KFCG”, International Journal of Computer Applications, vol. 3, July 2010.
  26. H.B. Kekre, Archana Athawale, Tanuja K. Sarode, Kalpana Sagvekar, “Comparative Performance of Information Hiding in Vector Quantized Codebooks using LBG, KPE, KMCG and KFCG”, International Journal of Computer Science and Information Security, 2010 Vol: 8 Issue: 2,pp 89-95.
  27. H B Kekre, Archana Athawale, Tanuja Sarode and Kalpana Sagvekar, “Increased Capacity of Information Hiding using Mixed Codebooks of Vector Quantization Algorithms: LBG, KPE and KMCG, International Journal of Advances in Computational Sciences and Technology, Volume 3 Number 2 (2010) pp. 245–256.
Index Terms

Computer Science
Information Sciences

Keywords

Vector Quantization (VQ) Code Vectors Code Book Discrete Fourier Transform (DFT) Discrete Sine Transform (DST) Discrete Cosine Transform (DCT)