CFP last date
20 January 2025
Reseach Article

Glottal Excitation Feature based Gender Identification System using Ergodic HMM

by R. Rajeshwara Rao, A. Prasad
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 17 - Number 3
Year of Publication: 2011
Authors: R. Rajeshwara Rao, A. Prasad
10.5120/2200-2794

R. Rajeshwara Rao, A. Prasad . Glottal Excitation Feature based Gender Identification System using Ergodic HMM. International Journal of Computer Applications. 17, 3 ( March 2011), 31-36. DOI=10.5120/2200-2794

@article{ 10.5120/2200-2794,
author = { R. Rajeshwara Rao, A. Prasad },
title = { Glottal Excitation Feature based Gender Identification System using Ergodic HMM },
journal = { International Journal of Computer Applications },
issue_date = { March 2011 },
volume = { 17 },
number = { 3 },
month = { March },
year = { 2011 },
issn = { 0975-8887 },
pages = { 31-36 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume17/number3/2200-2794/ },
doi = { 10.5120/2200-2794 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:04:40.769101+05:30
%A R. Rajeshwara Rao
%A A. Prasad
%T Glottal Excitation Feature based Gender Identification System using Ergodic HMM
%J International Journal of Computer Applications
%@ 0975-8887
%V 17
%N 3
%P 31-36
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In this paper, through different experimental studies it is demonstrated that the time varying glottal excitation component of speech can be exploited for text independent gender recognition studies. Linear prediction (LP) residual is used as a representation of excitation information in speech. The gender-specific information in the excitation of voiced speech is captured using the Hidden Markov Models (HMMs). The decrease in the error during training and recognizing genders during testing phase close to 100 % accuracy demonstrates that the excitation component of speech contains gender-specific information and is indeed being effectively captured by continuous Ergodic HMM. A gender recognition study using gender specific features for different HMM states, mixture components, size of testing data on the performance of the gender recognition is evaluated. We demonstrate the gender recognition studies on TIMIT database.

References
  1. Alex Acero and Xuedong Huang, Speaker and Gender Normalization for Continuous-Density Hidden Markov Models, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal , IEEE, May 1996
  2. C. Neti and Salim Roukos. Phone-specific gender-dependent models for continuous speech recognition, Automatic Speech Recognition and Understanding Workshop (ASRU97), Santa Barbara, CA, 1997.
  3. R. Vergin, A. Farhat and D.O’Shaughnessy, “Robust gender-dependent acoustic-phonetic modeling in continuous speech recognition based on a new automatic male/female classification”, Proc. Of IEEE Int. Conf. on Spoken Language (ICSLP), pp. 1081, Oct. 1996.
  4. S. Slomka and S. Sridharan, “Automatic gender identification optimized for language independence”, Proc. Of IEEE TENCON’97, pp. 145-148,Dec. 1997.
  5. O’Shaughnessy, D., 1987. Speech Communication: Human and Machine. Addison-Wesley, New York.
  6. Rabiner, L.R., Juang, B.H., 1993. Fundamentals of Speech Recognition. Prentice-Hall, Englewood Cliffs, NJ.
  7. Makhoul, J., 1975. Linear prediction: a tutorial review. Proc. IEEE 63, 561–580.
  8. B.S. Atal, “Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification” J. Acoust. Soc. Ameri., vol. 55, pp.1304-1312, Jun. 1974.K. Elissa, “Title of paper if known,” unpublished
  9. A.E. Rosenberg and M. Sambur, “New techniques for automatic speaker verification.”, vol. 23, no.2, pp.169-175, 1975.
  10. M. R. Sambur, “Speaker recognition using orthogonal linear prediction,” IEEE Trans. Acoust. Speech, Signal Processing, vol. 24, pp.283-289, Aug. 1976
  11. J. Naik and G. R. Doddington, “ high performance speaker verification using principal spectral components”, in proc. IEEE Int. Conf. Acoust. Speech, Singal Processing, pp. 881-884, 1986.
  12. Furui, S., 1997. Recent advances in speaker recognition. Pattern Recognition Lett. 18, 859–872.
  13. S.R.Mahadeva Prassana, Cheedella S. Gupta, B. Yegnanarayana. Extraction of speaker-specific excitation information from linear prediction residual of speech. Speech Communications Vol.48 (2006) pp.1243-1261.
  14. Dempster, A., Laird, N., and Rubin, D., “Maximum likelihood from incomplete data via the EM algorithm,” Journal of the Royal Statistical Society, vol. 39, pp. 1-38, 1977.
  15. Molau, S., Pitz, M., Schluter, R., and Ney, H., “Computing Mel-frequency cepstral coefficients on the power spectrum,” Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), vol. 1, pp. 73-76, May. 2001.
  16. Picone, J. W., “Signal modeling techniques in speech recognition,” Proceedings of IEEE, vol. 81, no. 9, pp. 1215-1247, Sep. 1993.
  17. M. Forsyth and M. Jack, ―Discriminating semi-continuous HMM for
  18. speaker verification,‖ in proc. IEEE Int. Conf. Acoust. Speech, Signal
  19. Processing, vol.1, pp. 313-316, 1994.
  20. M. Forsyth, ―Discriminating observation probability (DOP) HMM for
  21. speaker verification,‖ Speech Communicaiton, vol. 17, pp.117-129,
  22. 1995.
  23. A. P. Dempster, N. M. Laird, and D. B. Rubin, “Maximum likelihood from incomplete data via the EM algorithm”, J. Royal Statist. Soc. Ser. B. (methodological), vol. 39, pp. 1-38, 1977
  24. K.N. Stevens, Acoustic Phonetics. Cambridge, England: The MIT Press, 1999
Index Terms

Computer Science
Information Sciences

Keywords

Gender Hidden Markov Model (HMM) LPC MFCC