We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 November 2024
Reseach Article

Speaker Recognition from Noisy Spoken Sentences

by Fatima K. Faek, Abdulbasit K. Al-talabani
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 70 - Number 20
Year of Publication: 2013
Authors: Fatima K. Faek, Abdulbasit K. Al-talabani
10.5120/12182-8213

Fatima K. Faek, Abdulbasit K. Al-talabani . Speaker Recognition from Noisy Spoken Sentences. International Journal of Computer Applications. 70, 20 ( May 2013), 11-14. DOI=10.5120/12182-8213

@article{ 10.5120/12182-8213,
author = { Fatima K. Faek, Abdulbasit K. Al-talabani },
title = { Speaker Recognition from Noisy Spoken Sentences },
journal = { International Journal of Computer Applications },
issue_date = { May 2013 },
volume = { 70 },
number = { 20 },
month = { May },
year = { 2013 },
issn = { 0975-8887 },
pages = { 11-14 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume70/number20/12182-8213/ },
doi = { 10.5120/12182-8213 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:33:21.801618+05:30
%A Fatima K. Faek
%A Abdulbasit K. Al-talabani
%T Speaker Recognition from Noisy Spoken Sentences
%J International Journal of Computer Applications
%@ 0975-8887
%V 70
%N 20
%P 11-14
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In this paper a text independent speaker recognizer from controlled noisy speech signals has been investigated. A recorded data is used for 20 Kurdish speakers (10 males, and 10 females) . The feature used in this work is the MFCC, and k-NN is used as a classifier. The recognition performance from the noisy speech signals has been improved by a de-noising technique using wavelet transform. The result show that the de-noising technique could improve the performance of speaker recognizer by about 36%.

References
  1. N. McLaughlin, J. Ming, and D. Crookes. 2011. Speaker recognition in Noisy Condition With limited Training Data,19th European Signal Processing Conference , Barcelona , Spain .
  2. M. Zamalloa, G. Bordel, L. J. Rodriguez, M. Penagarikano. 2006. Feature Selection Based on Genetic Algorithms for Speaker Recognition,IEEE Odyssey - The Speaker and Language Recognition Workshop.
  3. A. Srinivasan. 2012. Speaker Identification and Verification using Vector Quantization and Mel Frequency Cepstral Coefficients, Research Journal of Applied Sciences, Engineering and Technology 4(1): 33-40.
  4. V. Tiwari ,G. Ganga , J. Singhai , and M. Azad . 2011. Wavelet Based Noise Robust Features for Speaker Recognition ,An International Journal (SPIJ), Volume (5) : Issue (2) .
  5. Ji Ming, Timothy J. Hazen, James R. Glass, and Douglas A. Reynolds. 2007. Robust Speaker Recognition in Noisy Conditions, Transaction on Audio, Speech, and Language Processing , vol. 15, no. 5, 1711-1723.
  6. L. Wang, K. Minami, K. Yamamoto, S. Nakagawa. 2010. Speaker Identification By Combining MFCC and Phase Information in Noisy Environments , IEEE 4502-4505 ICASSP.
  7. P. Krishnamoorthy , H. S. Jayanna , S. R. M. Prasanna . 2011. Speaker recognition under limited data condition by noise addition ,Expert Systems with Applications 38 13487–13490.
  8. T. Azetsu, M. Abuku, N. Suetake and E. Uchin. 2012. Speaker Identification in Noisy Environment with Use of the Precise Model of the Human Auditory System ,Proceeding of the international multi Conference of Engineers and Computer Scientists Vol1, IMECS, Hong Kong .
  9. H. Seddik, A. Rahmouni and M. Sayadi, C. Esstt, AV. Taha Hussein . 2010. Text Independent Speaker Recognition Using The Mel Frequency Cepctral Coefficients And A Neural Network Classifier, 1008, Tunis, Tunisia , Ecole National Des Scince Information, Manouba, IEEE.
  10. R Sarikaya et. al. 1998. Wavelet packet transform features with application to speaker identification, in proceedings of the IEEE Nordic signal processing symposium .
  11. R. Sarikaya and J. H. L. Hansen. 2000. High resolution speech feature parameterization formonophone-based stressed speech recognition, IEEE signal processing letters vol7(7),pp. 182-185.
  12. O. Farooq and S. Datta . 2001. Mel Filter- Like Admissible Wavelet packet Structure for Speech Recognition , IEEE Signal Processing letters, Vol. 8 , No. 7,pp 196-198 .
  13. P. Maragos, J. F. Kaiser and T. F. Quatieri . 1993. Energy Separation in Signal Modulations with Application to Speech Analysis, IEEE transactions on signal processing, vol. 41, no. 10, pp. 3024-3051
  14. D. L. Donoho, 1995. De-noising by soft-thresholding, IEEE, Trans. on Inf. Theory, 41, 3, pp. 613-627.
  15. S. Davis and P. Mermelstein. 1980. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences, IEEE Transaction Acoustics Speech and Signal Processing, vol. 4, pp. 375-366.
  16. J. Wu and Z. G. Cao,2005. Improved MFCC-Based Feature for Robust Speaker Identification, TSINGHUA Science and Technology, vol. 10, pp. 158–161.
  17. Wang Yutai, Li Bo, Jiang Xiaoqing, Liu Feng, Wang Lihao. 2011. Speaker Recognition Based on Dynamic MFCC Parameters, IEEE .
  18. D. Hand, H. Mannila, P. Smyth. 2001. Principles of Data Mining. The MIT Press.
  19. G. Guo, H. Wang ,K. Greer ,D. Bell , and Y. Bi . 2001. KNN Model-Based Approach in Classification ,This work was partly supported by the European Commission project ICONS, project no. IST-2001-32429.
Index Terms

Computer Science
Information Sciences

Keywords

Speaker recognition in noisy environment MFCC features k-NN classifier de- noising signals in wavelet domain