CFP last date
20 January 2025
Reseach Article

DWT and MFCCs based Feature Extraction Methods for Isolated Word Recognition

by Mahmoud I. Abdalla, Haitham M. Abobakr, Tamer S. Gaafar
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 69 - Number 20
Year of Publication: 2013
Authors: Mahmoud I. Abdalla, Haitham M. Abobakr, Tamer S. Gaafar
10.5120/12087-8165

Mahmoud I. Abdalla, Haitham M. Abobakr, Tamer S. Gaafar . DWT and MFCCs based Feature Extraction Methods for Isolated Word Recognition. International Journal of Computer Applications. 69, 20 ( May 2013), 21-25. DOI=10.5120/12087-8165

@article{ 10.5120/12087-8165,
author = { Mahmoud I. Abdalla, Haitham M. Abobakr, Tamer S. Gaafar },
title = { DWT and MFCCs based Feature Extraction Methods for Isolated Word Recognition },
journal = { International Journal of Computer Applications },
issue_date = { May 2013 },
volume = { 69 },
number = { 20 },
month = { May },
year = { 2013 },
issn = { 0975-8887 },
pages = { 21-25 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume69/number20/12087-8165/ },
doi = { 10.5120/12087-8165 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:30:47.420380+05:30
%A Mahmoud I. Abdalla
%A Haitham M. Abobakr
%A Tamer S. Gaafar
%T DWT and MFCCs based Feature Extraction Methods for Isolated Word Recognition
%J International Journal of Computer Applications
%@ 0975-8887
%V 69
%N 20
%P 21-25
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

A new method for feature extraction is presented in this paper for speech recognition using a combination of discrete wavelet transform (DWT) and mel Frequency Cepstral Coefficients (MFCCs). The objective of this method is to enhance the performance of the proposed method by introducing more features from the signal. The performance of the Wavelet-based mel Frequency Cepstral Coefficients method is compared to mel Frequency Cepstral Coefficients based method for features extraction. Wavelet transform is applied to the speech signal where the input speech signal is decomposed into various frequency channels using the properties of wavelet transform. then Mel-Frequency Cepstral Coefficients (MFCCs) of the wavelet channels are calculated. A new set of features can be generated by concatenating both features. The speech signals are sampled directly from the microphone. Neural Networks (NN) are used in the proposed methods for classification. The proposed method is implemented for 15 male speakers uttering 10 isolated words each which are the digits from zero to nine. each digit is repeated 15 times.

References
  1. M. C. Shrotriya, Omar Farooq, and Z. A. Abbasi, " Hybrid Wavelet based Lpc Features for Hindi Speech Recognition ", International Journal of Information and Communication Technology, March 2008.
  2. Santosh k. Gaikwad, Bharti W. Gawali and Pravin Yannawar, "A Review on Speech Recognition Technique", international journal of computer applications, November 2010.
  3. A. Shafik, S. M. Elhalafawy, S. M. Diab, B. M. Sallam and F. E. Abd El-samie, "A Wavelet based Approach for Speaker Identification from Degraded Speech", International Journal of Communication Networks and Information Security (IJCNIS), December 2009.
  4. Yousef Ajami Alotaibi, "Comparative Study of ANN and HMM to Arabic Digits Recognition Systems", JKAU: Eng. Sci. , Vol. 19 No. 1, pp: 43-60 (2008 A. D. / 1429 A. H. )
  5. Engin Avci, "An Automatic System for Turkish Word Recognition Using Discrete Wavelet Neural Network based on Adaptive Entropy", The Arabian Journal for Science and Engineering, October 2007
  6. Daniel Motlotle Rasetshwane, "Identification of Transient Speech Using Wavelet Transforms", MS, thesis submitted to University of Pittsburgh, 2005.
  7. Robi Polikar, " The Wavelet Tutorial ", 2006.
  8. N. S. Nehe and R. S. Holambe, " DWT and LPC based Feature Extraction Methods for Isolated Word Recognition" , EURASIP Journal on Audio, Speech, and Music Processing, 2012
  9. Sonia Sunny, David Peter S and K Poulose Jacob, Recognition of Speech Signals: An Experimental Comparison of Linear Predictive Coding and Discrete Wavelet Transforms ", Sonia Sunny et al. / International Journal of Engineering Science and Technology (IJEST), April 2012.
  10. Mahmoud I. Abdalla and Hanaa S. Ali, " Wavelet-based Mel – Frequency Cepstral Coefficients for Speaker Identification using Hidden Markov Models ", Journal of telecommunications , March 2010.
  11. G. Madhavilatha , A . Mallaiah, and T. Venkata Lakshmi, "Advanced Speaker Verification System Using Wavelets" International Journal of Engineering Research and applications (IJERA), ISSN: 2248-9622 www. ijera. com Vol. 1, Issue 3, pp. 891-898.
  12. R. J. E. Merry, " Wavelet Theory and Applications " , Alexandria . tue . nl , Eindhoven, June 7 , 2005
  13. " Wavelet Toolbox ", Help Quide, Matlab 2012.
  14. Amruta Anantrao Malode and Shashikant Sahare, " Advanced Speaker Recognition ", International Journal of Advances in Engineering & Technology, July 2012.
  15. " An Automatic Speaker Recognition System ", http://www. ifp. uiuc. edu/~minhdo/teaching/speaker_recognition.
  16. Noelia Alcaraz Meseguer, " Speech Analysis for Automatic Speech Recognition ", Thesis submitted to Norwegian University of Science and Technology Department of Electronics and Telecommunications, 2009.
  17. Meysam Mohamad pour and Fardad Farokhi, " An Advanced Method for Speech Recognition ", World Academy of Science, Engineering and Technology 25 2009.
  18. Veera Ala-Keturi, " Speech Recognition based on Artificial Neural Networks ", Helsinki hnology Institute of Tec, 2004.
  19. Sonia Sunny, David Peter S and K Poulose Jacob, "Discrete Wavelet Transforms and Artificial Neural Networks for Recognition of Isolated Spoken Words ", International Journal of Computer Applications, January 2012.
Index Terms

Computer Science
Information Sciences

Keywords

Speech Recognition Feature Extraction Mel-Frequency Cepstral Coefficients Discrete Wavelet Transforms Neural Networks