Research Article

Analysis of Various Features using Different Temporal Derivatives from Speech Signals

by Muskan, Naveen Aggarwal
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 118 - Number 8
Year of Publication: 2015
Authors: Muskan, Naveen Aggarwal
10.5120/20762-3191

Muskan, Naveen Aggarwal. Analysis of Various Features using Different Temporal Derivatives from Speech Signals. International Journal of Computer Applications. 118, 8 (May 2015), 1-9. DOI=10.5120/20762-3191

@article{ 10.5120/20762-3191,
author = { Muskan, Naveen Aggarwal },
title = { Analysis of Various Features using Different Temporal Derivatives from Speech Signals },
journal = { International Journal of Computer Applications },
issue_date = { May 2015 },
volume = { 118 },
number = { 8 },
month = { May },
year = { 2015 },
issn = { 0975-8887 },
pages = { 1-9 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume118/number8/20762-3191/ },
doi = { 10.5120/20762-3191 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:01:06.255976+05:30
%A Muskan
%A Naveen Aggarwal
%T Analysis of Various Features using Different Temporal Derivatives from Speech Signals
%J International Journal of Computer Applications
%@ 0975-8887
%V 118
%N 8
%P 1-9
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Speech recognition is an emerging field and an active area of research. Work on speech recognition for many languages is at its peak, yet comparatively little has been done for Indian languages, and for Punjabi in particular. In this paper, Punjabi speech is analyzed by extracting various features, together with different temporal derivatives, using feature extraction techniques. The dataset used for this work is a set of isolated Punjabi digits recorded as a 24-bit, 44100 Hz, mono PCM signal. The features are compared in terms of range and accuracy, and acceptable results are determined using a hidden Markov model (HMM).
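
The abstract does not say how the temporal derivatives are computed. As a rough illustration only, the sketch below uses the regression-style delta formula that is common in speech front ends, d_t = sum_k k (c_{t+k} - c_{t-k}) / (2 sum_k k^2), with the first and last frames repeated at the edges. The function names `delta` and `add_derivatives`, the window of 2 frames, and the 13-coefficient example are assumptions for illustration, not settings taken from the paper.

```python
def delta(frames, window=2):
    """Regression-based temporal derivative of a feature sequence.

    `frames` is a list of equal-length feature vectors, one per frame.
    The window size is an assumed value, not one reported in the paper.
    """
    if not frames:
        return []
    n, dim = len(frames), len(frames[0])
    denom = 2.0 * sum(k * k for k in range(1, window + 1))
    out = []
    for t in range(n):
        row = [0.0] * dim
        for k in range(1, window + 1):
            # Clamp indices so edge frames reuse the first/last frame.
            ahead = frames[min(t + k, n - 1)]
            behind = frames[max(t - k, 0)]
            for j in range(dim):
                row[j] += k * (ahead[j] - behind[j])
        out.append([v / denom for v in row])
    return out


def add_derivatives(static, order=2, window=2):
    """Append derivatives up to `order` to each static feature vector."""
    streams = [static]
    for _ in range(order):
        # Each higher derivative is the delta of the previous stream.
        streams.append(delta(streams[-1], window))
    combined = []
    for t in range(len(static)):
        row = []
        for stream in streams:
            row.extend(stream[t])
        combined.append(row)
    return combined


if __name__ == "__main__":
    # Hypothetical input: 5 frames of 13 static coefficients each.
    static = [[float(t + j) for j in range(13)] for t in range(5)]
    with_derivs = add_derivatives(static, order=2)
    print(len(with_derivs), len(with_derivs[0]))
```

With `order=1` each frame gains first derivatives only; `order=2` also appends second derivatives (accelerations), so the vector length triples relative to the static features.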

References
  1. L. Rabiner and R. Schafer, "Introduction to digital speech processing", Foundations and Trends in Signal Processing, Journal of ACM vol. 1, no. 1-2, pp. 1–194, 2007.
  2. L. Rabiner and B. H. Juang, Fundamentals of Speech Recognition, Englewood Cliffs, NJ: Prentice-Hall, 1993.
  3. X. Huang, J. Baker and R Reddy, "A Historical Perspective of Speech Recognition", Communications of the ACM, vol. 57, no. 1, January 2014.
  4. K. H. Davis, R. Biddulph and S. Balashek, "Automatic recognition of spoken digits", J. A. S. A., vol. 24, no. 6, pp. 637-642, 1952.
  5. S. C. Sajjan and C. Vijaya, "Comparison of DTW and HMM for isolated word recognition", Proceedings of International Conference on Pattern Recognition, Informatics and Medical Engineering (PRIME), IEEE, pp. 466-470, 2012.
  6. H Sakoe and S Chiba, "Dynamic Programming Algorithm Optimization for Spoken Word Recognition", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-26, no. 1, February 1978.
  7. L R Rabiner, A E Rosenberg, S E Levinson and J G Wilpon, "Speaker-Independent Recognition of Isolated Words Using Clustering Techniques", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-27, no. 4, August 1979.
  8. L. R. Rabiner and M. R. Sambur, "An algorithm for determining the endpoints of isolated utterances", The Bell System Technical Journal, pp. 297-315, 1975.
  9. L R Rabiner, A E Rosenberg, L F Lamel and J G Wilpon, "An Improved Endpoint Detector for Isolated Word Recognition", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-29, no. 4, 1981.
  10. M A Bush, G E Kopec and N Lauritzen, "Segmentation in Isolated Word Recognition Using Vector Quantization", Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '84, vol. 9, 1984
  11. L R Rabiner and B H Juang, "An Introduction to Hidden Markov Models", IEEE ASSP Magazine, pp. 4-16, January 1986.
  12. L R Rabiner and B H Juang, "Hidden Markov Models for Speech Recognition", Technometrics, vol. 33, no. 3, 1991.
  13. S B Davis and P Mermelstein, "Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences", IEEE Transactions on Acoustics, Speech, and Signal Processing, vol. ASSP-28, no. 4, 1980.
  14. Wei Han, Cheong-Fat Chan, Chiu-Sing Choy and Kong-Pang Pun, "An Efficient MFCC Extraction Method in Speech Recognition", Proceedings of IEEE, ISCAS 2006, 2006
  15. H A Patil and T K Basu, "Development of speech corpora for speaker recognition research and evaluation in Indian languages", IJST 2008, Springer, 2008
  16. S Ranjan, "A Discrete Wavelet Transform Based Approach to Hindi Speech Recognition", Proceedings of International Conference on Signal Acquisition and Processing, IEEE, pp. 345-348, 2010.
  17. K S Rao, "Application of prosody models for developing speech systems in Indian languages", IJST 2011, Springer, 2011.
  18. I Bhardwaj and N D Londhe, "Hidden Markov Model Based Isolated Hindi Word Recognition", Proceedings of 2nd International Conference on Power, Control and Embedded Systems, IEEE, 2012.
  19. T Pruthi, S Saksena and P K Das, "Swaranjali: Isolated Word Recognition for Hindi Language using VQ and HMM", International Conference on Multimedia Processing and Systems (ICMPS), IIT Madras.
  20. S Tripathy, N Baranwal and G C Nandi, "A MFCC based Hindi Speech Recognition Technique using HTK Toolkit", Proceedings of the 2013 IEEE Second International Conference on Image Information Processing (ICIIP-2013), pp. 539-544, 2013.
  21. A Sharma and A Kaur, "Automatic Segmentation of Punjabi Speech into Syllable-Like Units using Group Delay: A Review", Proceedings of International Journal of Computer Science & Engineering Technology (IJCSET), vol 4, no 6, 2013.
  22. R. Kumar, "Comparison of HMM and DTW for isolated word recognition system for punjabi language", Proceedings of IJSC, vol 5, no. 3, pp. 88-92, 2010
  23. Gurpreet Kaur, Parminder Singh and Amandeep Kaur, "Syllable Boundary Detection System for Punjabi Language", Proceedings of International Journal of Applied Research in Computing, vol. 1, no. 2, July 2013.
  24. Ramana Rao G. V. and Srichand J, "Word boundary detection using pitch variations", Proceedings of Fourth International Conference on Spoken Language, 1996. ICSLP 96, pp. 813-816, May 1996.
  25. Wiqas Ghai and Navdeep Singh, "Continuous Speech recognition for Punjabi Language", International Journal of Computer Applications, vol. 72, no. 14, May 2013.
  26. J. Psutka, L. Muller and J. V. Psutka, "Comparison of MFCC and PLP Parametrizations in Speaker Independent continuous speech recognition task", Eurospeech 2001, Scandinavia.
  27. A. M. Toh, R. Togneri and S. Nordholm, "Investigating robust features for speech recognition in hostile environment", Asia Pacific Conference on Communication IEEE, October 2005.
  28. H. Manabe and Z. Zhang, "Multi-stream HMM for EMG-Based Speech Recognition", Multimedia Laboratories, NTT Docomo, Kanagawa, Japan.
  29. Muskan and Naveen Aggarwal, "Punjabi Speech Recognition: A Survey", Proceedings of ICAET, May 2014.
  30. A. N. Mishra, M Chandra, A Biswas, S. N. Sharan, "Robust features for connected Hindi digits recognition", International Journal of Signal Processing, Image Processing and Pattern Recognition, Vol. 4, No. 2, June, 2011
  31. K. M Krishna, M V Lakshmi and S. Sathiya Lakshmi, "Feature Extraction and Dimensionality Reduction using IPS for Isolated Tamil Words Speech Recognizer", International Journal of Advanced Research in Computer and Communication Engineering, Vol. 3, Issue 3, March 2014.
  32. M Alsulaiman, G Muhammad and Z Ali, "Comparison of Voice Features for Arabic Speech Recognition", IEEE, 2011.
  33. V Tiwari, "MFCC and its applications in speaker recognition", Proceedings of IJET, 2010.
  34. Bassam A. Q. Al-Qatab and Raja N. Ainon, "Arabic speech recognition using Hidden Markov Model toolkit (HTK)", IEEE, 2010.
  35. M Yanzhou and Y Mianzhu, "Russian Speech Recognition System Design Based on HMM", Proceedings of LEMCS, 2014.
  36. J Kaur, Nidhi, R Kaur, "Issues involved in speech-to-text conversion", International Journal Of Computational Engineering Research, Vol. 2, Issue No. 2, Page No. 512-515, Mar-Apr 2012
  37. S R Mankala, S R Bojja, V S Ramaiah and R. Rajeswara Rao, "Automatic speech processing using HTK for Telugu language", International Journal of Advances in Engineering & Technology, Jan. 2014.
  38. A Kumar, M Dua and T Chaudhary, "Continuous Hindi Speech Recognition using Monophone based Acoustic Modeling", International Journal of Computer Applications, 2014.
  39. K. Murali Krishna, M. Vanitha Lakshmi and S. Sathiya Lakshmi, "Feature Extraction and Dimensionality Reduction using IPS for Isolated Tamil Words Speech Recognizer", International Journal of Advanced Research in Computer and Communication Engineering, Vol. 3, Issue 3, March 2014.
  40. S Young, "The HTK Book", Cambridge University Engineering Department.
  41. E Vozarikova, J Juhar and A Cizmar, "Dual Shots Detection", Information and Communication technologies and services, Vol. 10, Issue. 4, 2012.
  42. N H Quang, T V Loan, LE The Dat, "Automatic Speech Recognition for Vietnamese using HTK System", IEEE, 2010.
Index Terms

Computer Science
Information Sciences

Keywords

Speech Recognition, MFCC, PLP, LPC, FBank, Melspec