We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

Digit Recognition based on Euclidean and DTW

by Sreeja Nair, Milind Shah
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 125 - Number 6
Year of Publication: 2015
Authors: Sreeja Nair, Milind Shah
10.5120/ijca2015905923

Sreeja Nair, Milind Shah . Digit Recognition based on Euclidean and DTW. International Journal of Computer Applications. 125, 6 ( September 2015), 15-18. DOI=10.5120/ijca2015905923

@article{ 10.5120/ijca2015905923,
author = { Sreeja Nair, Milind Shah },
title = { Digit Recognition based on Euclidean and DTW },
journal = { International Journal of Computer Applications },
issue_date = { September 2015 },
volume = { 125 },
number = { 6 },
month = { September },
year = { 2015 },
issn = { 0975-8887 },
pages = { 15-18 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume125/number6/22435-2015905923/ },
doi = { 10.5120/ijca2015905923 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:15:18.554648+05:30
%A Sreeja Nair
%A Milind Shah
%T Digit Recognition based on Euclidean and DTW
%J International Journal of Computer Applications
%@ 0975-8887
%V 125
%N 6
%P 15-18
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper describes the implementation of two isolated digit recognition techniques and is a comparison between the algorithms implemented. Any digit recognition comprises of mainly two stages feature extraction and similarity evaluation. Here, two feature extraction techniques, namely linear predictive cepstral coefficients (LPCC) and mel frequency cepstral coefficients (MFCC) are implemented and the similarity evaluation is done using Euclidean distance and Dynamic Time Warping (DTW). In DTW both single and averaged template matching is done. The results obtained for these algorithms are perused, compared and conclusions are drawn.

References
  1. L. R. Rabiner and M. R. Sambur, “Some Preliminary Experiments in the Recognition of Connected Digits,” IEEE Trans. on Acoust. Speech and Signal Processing, vol. ASSP-24, no. 2, pp. 170-182, April 1976.
  2. L. R. Rabiner, B. Juang and B. Yegnanarayana, “Fundamentals of Speech Recognition,” 5th ed. Pearson, 2011.
  3. D. O'Shaughnessy, Speech Communications: Human & Machine, 2nd ed. Wiley-IEEE Press, 1999, pp. 367-435.
  4. B. S. Atal, “Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification,” J. Acoust. Soc. Am., vol. 55, no. 6, pp. 1304-1312, 1974.
  5. A. A. M. Abushariah, T. S. Gunawan, O. O. Khalifa and M. A. M. Abushariah., “English Digits Speech Recognition Based on Hidden Markov Models,” in International Conference on Computer and Communication Eng., Kuala Lumpur, Malaysia, May 2010.
  6. A. Revathi and Y. Venkataramani, “Speaker Independent Continuous Speech and Isolated Digit Recognition using VQ and HMM,” Proc. IEEE, pp. 198-202, 2011.
  7. L. R. Rabiner and M.R. Sambur, “An Algorithm for Determining the Endpoints of Isolated Utterances” Bell Syst. Tech. J., vol. 24, no. 2,pp. 297-315, 1975.
  8. S. Savitha, “DSP Implementation of Isolated Digit Recognizer,” M.Tech Dissertation, Dept. Elect. Eng., IIT, Bombay, India, 2008.
  9. A. S. Thakur and N. Sahayam, “Speech Recognition Using Euclidean Distance,” International Journal of Emerging Technology and Advanced Engineering, vol. 3, no. 3, pp. 587-590, 2013.
  10. G. Nitin, “Implementation of Algorithms for Speaker Dependent Isolated Digit Recognition,” M.Tech Dissertation, Dept. Elect. Eng., IIT, Bombay, India, 1997
  11. C. P. Lim, S. C. Woo, A. S. Loh and R. Osman, “Speech Recognition Using Artificial Neural Networks,” Proc. IEEE, Malaysia, 2000, pp. 419-423.
  12. L. R. Rabiner and R. W. Schafer, “Digital Speech Processing for Man-Machine Communication by Voice ” in Digital processing of Speech Signals, 3rd ed. Pearson Education,2009, pp. 505-516.
  13. L. R. Rabiner and C. E. Schmidt, “Application of Dynamic Time Warping to Connected Digit Recognition,” IEEE Trans. ASSP, vol. -28, no. 4, pp. 377-388, Aug. 1980.
  14. H. Sakoe and S. Chiba, “Dynamic Programming Algorithm Optimization for spoken word recognition” IEEE Trans. ASSP, vol. 26, pp. 43-49, 1978.
  15. W. H. Abdulla, D. Chow and G. Sin, “ Cross-words Reference Template for DTW-based Speech Recognition Systems,” TENCON 2003,Conference on Convergent Technologies for Asia-Pacific Region, vol. 4, Oct. 2003, pp.1576 - 1579.
  16. L. R. Rabiner and S. E. Levinson, “Isolated and Connected word recognition- theory and application,” IEEE Trans. Commun.,vol. 29, no. 5, pp. 621-658, 1981.
  17. L. Jalan and T. Palav, “Speech Recognition Based Learning System,” International Journal of Engineering Trends and Technology, vol. 4, no. 2, pp. 165-169, 2013.
Index Terms

Computer Science
Information Sciences

Keywords

Digit recognition linear predictive cepstral coefficients mel frequency cepstral coefficients euclidean distance dynamic time warping.