Automatic Speech Recognition: A Review

Shipra J. Arora; Rishi Pal Singh

Research Article

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 60 - Number 9

Year of Publication: 2012

DOI: 10.5120/9722-4190

Shipra J. Arora, Rishi Pal Singh. Automatic Speech Recognition: A Review. International Journal of Computer Applications. 60, 9 (December 2012), 34-44. DOI=10.5120/9722-4190

@article{ 10.5120/9722-4190,
author = { Shipra J. Arora and Rishi Pal Singh },
title = { Automatic Speech Recognition: A Review },
journal = { International Journal of Computer Applications },
issue_date = { December 2012 },
volume = { 60 },
number = { 9 },
month = { December },
year = { 2012 },
issn = { 0975-8887 },
pages = { 34-44 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume60/number9/9722-4190/ },
doi = { 10.5120/9722-4190 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}

%0 Journal Article

%1 2024-02-06T21:06:07.811869+05:30

%A Shipra J. Arora

%A Rishi Pal Singh

%T Automatic Speech Recognition: A Review

%J International Journal of Computer Applications

%@ 0975-8887

%V 60

%N 9

%P 34-44

%D 2012

%I Foundation of Computer Science (FCS), NY, USA

Abstract

This paper presents a literature review of Automatic Speech Recognition (ASR). It discusses the advances of past years to show the progress that has been accomplished in this area of research. One of the important challenges for researchers is ASR accuracy. The review covers the difficulties of ASR, the basic building blocks of speech processing, feature extraction, speech recognition, and performance evaluation. Its main objectives are to bring to light the progress made in ASR for different languages and the technological viewpoint on ASR in different countries, to compare and contrast the techniques used in the various stages of speech recognition, and to identify research topics in this challenging field. We do not present exhaustive descriptions of systems or mathematical formulations; rather, we present distinctive and novel features of selected systems and their relative merits and demerits.

Index Terms

Computer Science

Information Sciences

Keywords

Automatic speech recognition, Language Model, Speech Processing, Database, Pattern Recognition, Hidden Markov Model