Speech Recognition System for Windows Commands

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 21 July 2025

Submit your paper

Know more

The week's pick

FORENSIC ANALYSIS FRAMEWORKS FOR ENCRYPTED CLOUD STORAGE INVESTIGATIONS

Joy Awoleye Sarah Mavire Allan Munyira Kelvin Magora

Random Articles

Impact of using Snowflake Schema and Bitmap Index on Data Warehouse Querying

Jan

2018

Customer Complain Detection in E-commerce Platforms using NLP

Dec

2022

Comparative Analysis of Search Algorithms

Jun

2018

Enhanced HMM Speech Emotion Recognition using SVM and Neural Classifier

February

2014

Reseach Article

Speech Recognition System for Windows Commands

Published on May 2013 by Sumit Patel, Amit Bramhecha, Santosh Mahale, Anant Maind, Mahesh Sanghavi

International Conference on Recent Trends in Engineering and Technology 2013

Foundation of Computer Science USA

ICRTET - Number 5

May 2013

Authors: Sumit Patel, Amit Bramhecha, Santosh Mahale, Anant Maind, Mahesh Sanghavi

Sumit Patel, Amit Bramhecha, Santosh Mahale, Anant Maind, Mahesh Sanghavi . Speech Recognition System for Windows Commands. International Conference on Recent Trends in Engineering and Technology 2013. ICRTET, 5 (May 2013), 26-30.

@article{

author = { Sumit Patel, Amit Bramhecha, Santosh Mahale, Anant Maind, Mahesh Sanghavi },

title = { Speech Recognition System for Windows Commands },

journal = { International Conference on Recent Trends in Engineering and Technology 2013 },

issue_date = { May 2013 },

volume = { ICRTET },

number = { 5 },

month = { May },

year = { 2013 },

issn = 0975-8887,

pages = { 26-30 },

numpages = 5,

url = { /proceedings/icrtet/number5/11795-1360/ },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Proceeding Article

%1 International Conference on Recent Trends in Engineering and Technology 2013

%A Sumit Patel

%A Amit Bramhecha

%A Santosh Mahale

%A Anant Maind

%A Mahesh Sanghavi

%T Speech Recognition System for Windows Commands

%J International Conference on Recent Trends in Engineering and Technology 2013

%@ 0975-8887

%V ICRTET

%N 5

%P 26-30

%D 2013

%I International Journal of Computer Applications

Abstract

To develop a system to recognize system commands through voice and convert it into equivalent text, the system accepts voice commands from user and displays its equivalent text. The system accepts voice commands, performs processing on it to recognize the actual command before displaying the corresponding output. For this particular system processing being done are noise removal, feature extraction and pattern matching. Various features are available. These are totally application dependent i. e. for a particular application particular feature is being extracted. Hence performing this various processing, text format of equivalent voice command is being displayed. To accept the voice commands User Must use a good quality microphone. The voice commands is being recorded and saved as a . wav file. Wav file is being used because it stores the data in the digital form. Initially the features of each command would be saved in a file. Once the 'init' is recognized the system will then wait for the users commands. On getting a command the system will save the input as a . wav file. The features of this command are then matched against the predefined command features. If the match is found the command is a valid one. It then displays its text form. If the command is not valid it simply discards it.

References

Markku Turunen and Jaakko Hakulinen, Design and Development of Speech Interfaces Course Material http://www. cs. uta. fi/hci/spi/ddsi/
Pinker, S. , (1994), the Language Instinct, Harper Collins, New York City, New York, USA.
Deshmukh, N. , Ganapathiraju, A, Picone J. , (1999), Hierarchical Search for Large Vocabulary Conversational Speech Recognition. IEEE Signal Processing Magazine, 1(5):84-107.
Zue, V. , Cole, R. , Ward, W. (1996). Speech Recognition. Survey of the State Of the Art in Human Language Technology. Kauii, Hawaii, USA.
Dix, A. J. , Finlay. Abowd, G. , Beale, R. (1998). Human-Computer Interaction, 2nd edition, Prentice Hall, Englewood Cliffs, NJ, USA.
Rudnicky, A. I. , Lee, K. F. , and Hauptmann, A. G. (1992) Survey of current Speech Technology. Communications of the ACM, 37(3):52-57.
Picheny, M. , (2002). Large vocabulary speech Recognition, 3 5(4):42-50.
Rabiner, L. , R. , and Wilpon, J. G. , (1979). Considerations In applying clustering Techniques to speaker-independent word recognition. Journal of Acoustic Society of America. 66(3):663-673.
Kumar, M. Rajput, N. Verma, A . (2006) IBM Journal of Research and Development, 0018-8646,10. 1147/rd. 485. 0703,Sponsored by: IBM
De Mori, Renato, Lam, Lily, Gilloux, Michel. (1987) Pattern Issue, 0162- 8828, 10. 1109/TPAMI. 1987. 4767902, IEEE Computer Society
Bahl, Lalit R, Jelinek, Frederick, Mercer, Robert L, (2000), IBM T. J. Watson Research Center, Yorktown Heights, NY 10598. PAMI-5 Issue: 2 , IEEE Computer Society
Liu, Y. Jones, H. Vaidya, S. Perrone, (2009). http://research. microsoft. com/pubs/80528/SPM-MINDS-I. pdf
M. Tydlitat, B. Nanda, A. K. (2010), IBM Journal of Research and Development, Issue: 5, 0018-8646, 1147/rd. 515. 0583.
Mengjie, Z. , (2001) Overview of speech Recognition and related machine Learning techniques, Technical report. Retrieved December 10, 2004 from http://www. mcs. vuw. ac. nz/comp/Publications/archive/CS-TR-01/CS-TR- 01-15. Pdf
"Research Developments and Directions in Speech Recognition and Understanding, Part 1" , (2009). http://research. microsoft. com/pubs/80528/SPM- MINDS-I. pdf
Speech Recognition Technologies, (John Kirriemuir, 2003 ). http://www. ceangal. com/
Speech Recognition – Wikipedia http://en. wikipedia. org/wiki/Speech_recognition
Voice Recognition Technology http://cobweb. ecn. purdue. edu/~tanchoco/MHE/ADC-is/Voice/main. shtml
http://www. opendl. net/solutions/recognition. aspx
Casey Brains http://www. scribd. com/doc/6901516/ugSpeechSpeech
Wolfgang Wahlster, Verbmobil: Foundations of Speech-To-SpeechTranslation http://books. google. com/books?hl=en&lr=&id=RiT0aAzeudkC&oi=fnd&pg=PR5&dq=Verbmobil:+Foundations+of+Speech-ToSpeech+Translation&ots=jBhMwQ0HnT&sig=zx2EWMK4n-lYhG9k5gKU2zGieE#PPP1,M1
Roni Rosenfeld, Alexander Rudnicky, Stefanie Tomko, Thomas Harris, Universal Speech Interface project http://www. cs. cmu. edu/~usi/
Wikipedia the Free Encyclopedia – Talkman http://en. wikipedia. org/wiki/Talkman
Talking Windows http://msdn. microsoft. com/da-dk/magazine/cc163663(en-us ,printer). aspx
IBM Research, IBM Text-to-Speech Research http://www. research. ibm. com/tts/
Microsoft Corporation, Windows Speech Recognition http://www. microsoft. com/windows/products/windowsvista/features/details/speechrecognition. mspx
Nuance Communications, Inc. , Nuance – Open Speech Recognizer http://www. nuance. com/recognizer/openspeechrecognizer/
Carnegie Mellon University, Sphinx-4 A Speech Recognizer Written http://cmusphinx. sourceforge. net/sphinx4/

Index Terms

Computer Science

Information Sciences

Keywords

Recognize Feature Extraction Pattern Matching Noise Removal