CFP last date
20 December 2024
Reseach Article

Implementation of Voice User Interface using Speech Recognition

by Pranali Joshi, Ravi Patki
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 124 - Number 10
Year of Publication: 2015
Authors: Pranali Joshi, Ravi Patki
10.5120/ijca2015905633

Pranali Joshi, Ravi Patki . Implementation of Voice User Interface using Speech Recognition. International Journal of Computer Applications. 124, 10 ( August 2015), 37-42. DOI=10.5120/ijca2015905633

@article{ 10.5120/ijca2015905633,
author = { Pranali Joshi, Ravi Patki },
title = { Implementation of Voice User Interface using Speech Recognition },
journal = { International Journal of Computer Applications },
issue_date = { August 2015 },
volume = { 124 },
number = { 10 },
month = { August },
year = { 2015 },
issn = { 0975-8887 },
pages = { 37-42 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume124/number10/22143-2015905633/ },
doi = { 10.5120/ijca2015905633 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T23:14:05.934128+05:30
%A Pranali Joshi
%A Ravi Patki
%T Implementation of Voice User Interface using Speech Recognition
%J International Journal of Computer Applications
%@ 0975-8887
%V 124
%N 10
%P 37-42
%D 2015
%I Foundation of Computer Science (FCS), NY, USA
Abstract

A voice–user interface (VUI) makes human interaction with computers possible through a voice/speech platform in order to initiate an automated service or process. A VUI is the interface to any speech application. Controlling a machine by simply talking to it was science fiction only a short time ago. Until recently, this area was considered to be artificial intelligence. However, with advances in technology, VUIs have become more commonplace, and people are taking advantage of the value that these hands-free, eyes-free interfaces provide in many situations. The system will be implemented by using techniques such as speech and language processing, human language technology, natural language processing, computational linguistics, and speech recognition and synthesis. The goal of this new field is to get computers to perform useful tasks involving human language, tasks like enabling human-machine communication, improving human-human communication, or simply doing useful processing of text or speech. These voice applications can provide intrinsically comfortable, easy-to-use, and efficient way for users to interact with computer. And as user can use commands in his mother tongue so it is asking your friend computer to do something for you. As a technology for expression, voice works for a much wider range of people than typing, drawing, or gesture because it is a natural part of human existence. Without a great deal of training, normal human beings can express themselves in a wide variety of domains using voice applications, and thus this breadth of application will be a powerful tool in a ubiquitous environment.

References
  1. James R.Evan, Wayne A.Tjoland: Achieving a hand free computer Interface using voice recognition and speech synthesis(2000)
  2. Mukherjee, R.: Text dependent speaker recognition using shifted MFCC (2012)
  3. Robert Keefer, Yan Liu, and Nikolaos Bourbakis: The Development and Evaluation of an Eyes-Free Interaction Model for Mobile Reading Devices (JAN 2013 IEEE)
  4. Vicente P. Minotto, Carlos B. O.Lopes: Audiovisual Voice Activity Detection Based on Microphone Arrays and Color Information (FEB 2013 IEEE)
  5. Xueliang Huo, Hangue Park: A Dual-Mode Human Computer Interface Combining Speech and Tongue Motion for People with Severe Disabilities (NOV 2013 IEEE)
  6. Kwang-Ho Kim, Donghyun Lee: Self -Improvement of Voice Interface with User-input Spoken Query at Early Stage of Commercialization (NOV 2013 IEEE)
  7. A.Esposito, G. Pelosi : Building The Next Generation Of Personal Digital Assistants(2013)
  8. Zhizheng Wu, Xiong Xiao: Synthetic speech detection using temporal modulation Feature(2013)
  9. Tredinnick, R.: Poster: Say it to see it: A speech based immersive model retrieval system(2013)
  10. Vicente P. Minotto, Claudio R.Jung: Simultaneous-Speaker Voice Activity Detection and Localization Using Mid-Fusion of SVM and HMMs (JUNE 2014 IEEE)
  11. Gemmeke, J.F. : The self-taught vocal interface (2014)
Index Terms

Computer Science
Information Sciences

Keywords

Voice Recognition Speech Synthesis Digitization Acoustic Model Speech Engine.