CFP last date
20 December 2024
Reseach Article

Analysis of Speech Recognition Models for Real Time Captioning and Post Lecture Transcription

Published on February 2015 by Wilny Wilson.p, Sindhu.s
Advanced Computing and Communication Techniques for High Performance Applications
Foundation of Computer Science USA
ICACCTHPA2014 - Number 1
February 2015
Authors: Wilny Wilson.p, Sindhu.s
6e19ee1b-e532-4219-a0e1-e75bf24b0cdf

Wilny Wilson.p, Sindhu.s . Analysis of Speech Recognition Models for Real Time Captioning and Post Lecture Transcription. Advanced Computing and Communication Techniques for High Performance Applications. ICACCTHPA2014, 1 (February 2015), 19-23.

@article{
author = { Wilny Wilson.p, Sindhu.s },
title = { Analysis of Speech Recognition Models for Real Time Captioning and Post Lecture Transcription },
journal = { Advanced Computing and Communication Techniques for High Performance Applications },
issue_date = { February 2015 },
volume = { ICACCTHPA2014 },
number = { 1 },
month = { February },
year = { 2015 },
issn = 0975-8887,
pages = { 19-23 },
numpages = 5,
url = { /proceedings/icaccthpa2014/number1/19431-6006/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 Advanced Computing and Communication Techniques for High Performance Applications
%A Wilny Wilson.p
%A Sindhu.s
%T Analysis of Speech Recognition Models for Real Time Captioning and Post Lecture Transcription
%J Advanced Computing and Communication Techniques for High Performance Applications
%@ 0975-8887
%V ICACCTHPA2014
%N 1
%P 19-23
%D 2015
%I International Journal of Computer Applications
Abstract

Today attempts are made to improve human machine interaction. Automatic speech recognition is widely used for helping hearing impaired and elderly people so that they can watch television shows more effectively. Speech recognition is also known as Automated Speech Recognition (ASR). Different models used for speech recognition include hidden markovian model, dynamic time warping, artificial neural network and acoustic phone model. The two methods of SRmLA i. e. RTC and PLT were beneficial in its own ways. The later method was found to be more advantages in terms of word recognition. Full accessibility for persons who are deaf and hard of hearing requires easy-to-use and pervasive conversion methods for audio information both in academic environments and the workplace. Transcription of audio materials provides one method to solve this access problem.

References
  1. Rohit Ranchal, Teresa Taber-Doughty, Yiren Guo, Keith Bain, Heather Martin, J. Paul Robinson, Bradley S. Duerstock. 2013. Using Speech Recognition for Real-Time Captioning and Lecture Transcription in the Classroom. IEEE Transactions on Learning Technologies.
  2. M. Wald, K. Bain. 2008. Universal access to communication and learning: the role of automatic speech recognition. Universal Access in the Information Society, vol. 6, no. 4, 435-447.
  3. K. Hadjikakou, V. Polycarpou, A. Hadjili . 2010. The Experiences of Students with Mobility Disabilities in Cypriot Higher Education institutions: Listening to Their Voices. International Journal of Disability, Development and Education, vol. 57, no. 4, 403-426.
  4. M. Wald, G. Wills, D. Millard, L. Gilbert, S. Khoja, J. Kajaba, and Y. Li. 2009. Synchronised Annotation of Multimedia. IEEE International Conference on Advanced Learning Technologies, 594-596.
  5. A. L. Buchsbaum and R. Giancarlo. Algorithmic Aspects in Speech Recognition: An Introduction, Association for Computing Machinery, Inc. , 1515 Broadway, New York, NY 10036,USA, Tel: (212), 869-7440.
  6. Jean Hennebert, Martin Hasler and Hervé Dedieu. Neural Networks In Speech Recognition. Department of Electrical Engineering Swiss Federal Institute of Technology.
  7. Khalid A. Darabkh, Ala F. Khalifeh, Baraa A. Bathech and Saed W. Sabah. Efficient DTW-Based Speech Recognition System for Isolated Words of Arabic Language. ,World Academy of Science, Engineering and Technology Vol:7 2013-05-25.
  8. http://www. speech-to-text. eu/
  9. Ashwini B V and Laxmi B Rananavare. Enhancement of Learning using Speech Recognition and Lecture Transcription: A Survey, International Journal of Computer Applications (0975 – 8887).
  10. Mohamad Adnan Al-Alaoui, Lina Al-Kanj, Jimmy Azar, and Elias Yaacoub. Speech Recognition using Artificial Neural Networks and Hidden Markov Models, Ieee multidisciplinary engineering education magazine, vol. 3, no. 3, September 2008.
  11. http://www2. dc. uba. ar/materias/rn/Aplicaciones/Perceptron/asr-hmm-ann. pdf.
  12. Toru Imai, Shinichi Homma, Akio Kobayashi, Shoei Sato, Tohru Takagi,Kyouichi Saitou, and Satoshi Hara. Real-Time Closed-Captioning Using Speech Recognition.
Index Terms

Computer Science
Information Sciences

Keywords

Automated Speech Recognition Real Time Captioning Post Lecture Transcription Speech Recognition Mediated Language Acquisition.