International Conference on Innovation in Communication, Information and Computing 2013 |
Foundation of Computer Science USA |
ICICIC2013 - Number 3 |
June 2013 |
Authors: T.senthil Kumar, Vishnu Gajendran, Rajamanuri Harshad, Sneha Aswani, Deepti Sankara Narayanan |
d04338fb-c209-417a-ab23-6e2421a2f3ef |
T.senthil Kumar, Vishnu Gajendran, Rajamanuri Harshad, Sneha Aswani, Deepti Sankara Narayanan . Mediscript Mobile Cloud Collabrative Speech Recognition Framework. International Conference on Innovation in Communication, Information and Computing 2013. ICICIC2013, 3 (June 2013), 36-45.
Speech recognition is a vital part in medical transciription. The existing speech recognition systems, that run as standalone desktop applications, fall short in many cases due to low accuracy rates and high processing time. The bottleneck in these systems, is the lack of computation power (in terms of processing power and memory) made accessible to them. This paper proposes a mobile-cloud collaborative approach for the automation of speech to text conversion. The model proposed leverages the power of cloud computing and the ubiquitous nature of mobile computing. Computing resthisces can be scaled up/down in the cloud (Elastic Computing) depending on the usage of the system. This kind of speech recognition framework has many real time applications such as IVR systems, Medical Transcription systems, Railway Enquiries, Jthisnalism, Interactive User Interfaces, etc. A generic framework is advantageous, because the speech models in the Automatic Speech Recognizer (ASR) could be trained according to the specific domain required, allowing wide usability. The proposed speech framework is used for medical transcription process. Medical transcription process involves a medical transcriptionist who listens to the recorded speech of a doctor and manually types a transcript file. This process is automated by using the proposed speech framework. With this system, the work of the medical transcriptionist is reduced to error checking in the auto generated transcript file. The entire model is developed for a mobile cloud environment considering the characteristics of cloud delivery models.