International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 130 - Number 6 |
Year of Publication: 2015 |
Authors: Sangramsing N.Kayte, Monica Mundada, Charansing Kayte |
10.5120/ijca2015907024 |
Sangramsing N.Kayte, Monica Mundada, Charansing Kayte . Speech Synthesis System for Marathi Accent using FESTVOX. International Journal of Computer Applications. 130, 6 ( November 2015), 38-42. DOI=10.5120/ijca2015907024
A Text To Speech synthesis (TTS) is the production of artificial speech by a machine for the given text as input. This field of study is known both as Speech Synthesis that is the “synthetic” (computer) generation of speech, and Text-To-Speech or TTS. It is the process of converting written text into speech. In the process of speech synthesis, mainly two processing components are used; they are NLP (natural language processing) and DSP (digital signal processing) modules. The speech synthesis has enormous applications such as reading for blind people, telecommunication services, language education, and aid to handicapped persons, talking books and toys, call center automation etc. The main aim of the project is to develop a TTS system producing a voice with Indian accent for the given input text. In this project, for the conversion of text to speech, we use Festival in Linux environment. Festival is a general pre-packaged tool for development of multi-language speech synthesis systems; and it will support most of the languages in the text to speech conversion. In this project, the speech generation process is done by using Festival frame work and speech tools. The voice model is generated by using festvox frame work, festival and speech tools. The required speech data for generating voice is recorded in noise less environment. The voice models can be generated by unit selection or clustergen modules present in festvox. It is observed from the generated voices that clustergen voices are better than unit selection voices.