International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 39 - Number 16 |
Year of Publication: 2012 |
Authors: Rajeswari K C, Uma Maheswari P |
10.5120/4902-7399 |
Rajeswari K C, Uma Maheswari P . Prosody Modeling Techniques for Text-to-Speech Synthesis Systems ñ A Survey. International Journal of Computer Applications. 39, 16 ( February 2012), 8-11. DOI=10.5120/4902-7399
This paper presents a study on prosody modeling for speech synthesis. Any Text to Speech system comprises of two phases. One is text analysis and second is speech synthesis. The task of text analysis is to find the words and the task of speech synthesis is to generate the speech. To attain this, different models are available such as text as language models, grapheme to phoneme models, full linguistic analysis model and complete prosody generation model. In complete prosody generation model, the quantities like phrasing, stress and the like are determined to generate naturalness bearing synthetic voice. Towards generating such a speech, an explicit prosodic model is required. This makes the speech more understandable. Many researches have been done in this stream, but still better solution is required. In this paper, the strength and weaknesses of different approaches of prosody models are discussed.