International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 134 - Number 13 |
Year of Publication: 2016 |
Authors: Sangramsing N. Kayte, Monica Mundada |
10.5120/ijca2016907882 |
Sangramsing N. Kayte, Monica Mundada . Extraction of Speech Parameters from Speech Database using Festival. International Journal of Computer Applications. 134, 13 ( January 2016), 5-10. DOI=10.5120/ijca2016907882
Speech synthesis is the process of production of artificial speech. The system used for generation of speech from text is called as text-to-speech (TTS) system. In TTS system, text and voice models for a particular language or multiple languages are given as input to the system, which generates speech as output corresponding to the provided voice models. Speech synthesis systems can be extremely useful to people who are visually challenged, visually impaired and illiterate to get into the mainstream society. More recent applications include spoken dialogue systems and communicative robots. HMM (Hidden Markov Model) based Speech synthesis is the emerging technology for TTS. HMM based speech synthesis system consists of training phase and synthesis phase. In the training part, phone and excitation parameters are extracted from speech database and modeled by context dependent HMMs. In synthesis part, the system will extract the suitable phone and excitation parameters from the previously trained models and generates the speech. The main objective of this project is to build an HMM based speech synthesis system. In the training process, the system uses HTK (Hidden Markov Model Tool Kit) and SPTK (Signal Processing Tool Kit) developed at Cambridge University and Tokyo Institute of Technology respectively. Synthesis part is be done by „Festival‟. Festival is a speech synthesis tool for the generation of speech and it is language independent which is developed at the University of Edinburgh. The main advantage of this approach is its flexibility in changing speaker identities, emotions and speaking styles.