International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 136 - Number 3 |
Year of Publication: 2016 |
Authors: Rubeena A. Khan, J. S. Chitode |
10.5120/ijca2016907992 |
Rubeena A. Khan, J. S. Chitode . Concatenative Speech Synthesis: A Review. International Journal of Computer Applications. 136, 3 ( February 2016), 1-6. DOI=10.5120/ijca2016907992
The primary objective of this paper is to provide an overview of existing Concatenative Text-To-Speech synthesis techniques. Concatenative speech synthesis can be broadly categorized into three categories, Diphone Based, Corpus based and Hybrid. Diphone based speech synthesis relies on different signal processing techniques such as PSOLA, FD-PSOLA etc. These signal processing techniques introduce unwanted artifacts in the synthesized speech. The most popularly used method is the Unit selection synthesis which is a corpus based synthesis method. This method produces the most natural sounding synthetic speech.