2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013)
Foundation of Computer Science USA
NCIPET - Number 14
March 2012
Authors: Swati D. Bhutekar, M. B. Chandak
Swati D. Bhutekar, M. B. Chandak. Designing and Recording Emotional Speech Databases. 2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013). NCIPET, 14 (March 2012), 4-10.
This paper describes the factors involved in designing and recording large speech databases for applications that require speech synthesis. Given the growing demand for customized and domain-specific voices in corpus-based synthesis systems, good practices should be established for creating these databases, which are a key factor in the quality of the resulting speech synthesizer. The paper focuses on the factors affecting the design of the recording prompts, the speaker selection procedure, the recording setup, and the quality control of the resulting database. One way to detect emotion in speech is as follows: once the user's speech has been recorded, it is converted into text; at the same time, the stressed word in the speech is identified, and the frequency of that word is measured to determine the corresponding emotion.
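
The last step of the abstract (find the stressed word, then measure its frequency) could be sketched as below. This is only an illustration under stated assumptions, not the authors' implementation: the abstract does not say how stress is detected or how frequency is measured, so the sketch assumes the stressed word is the one with the highest RMS energy and estimates its fundamental frequency with a plain autocorrelation search. The names `pick_stressed_word`, `estimate_f0`, and the `(word, start, end)` span format are hypothetical; the spans are assumed to come from whatever speech-to-text step supplies word boundaries.

```python
import math


def rms_energy(samples):
    """Root-mean-square energy of a list of audio samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def pick_stressed_word(samples, word_spans):
    """Return the (word, start, end) span with the highest RMS energy.

    Assumption: loudness stands in for stress; the source does not
    specify the stress criterion.
    """
    return max(word_spans, key=lambda w: rms_energy(samples[w[1]:w[2]]))


def estimate_f0(samples, sample_rate, f_min=75.0, f_max=400.0):
    """Estimate fundamental frequency (Hz) by autocorrelation.

    Searches lags corresponding to f_max..f_min (hypothetical defaults
    covering a typical speaking range) and returns the frequency of the
    lag with the largest positive autocorrelation, or None if no lag
    scores above zero.
    """
    n = len(samples)
    lag_min = int(sample_rate / f_max)
    lag_max = min(int(sample_rate / f_min), n - 1)
    best_lag, best_score = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        score = sum(samples[i] * samples[i + lag] for i in range(n - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag if best_lag is not None else None


def stressed_word_pitch(samples, sample_rate, word_spans):
    """Return (word, estimated F0 in Hz) for the most energetic word."""
    word, start, end = pick_stressed_word(samples, word_spans)
    return word, estimate_f0(samples[start:end], sample_rate)
```

Mapping the resulting frequency to an emotion label is left out on purpose: the abstract gives no thresholds or classifier, so any such rule would have to be derived from the recorded database itself.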