Research Article

Speech Synthesis - Automatic Segmentation

by Poonam Bansal, Amita Pradhan, Ankita Goyal, Astha Sharma, Mona Arora
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 98 - Number 4
Year of Publication: 2014
Authors: Poonam Bansal, Amita Pradhan, Ankita Goyal, Astha Sharma, Mona Arora
10.5120/17172-7253

Poonam Bansal, Amita Pradhan, Ankita Goyal, Astha Sharma, Mona Arora. Speech Synthesis - Automatic Segmentation. International Journal of Computer Applications. 98, 4 (July 2014), 29-31. DOI=10.5120/17172-7253

@article{ 10.5120/17172-7253,
author = { Poonam Bansal and Amita Pradhan and Ankita Goyal and Astha Sharma and Mona Arora },
title = { Speech Synthesis - Automatic Segmentation },
journal = { International Journal of Computer Applications },
issue_date = { July 2014 },
volume = { 98 },
number = { 4 },
month = { July },
year = { 2014 },
issn = { 0975-8887 },
pages = { 29-31 },
numpages = {3},
url = { https://ijcaonline.org/archives/volume98/number4/17172-7253/ },
doi = { 10.5120/17172-7253 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:25:20.010611+05:30
%A Poonam Bansal
%A Amita Pradhan
%A Ankita Goyal
%A Astha Sharma
%A Mona Arora
%T Speech Synthesis - Automatic Segmentation
%J International Journal of Computer Applications
%@ 0975-8887
%V 98
%N 4
%P 29-31
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In this paper, after a review of previous work in this field, the most frequently used approach, based on the Hidden Markov Model (HMM), is implemented for phonetic segmentation. A baseline HMM phonetic segmentation tool is used to segment and analyze speech at the phonetic level. The results are approximately the same as those obtained with manual segmentation.
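
As an illustration of the general idea behind HMM-based phonetic segmentation, the sketch below runs a Viterbi forced alignment over a known phone sequence. It is not the tool used in the paper. It assumes one state per phone, a fixed self-loop probability, and per-frame log-likelihoods that are already computed. The function name `forced_align` and the parameter `stay_logp` are hypothetical and chosen only for this example.

```python
import numpy as np


def forced_align(log_lik, stay_logp=np.log(0.5)):
    """Viterbi forced alignment over a left-to-right chain of phones.

    log_lik[t, j] is the log-likelihood of frame t under phone j, with the
    phones listed in the order given by the known transcription. Each phone
    is modelled as a single state (a simplifying assumption) that either
    repeats or advances to the next phone.

    Returns a list with the start frame of each phone.
    """
    n_frames, n_phones = log_lik.shape
    if n_frames < n_phones:
        raise ValueError("need at least one frame per phone")
    move_logp = np.log1p(-np.exp(stay_logp))  # log(1 - P(stay))

    score = np.full((n_frames, n_phones), -np.inf)
    advanced = np.zeros((n_frames, n_phones), dtype=bool)
    score[0, 0] = log_lik[0, 0]  # the path must begin in the first phone

    for t in range(1, n_frames):
        for j in range(n_phones):
            stay = score[t - 1, j] + stay_logp
            move = score[t - 1, j - 1] + move_logp if j > 0 else -np.inf
            advanced[t, j] = move > stay
            score[t, j] = max(stay, move) + log_lik[t, j]

    # Backtrace from the last phone at the last frame; every "advance"
    # marks the frame at which a phone begins.
    starts = [0] * n_phones
    j = n_phones - 1
    for t in range(n_frames - 1, 0, -1):
        if advanced[t, j]:
            starts[j] = t
            j -= 1
    return starts


# Toy input: 6 frames, 3 phones, each frame clearly favouring one phone.
toy = np.full((6, 3), -10.0)
toy[0:2, 0] = 0.0
toy[2:5, 1] = 0.0
toy[5, 2] = 0.0
boundaries = forced_align(toy)
```

In a real system the log-likelihoods would come from acoustic models evaluated on features such as MFCCs, and each phone would typically have several states. The boundary frames returned here would then be converted to times using the frame shift.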

References
  1. Automatic Phonetic Segmentation, D. T. Toledano, L. A. H. Gómez, and L. V. Grande, IEEE Transactions on Speech and Audio Processing, Vol. 11, No. 6, Nov 2003.
  2. The HTK Book, S. Young, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, Version 2.1: Cambridge University, 1997.
  3. Automatic speech segmentation with HTK, Kyle Gorman, Department of Linguistics, University of Pennsylvania, Institute for Research in Cognitive Science.
  4. HTK Tutorial, Giampiero Salvi, KTH (Royal Institute of Technology), Dep. of Speech, Music and Hearing, Drottning Kristinas v. 31, SE-100 44, Stockholm, Sweden.
  5. L21, Introduction to Speech Processing | Ricardo Gutierrez Osuna | CSE@TAMU
  6. P. Cosi, D. Falavigna, and M. Omologo, "A preliminary statistical evaluation of manual and automatic segmentation discrepancies," in Proceedings EUROSPEECH, 1991, pp. 693–696.
  7. A. Ljolje, J. Hirschberg, and J. P. H. Van Santen, "Automatic speech segmentation for concatenative inventory selection," in Progress in Speech Synthesis, J. P. H. Van Santen, Ed: Springer, 1997, pp. 305–311.
  8. A. Ljolje and M. D. Riley, "Automatic segmentation of speech for TTS," in Proceedings EUROSPEECH, 1993, pp. 1445–1448.
Index Terms

Computer Science
Information Sciences

Keywords

HMM, HTK, Phonetic Segmentation, MFCC, Speech Synthesis, Viterbi