Speech Synthesis - Automatic Segmentation

Poonam Bansal; Amita Pradhan; Ankita Goyal; Astha Sharma; Mona Arora

Call for Paper

April Edition

IJCA solicits high quality original research papers for the upcoming April edition of the journal. The last date of research paper submission is 20 March 2026

Submit your paper

Know more

The week's pick

Explainable Hybrid Deep Learning for Automated Diagnosis of Canine Mammary Tumors

Elham Shawky Salama Heba Askr Ashraf Darwish Aboul Ella Hassanien

Random Articles

Reseach Article

Speech Synthesis - Automatic Segmentation

by Poonam Bansal, Amita Pradhan, Ankita Goyal, Astha Sharma, Mona Arora

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 98 - Number 4

Year of Publication: 2014

Authors: Poonam Bansal, Amita Pradhan, Ankita Goyal, Astha Sharma, Mona Arora

10.5120/17172-7253

Poonam Bansal, Amita Pradhan, Ankita Goyal, Astha Sharma, Mona Arora . Speech Synthesis - Automatic Segmentation. International Journal of Computer Applications. 98, 4 ( July 2014), 29-31. DOI=10.5120/17172-7253

@article{ 10.5120/17172-7253,

author = { Poonam Bansal, Amita Pradhan, Ankita Goyal, Astha Sharma, Mona Arora },

title = { Speech Synthesis - Automatic Segmentation },

journal = { International Journal of Computer Applications },

issue_date = { July 2014 },

volume = { 98 },

number = { 4 },

month = { July },

year = { 2014 },

issn = { 0975-8887 },

pages = { 29-31 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume98/number4/17172-7253/ },

doi = { 10.5120/17172-7253 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T22:25:20.010611+05:30

%A Poonam Bansal

%A Amita Pradhan

%A Ankita Goyal

%A Astha Sharma

%A Mona Arora

%T Speech Synthesis - Automatic Segmentation

%J International Journal of Computer Applications

%@ 0975-8887

%V 98

%N 4

%P 29-31

%D 2014

%I Foundation of Computer Science (FCS), NY, USA

Abstract

In this paper, after an a review of the previous work done in this field, the most frequently used approach using Hidden Markov Model (HMM) is used for implementation for phonetic segmentation. A baseline HMM phonetic segmentation tool is used for segmentation and analysis of speech at phonetic level. The results are approximately same as obtained using manual segmentation.

References

Automatic Phonetic Segmentation, D. T. Toledano, L. A. H. Gómez , Member, IEEE, and L. V. Grande IEEE transactions on speech and audio processing, Vol. 11, No. 6, Nov 2003.
The HTK Book, S. Young, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, Version 2. 1: Cambridge University, 1997.
Automatic speech segmentation with HTK, Kyle Gorman, Department of Linguistics,University of Pennsylvania, nstitute for Research in Cognitive Science .
HTK Tutorial,Giampiero Salvi, KTH (Royal Institute of Technology),Dep. of Speech, Music and Hearing, Drottning Kristinas v. 31,SE-100 44, Stockholm, Sweden .
L21, Introduction to Speech Processing | Ricardo Gutierrez Osuna | CSE@TAMU
P. Cosi, D. Falavigna, and M. Omologo, "A preliminary statistical evaluation of manual and automatic segmentation discrepancies," in Proceedings EUROSPEECH, 1991, pp. 693–696.
A. Ljolje, J. Hirschberg, and J. P. H. Van Santen,"Automatic speech segmentation for oncatenative inventory selection," in Progress in Speech Synthesis, J. P. H. Van Santen, Ed: Springer, 1997, pp. 305–311.
A. Ljolje and M. D. Riley, "Automatic segmentation of speech for TTS," in Proceedings EUROSPEECH, 1993, pp. 1445–1448.

Index Terms

Computer Science

Information Sciences

Keywords

HMM HTK Phonetic Segmentation MFCC Speech Synthesis Viterbi