Speech Synthesis System for Telugu Language

G. Swathi; C. Kiran Mai; B. Raveendra Babu

Research Article

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 81 - Number 5

Year of Publication: 2013

Authors: G. Swathi, C. Kiran Mai, B. Raveendra Babu

DOI: 10.5120/14009-2060

G. Swathi, C. Kiran Mai, B. Raveendra Babu. Speech Synthesis System for Telugu Language. International Journal of Computer Applications. 81, 5 (November 2013), 25-30. DOI=10.5120/14009-2060

@article{10.5120/14009-2060,
  author = {G. Swathi and C. Kiran Mai and B. Raveendra Babu},
  title = {Speech Synthesis System for Telugu Language},
  journal = {International Journal of Computer Applications},
  issue_date = {November 2013},
  volume = {81},
  number = {5},
  month = {November},
  year = {2013},
  issn = {0975-8887},
  pages = {25-30},
  numpages = {9},
  url = {https://ijcaonline.org/archives/volume81/number5/14009-2060/},
  doi = {10.5120/14009-2060},
  publisher = {Foundation of Computer Science (FCS), NY, USA},
  address = {New York, USA}
}

%0 Journal Article
%1 2024-02-06T21:55:17.415906+05:30
%A G. Swathi
%A C. Kiran Mai
%A B. Raveendra Babu
%T Speech Synthesis System for Telugu Language
%J International Journal of Computer Applications
%@ 0975-8887
%V 81
%N 5
%P 25-30
%D 2013
%I Foundation of Computer Science (FCS), NY, USA

Abstract

A speech synthesis system takes a sequence of words as input and converts it to speech. Vowels and consonants are the most important units in the Telugu language. The voices are sampled from real recorded speech. Speech synthesis can run on computers and mobile phones. To build a natural-sounding speech synthesis system, it is essential that the text processing component produce an appropriate sequence of phonemic units. Generating the sequence of phonetic units for a given standard word is referred to as a letter-to-phoneme or text-to-phoneme rule. The complexity of these rules and their derivation depends on the nature of the language. In Telugu TTS, the input is Telugu text in Unicode. Speech synthesis is the technique of converting given input text to synthetic speech. It can be used to read written text such as e-mail, SMS, and newspapers aloud, and can be used by blind people. Speech synthesis has been widely researched over the last four decades. The quality and intelligibility of the synthetic speech produced by the latest methods are remarkably good for most applications. This project focuses primarily on the process of creating a voice for a concatenative text-to-speech system, or altering the TTS system's own standard output voice to sound more like the target voice.
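The two steps named in the abstract, letter-to-phoneme conversion of Telugu Unicode text and concatenation of recorded units, might be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the rule tables cover a handful of characters, the romanized phoneme labels are arbitrary, and the names `text_to_phonemes`, `concatenate_units`, and the unit inventory are hypothetical.

```python
# Illustrative sketch only: tiny, hypothetical rule tables for a few Telugu
# characters. A real system would need the full Telugu inventory and more rules.
VOWELS = {"\u0c05": "a", "\u0c06": "aa", "\u0c07": "i", "\u0c09": "u"}
CONSONANTS = {"\u0c15": "k", "\u0c17": "g", "\u0c24": "t",
              "\u0c2e": "m", "\u0c32": "l"}
VOWEL_SIGNS = {"\u0c3e": "aa", "\u0c3f": "i", "\u0c41": "u", "\u0c46": "e"}
VIRAMA = "\u0c4d"        # suppresses the inherent vowel of a consonant
INHERENT_VOWEL = "a"     # a bare consonant letter is read with this vowel


def text_to_phonemes(word):
    """Map a Telugu Unicode word to a list of romanized phoneme labels."""
    phonemes = []
    i = 0
    while i < len(word):
        ch = word[i]
        nxt = word[i + 1] if i + 1 < len(word) else ""
        if ch in CONSONANTS:
            phonemes.append(CONSONANTS[ch])
            if nxt in VOWEL_SIGNS:
                # A dependent vowel sign replaces the inherent vowel.
                phonemes.append(VOWEL_SIGNS[nxt])
                i += 1
            elif nxt == VIRAMA:
                # Virama: consonant with no vowel (start of a cluster).
                i += 1
            else:
                phonemes.append(INHERENT_VOWEL)
        elif ch in VOWELS:
            phonemes.append(VOWELS[ch])
        else:
            raise ValueError(f"no rule for character U+{ord(ch):04X}")
        i += 1
    return phonemes


def concatenate_units(phonemes, inventory):
    """Join recorded sample lists for each phoneme into one waveform.

    `inventory` is a hypothetical dict from phoneme label to a list of
    audio samples; real systems also smooth the joins between units.
    """
    waveform = []
    for p in phonemes:
        waveform.extend(inventory[p])
    return waveform


telugu = "\u0c24\u0c46\u0c32\u0c41\u0c17\u0c41"   # the word "Telugu"
amma = "\u0c05\u0c2e\u0c4d\u0c2e"                 # the word "amma"
print(text_to_phonemes(telugu))
print(text_to_phonemes(amma))
```

The sketch uses phoneme-sized units for brevity; the unit size, the inventory format, and any smoothing at unit boundaries are design choices the abstract does not specify.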

Index Terms

Computer Science

Information Sciences

Keywords

Text processing, speech generation, phoneme, speech synthesis