Nepali Text to Speech Synthesis System using ESNOLA Method of Concatenation

Bhusan Chettri; Krishna Bikram Shah

Call for Paper

October Edition

IJCA solicits high quality original research papers for the upcoming October edition of the journal. The last date of research paper submission is 22 September 2025

Submit your paper

Know more

The week's pick

RESPONSIVE WEB DESIGN FOR ENHANCED USER EXPERIENCE (UX) AND USER INTERFACE (UI)

Victor Aienobe Muhammad Zahid Iqbal

Random Articles

Reseach Article

Nepali Text to Speech Synthesis System using ESNOLA Method of Concatenation

by Bhusan Chettri, Krishna Bikram Shah

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 62 - Number 2

Year of Publication: 2013

Authors: Bhusan Chettri, Krishna Bikram Shah

10.5120/10053-4909

Bhusan Chettri, Krishna Bikram Shah . Nepali Text to Speech Synthesis System using ESNOLA Method of Concatenation. International Journal of Computer Applications. 62, 2 ( January 2013), 24-28. DOI=10.5120/10053-4909

@article{ 10.5120/10053-4909,

author = { Bhusan Chettri, Krishna Bikram Shah },

title = { Nepali Text to Speech Synthesis System using ESNOLA Method of Concatenation },

journal = { International Journal of Computer Applications },

issue_date = { January 2013 },

volume = { 62 },

number = { 2 },

month = { January },

year = { 2013 },

issn = { 0975-8887 },

pages = { 24-28 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume62/number2/10053-4909/ },

doi = { 10.5120/10053-4909 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T21:10:36.800641+05:30

%A Bhusan Chettri

%A Krishna Bikram Shah

%T Nepali Text to Speech Synthesis System using ESNOLA Method of Concatenation

%J International Journal of Computer Applications

%@ 0975-8887

%V 62

%N 2

%P 24-28

%D 2013

%I Foundation of Computer Science (FCS), NY, USA

Abstract

This paper confer the tools and methodology used in developing a Nepali Text to Speech Synthesis System, which is based on concatenative approach employing Epoch Synchronous Non Overlap Add Method (ESNOLA), which uses signal dictionary having raw sound signal representing parts of phonemes as a speech database. The developed system is an unintonated (flat) TTS system where the pitch of the pre-recorded speech signal remains same throughout, while taking care of aspects such as naturalness, personality, platform independence and quality assessments. Some of the applications and problems encountered with TTS systems are also discussed.

References

Jonathan Allen, M. Sharon Hunnicutt, Dennis Klatt, "From Text to Speech The MITalk System", Cambridge University Press, 1987.
Mandal, Shyamal Kumar Das and Datta, Asoke kumar, "Epoch Synchronous non-overlap-add (ESNOLA) method based concatenative speech synthesis system for Bangla". ISCA workshop on Speech Synthesis, Bonn, Germany, August 22-24, 2007.
CDAC: Research & Development – Speech Research. Online: Access Date: 4th July, 2012.
Thierry Dutoit. An introduction to Text-To-Speech Synthesis. Kluwer Academic Publishers. 1997.
Speech Synthesis. Online: http://en. wikipedia/wiki/Speech_Synthesis. Access Date: 4th July, 2012.
Building Synthetic Voices. Online: http://www. festvox. org/bsv. Access Date: 4th July, 2012.
Muhammad Masud Rashid, Md. Akhter Hussain, M. Shahidur Rahman, "Diphone preparation for Bangla text to Speech Synthesis", Proc. Of International Conference on Computer Sciences and Information Technology, pp. 226-230, Dhaka, November, 2009.
Firoj Alam, S. M. Murtoza Habib, Mumit Khan, "Text normalization System for Bangla", Proc. of Conference on Language and Technology, Lahore, pp. 22-24, 2009.
Firoj Alam, Promila Kanti Nath and Mumit Khan, "Text to Speech for Bangla language using Festival", Proc. of Intl. Conf. on Digital Communications and Computer Applications, Irbid, Jordan, 2007.
Tanuja Sarkar, Venkatesh Keri, Santhosh Yuvaraj, Kishore Prahalad, "Building Bengali Voice using Festival", Proc. of ICLSI 2005, Hyderabad, India, 2005.
Das Mandal S. K, Datta A. K, Gupta B. "Spectral Matching of Epoch Synchronous Non-Over lapping Add (ESNOLA) Method based Concatenative Synthesizer", International Conference on Communication Devices and Intelligent System (CODIS-2004), Jadavpur University, 2004, pp. 729-732.
Das Mandal Shyamal Kr, Saha Arup, Sarkar Indranil, Datta Asoke Kumar, "Phonological, International & Prosodic Aspects of Concatenative SpeechSynthesizer Development for Bangla", Proceedings of SIMPLE-05, February 2005, pp. 56-60, 2005.
Nepali TTS. Online: http://www. bhashasanchar. org/pdfs/NepaliTTS_%20manual. pdf. Access Date: 4th July, 2012.
Nepali Language. Online: http://en. wikipedia. org/wiki/Nepali_language. Access Date: 4th July, 2012
Nepali fonts. Online: http://www. explorenepal. com/fonts. Access Date: 4th July, 2012.
J. Acharya, A Descriptive Grammar of Nepali And An Analyzed Corpus, Georgetown University Press, Washington, DC, 1991.
M. J. Liberman, K. W. Church, "Text Analysis and Word Pronunciation in Text-to-Speech Synthesis", Advances in Speech Signal Processing, S. Fumy, M. M. Sondhi eds, Dekker, New York, pp. 791-831, 1992.
Narasimhan B, Sproat R and Kiraz G. Schwa-deletion in Hindi Text-to-speech synthesis. In workshop on computational linguistic in South Asian Languages, 21st SALA, October 2001, Konstanz.
Hunnicut S. , "Grapheme-to-Phoneme rules: a Review", Speech Transmission Laboratory, Royal Institute of Technology, Stockholm, Sweden, QPSR 2-3, pp.

Index Terms

Computer Science

Information Sciences

Keywords

TTS ESNOLA Partneme Speech Synthetic Synthesis