We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 November 2024
Reseach Article

Nepali Text to Speech Synthesis System using ESNOLA Method of Concatenation

by Bhusan Chettri, Krishna Bikram Shah
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 62 - Number 2
Year of Publication: 2013
Authors: Bhusan Chettri, Krishna Bikram Shah
10.5120/10053-4909

Bhusan Chettri, Krishna Bikram Shah . Nepali Text to Speech Synthesis System using ESNOLA Method of Concatenation. International Journal of Computer Applications. 62, 2 ( January 2013), 24-28. DOI=10.5120/10053-4909

@article{ 10.5120/10053-4909,
author = { Bhusan Chettri, Krishna Bikram Shah },
title = { Nepali Text to Speech Synthesis System using ESNOLA Method of Concatenation },
journal = { International Journal of Computer Applications },
issue_date = { January 2013 },
volume = { 62 },
number = { 2 },
month = { January },
year = { 2013 },
issn = { 0975-8887 },
pages = { 24-28 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume62/number2/10053-4909/ },
doi = { 10.5120/10053-4909 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:10:36.800641+05:30
%A Bhusan Chettri
%A Krishna Bikram Shah
%T Nepali Text to Speech Synthesis System using ESNOLA Method of Concatenation
%J International Journal of Computer Applications
%@ 0975-8887
%V 62
%N 2
%P 24-28
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper confer the tools and methodology used in developing a Nepali Text to Speech Synthesis System, which is based on concatenative approach employing Epoch Synchronous Non Overlap Add Method (ESNOLA), which uses signal dictionary having raw sound signal representing parts of phonemes as a speech database. The developed system is an unintonated (flat) TTS system where the pitch of the pre-recorded speech signal remains same throughout, while taking care of aspects such as naturalness, personality, platform independence and quality assessments. Some of the applications and problems encountered with TTS systems are also discussed.

References
  1. Jonathan Allen, M. Sharon Hunnicutt, Dennis Klatt, "From Text to Speech The MITalk System", Cambridge University Press, 1987.
  2. Mandal, Shyamal Kumar Das and Datta, Asoke kumar, "Epoch Synchronous non-overlap-add (ESNOLA) method based concatenative speech synthesis system for Bangla". ISCA workshop on Speech Synthesis, Bonn, Germany, August 22-24, 2007.
  3. CDAC: Research & Development – Speech Research. Online: Access Date: 4th July, 2012.
  4. Thierry Dutoit. An introduction to Text-To-Speech Synthesis. Kluwer Academic Publishers. 1997.
  5. Speech Synthesis. Online: http://en. wikipedia/wiki/Speech_Synthesis. Access Date: 4th July, 2012.
  6. Building Synthetic Voices. Online: http://www. festvox. org/bsv. Access Date: 4th July, 2012.
  7. Muhammad Masud Rashid, Md. Akhter Hussain, M. Shahidur Rahman, "Diphone preparation for Bangla text to Speech Synthesis", Proc. Of International Conference on Computer Sciences and Information Technology, pp. 226-230, Dhaka, November, 2009.
  8. Firoj Alam, S. M. Murtoza Habib, Mumit Khan, "Text normalization System for Bangla", Proc. of Conference on Language and Technology, Lahore, pp. 22-24, 2009.
  9. Firoj Alam, Promila Kanti Nath and Mumit Khan, "Text to Speech for Bangla language using Festival", Proc. of Intl. Conf. on Digital Communications and Computer Applications, Irbid, Jordan, 2007.
  10. Tanuja Sarkar, Venkatesh Keri, Santhosh Yuvaraj, Kishore Prahalad, "Building Bengali Voice using Festival", Proc. of ICLSI 2005, Hyderabad, India, 2005.
  11. Das Mandal S. K, Datta A. K, Gupta B. "Spectral Matching of Epoch Synchronous Non-Over lapping Add (ESNOLA) Method based Concatenative Synthesizer", International Conference on Communication Devices and Intelligent System (CODIS-2004), Jadavpur University, 2004, pp. 729-732.
  12. Das Mandal Shyamal Kr, Saha Arup, Sarkar Indranil, Datta Asoke Kumar, "Phonological, International & Prosodic Aspects of Concatenative SpeechSynthesizer Development for Bangla", Proceedings of SIMPLE-05, February 2005, pp. 56-60, 2005.
  13. Nepali TTS. Online: http://www. bhashasanchar. org/pdfs/NepaliTTS_%20manual. pdf. Access Date: 4th July, 2012.
  14. Nepali Language. Online: http://en. wikipedia. org/wiki/Nepali_language. Access Date: 4th July, 2012
  15. Nepali fonts. Online: http://www. explorenepal. com/fonts. Access Date: 4th July, 2012.
  16. J. Acharya, A Descriptive Grammar of Nepali And An Analyzed Corpus, Georgetown University Press, Washington, DC, 1991.
  17. M. J. Liberman, K. W. Church, "Text Analysis and Word Pronunciation in Text-to-Speech Synthesis", Advances in Speech Signal Processing, S. Fumy, M. M. Sondhi eds, Dekker, New York, pp. 791-831, 1992.
  18. Narasimhan B, Sproat R and Kiraz G. Schwa-deletion in Hindi Text-to-speech synthesis. In workshop on computational linguistic in South Asian Languages, 21st SALA, October 2001, Konstanz.
  19. Hunnicut S. , "Grapheme-to-Phoneme rules: a Review", Speech Transmission Laboratory, Royal Institute of Technology, Stockholm, Sweden, QPSR 2-3, pp.
Index Terms

Computer Science
Information Sciences

Keywords

TTS ESNOLA Partneme Speech Synthetic Synthesis