CFP last date
20 December 2024
Reseach Article

The Development of Pashto Speech Synthesis System

by Muhammad Akbar Ali Khan, Sahibzada Abdur Rehman Abid, Fatima Tuz Zuhra, Nasir Ahmad
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 71 - Number 24
Year of Publication: 2013
Authors: Muhammad Akbar Ali Khan, Sahibzada Abdur Rehman Abid, Fatima Tuz Zuhra, Nasir Ahmad
10.5120/12695-9478

Muhammad Akbar Ali Khan, Sahibzada Abdur Rehman Abid, Fatima Tuz Zuhra, Nasir Ahmad . The Development of Pashto Speech Synthesis System. International Journal of Computer Applications. 71, 24 ( June 2013), 49-53. DOI=10.5120/12695-9478

@article{ 10.5120/12695-9478,
author = { Muhammad Akbar Ali Khan, Sahibzada Abdur Rehman Abid, Fatima Tuz Zuhra, Nasir Ahmad },
title = { The Development of Pashto Speech Synthesis System },
journal = { International Journal of Computer Applications },
issue_date = { June 2013 },
volume = { 71 },
number = { 24 },
month = { June },
year = { 2013 },
issn = { 0975-8887 },
pages = { 49-53 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume71/number24/12695-9478/ },
doi = { 10.5120/12695-9478 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:36:35.835902+05:30
%A Muhammad Akbar Ali Khan
%A Sahibzada Abdur Rehman Abid
%A Fatima Tuz Zuhra
%A Nasir Ahmad
%T The Development of Pashto Speech Synthesis System
%J International Journal of Computer Applications
%@ 0975-8887
%V 71
%N 24
%P 49-53
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper presents a novel Pashto text-to-speech (TTS) synthesis system based on data driven techniques such as Classification and Regression Tree (CART), Bigrams, and Non Uniform Units (NUUs). A modular concatenative TTS system has been developed for the Pashto language. Speech synthesis is carried out through a series of steps with the intention to provide a gradually more absolute transcription of the text, from which the final speech signal is then generated. The steps can be divided into two modules; a Natural Language Processing (NLP) module and a Digital Signal Processing (DSP) module. These steps incrementally enhance the information derived from the input and put it on a generally accessible internal data structure. The goal is to obtain enough information on the internal data structure so as to be capable to obtain an intelligible and natural speech.

References
  1. R. Sproat, and J. Olive, "Text-to-Speech Synthesis" in V. K. Madisetti and D. B. Williams (eds. ), Digital Signal Processing Handbook, Ch. 46, CRC Press, 1998.
  2. J. Allen, M. S. Hunnicutt, and D. Klatt, From Text to Speech, Cambridge University Press, Cambridge, 1987.
  3. J. Allen, M. S. Hunnicutt, and D. Klatt, From Text to Speech: the MITalk System, Cambridge University Press, Cambridge, 1987.
  4. P. J. Farrugia "Text-To-Speech Technologies for Mobile Telephony Services", MSc thesis, Dept of Computer Science and AI, University of Malta 2005.
  5. S. Parthasarathy, and C. H. Coker, "Automatic estimation of articulatory parameters", Computer Speech and Language, vol. 6, no. 1, pp. 37-75, 1992.
  6. B. Baxter, and W. J. Strong, "WINDBAG—a vocal-tract analog speech synthesizer", Journal of the Acoustical Society of America, vol. 45, no. 1, pp. 309, 1969.
  7. P. Birkholz, D. Jackel, and B. J. Kröger, "Construction and control of a three-dimensional vocal tract model", ICASSP 2006, Toulouse, France, pp. 873-876. 2006.
  8. R. Carlson, B. Granström, and I. Karlsson "Experiments with voice modelling in speech synthesis", Speech communication, vol. 10, pp. 481-490. 1991.
  9. R. Donovan, "Trainable Speech Synthesis", PhD. Thesis. Cambridge University Engineering Department, England, 1996.
  10. A. J. Hunt and A. W. Black, "Unit Selection in a Concatenative Speech Synthesis System Using a Large Speech Database", ATR Interpreting Telecommunications Research Labs. 2-2 Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-02, Japan, 1996.
  11. S. Lemmetty, "Review of Speech Synthesis Technology", MSc Thesis, Helsinki University of Technology Department of Electrical and Communications Engineering, March 30, 1999.
  12. M. J. Liberman and K. W. Church, "Text Analysis and Word Pronunciation in Text-to-Speech Synthesis," in S. Furui and M. M. Sondhi, (eds. ), Advances in Speech Signal Processing, pp. 791–831. Dekker, New York, 1992.
  13. F. Malfrere, O. Deroo, T. Dutoit, and C. Ris, "Phonetic Alignement: Speech-Synthesis-based versus Viterbi-based", Speech Communication, vol. 40, no. 4, pp. 503–517, 2003.
Index Terms

Computer Science
Information Sciences

Keywords

Pashto speech synthesis Classification and Regression Tree Non Uniform Units Pashto TTS