The Development of Pashto Speech Synthesis System

Muhammad Akbar Ali Khan; Sahibzada Abdur Rehman Abid; Fatima Tuz Zuhra; Nasir Ahmad

Call for Paper

May Edition

IJCA solicits high quality original research papers for the upcoming May edition of the journal. The last date of research paper submission is 20 April 2026

Submit your paper

Know more

The week's pick

Evaluating Text-to-Text Generation from LLMs: A Case Study and Scalable Framework

Ziqiao Ao Juhi Singh Sebastian Antinome

Random Articles

Self-Training using a K-Nearest Neighbor as a Base Classifier Reinforced by Support Vector Machines

October

2012

GPU based Suffix Array Pattern Matching Approach for Big Data

Jul

2017

Open Source Vs Proprietary Application and Technologies

July

2012

Representation Learning with Adaptive Superpixel Coding

Dec

2025

Reseach Article

The Development of Pashto Speech Synthesis System

by Muhammad Akbar Ali Khan, Sahibzada Abdur Rehman Abid, Fatima Tuz Zuhra, Nasir Ahmad

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 71 - Number 24

Year of Publication: 2013

Authors: Muhammad Akbar Ali Khan, Sahibzada Abdur Rehman Abid, Fatima Tuz Zuhra, Nasir Ahmad

10.5120/12695-9478

Muhammad Akbar Ali Khan, Sahibzada Abdur Rehman Abid, Fatima Tuz Zuhra, Nasir Ahmad . The Development of Pashto Speech Synthesis System. International Journal of Computer Applications. 71, 24 ( June 2013), 49-53. DOI=10.5120/12695-9478

@article{ 10.5120/12695-9478,

author = { Muhammad Akbar Ali Khan, Sahibzada Abdur Rehman Abid, Fatima Tuz Zuhra, Nasir Ahmad },

title = { The Development of Pashto Speech Synthesis System },

journal = { International Journal of Computer Applications },

issue_date = { June 2013 },

volume = { 71 },

number = { 24 },

month = { June },

year = { 2013 },

issn = { 0975-8887 },

pages = { 49-53 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume71/number24/12695-9478/ },

doi = { 10.5120/12695-9478 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T21:36:35.835902+05:30

%A Muhammad Akbar Ali Khan

%A Sahibzada Abdur Rehman Abid

%A Fatima Tuz Zuhra

%A Nasir Ahmad

%T The Development of Pashto Speech Synthesis System

%J International Journal of Computer Applications

%@ 0975-8887

%V 71

%N 24

%P 49-53

%D 2013

%I Foundation of Computer Science (FCS), NY, USA

Abstract

This paper presents a novel Pashto text-to-speech (TTS) synthesis system based on data driven techniques such as Classification and Regression Tree (CART), Bigrams, and Non Uniform Units (NUUs). A modular concatenative TTS system has been developed for the Pashto language. Speech synthesis is carried out through a series of steps with the intention to provide a gradually more absolute transcription of the text, from which the final speech signal is then generated. The steps can be divided into two modules; a Natural Language Processing (NLP) module and a Digital Signal Processing (DSP) module. These steps incrementally enhance the information derived from the input and put it on a generally accessible internal data structure. The goal is to obtain enough information on the internal data structure so as to be capable to obtain an intelligible and natural speech.

References

R. Sproat, and J. Olive, "Text-to-Speech Synthesis" in V. K. Madisetti and D. B. Williams (eds. ), Digital Signal Processing Handbook, Ch. 46, CRC Press, 1998.
J. Allen, M. S. Hunnicutt, and D. Klatt, From Text to Speech, Cambridge University Press, Cambridge, 1987.
J. Allen, M. S. Hunnicutt, and D. Klatt, From Text to Speech: the MITalk System, Cambridge University Press, Cambridge, 1987.
P. J. Farrugia "Text-To-Speech Technologies for Mobile Telephony Services", MSc thesis, Dept of Computer Science and AI, University of Malta 2005.
S. Parthasarathy, and C. H. Coker, "Automatic estimation of articulatory parameters", Computer Speech and Language, vol. 6, no. 1, pp. 37-75, 1992.
B. Baxter, and W. J. Strong, "WINDBAG—a vocal-tract analog speech synthesizer", Journal of the Acoustical Society of America, vol. 45, no. 1, pp. 309, 1969.
P. Birkholz, D. Jackel, and B. J. Kröger, "Construction and control of a three-dimensional vocal tract model", ICASSP 2006, Toulouse, France, pp. 873-876. 2006.
R. Carlson, B. Granström, and I. Karlsson "Experiments with voice modelling in speech synthesis", Speech communication, vol. 10, pp. 481-490. 1991.
R. Donovan, "Trainable Speech Synthesis", PhD. Thesis. Cambridge University Engineering Department, England, 1996.
A. J. Hunt and A. W. Black, "Unit Selection in a Concatenative Speech Synthesis System Using a Large Speech Database", ATR Interpreting Telecommunications Research Labs. 2-2 Hikaridai, Seika-cho, Soraku-gun, Kyoto 619-02, Japan, 1996.
S. Lemmetty, "Review of Speech Synthesis Technology", MSc Thesis, Helsinki University of Technology Department of Electrical and Communications Engineering, March 30, 1999.
M. J. Liberman and K. W. Church, "Text Analysis and Word Pronunciation in Text-to-Speech Synthesis," in S. Furui and M. M. Sondhi, (eds. ), Advances in Speech Signal Processing, pp. 791–831. Dekker, New York, 1992.
F. Malfrere, O. Deroo, T. Dutoit, and C. Ris, "Phonetic Alignement: Speech-Synthesis-based versus Viterbi-based", Speech Communication, vol. 40, no. 4, pp. 503–517, 2003.

Index Terms

Computer Science

Information Sciences

Keywords

Pashto speech synthesis Classification and Regression Tree Non Uniform Units Pashto TTS