An Automatic MBROLA Tool For High Quality Arabic Speech Synthesis

Abdelkader Chabchoub; Adnan Cherif

Call for Paper

May Edition

IJCA solicits high quality original research papers for the upcoming May edition of the journal. The last date of research paper submission is 20 April 2026

Submit your paper

Know more

The week's pick

A Unified NIST SP 800-90B Validation Framework for CMOS True Random Number Generators and Quantum Random Number Generators

Che-Ping Lin

Random Articles

Reseach Article

An Automatic MBROLA Tool For High Quality Arabic Speech Synthesis

by Abdelkader Chabchoub, Adnan Cherif

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 36 - Number 1

Year of Publication: 2011

Authors: Abdelkader Chabchoub, Adnan Cherif

10.5120/4452-6228

Abdelkader Chabchoub, Adnan Cherif . An Automatic MBROLA Tool For High Quality Arabic Speech Synthesis. International Journal of Computer Applications. 36, 1 ( December 2011), 1-5. DOI=10.5120/4452-6228

@article{ 10.5120/4452-6228,

author = { Abdelkader Chabchoub, Adnan Cherif },

title = { An Automatic MBROLA Tool For High Quality Arabic Speech Synthesis },

journal = { International Journal of Computer Applications },

issue_date = { December 2011 },

volume = { 36 },

number = { 1 },

month = { December },

year = { 2011 },

issn = { 0975-8887 },

pages = { 1-5 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume36/number1/4452-6228/ },

doi = { 10.5120/4452-6228 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T20:21:58.154815+05:30

%A Abdelkader Chabchoub

%A Adnan Cherif

%T An Automatic MBROLA Tool For High Quality Arabic Speech Synthesis

%J International Journal of Computer Applications

%@ 0975-8887

%V 36

%N 1

%P 1-5

%D 2011

%I Foundation of Computer Science (FCS), NY, USA

Abstract

This work describes the Arabic Text-to-Speech (TTS) synthesis system. This system uses an automatic tool based on Diphone concatenation with MBROLA synthesizer. The quality of a synthesized speech is improved by analyzing the spectrum features of voice source in various F0 ranges and timbres in detail. It generates speech synthesis based on analysis and estimation of formant by classifying the voice source into different types. The developed model enhances the quality of the naturalness, and the intelligibility of speech synthesis in various speaking environment.

References

Alghmadi M. 2003. "KACST Arabic Phonetic Database", the Fifteenth International Congress of Phonetics Science, Barcelona 2003, pp 3109-3112.
Dutoit, T., Pagel, V., Pierret, N., Bataille, F. & van der Vrecken, O.1996. The MBROLA Project: Towards a Set of High-Quality Speech Synthesizers Free of Use.
Assaf, M.2005. "A Prototype of an Arabic Diphone Speech Synthesizer in Festival", Master Thesis, Department of Linguistics and Philology, Uppsala University.
Al-Zabibi, M.1990. "An Acoustic–Phonetic Approach in Automatic Arabic Speech Recognition", the British Library in Association with UMI.
Ibraheem, A.1990."Al-Aswat Al-Arabia", Arabic title, Anglo-Egyptian Publisher, Egypt.
Muhammad, A.1990. "Alaswaat Alaghawaiyah", Daar Alfalah, Jordan, (in Arabic).
Demenko, G., Grocholewski, S., Wagner, A. & Szymanski, M. 2006. "Prosody Annotation for Corpus Based Speech Synthesis". In: Proceedings of the Eleventh Australasian International Conference on Speech Science and Technology. Auckland, New Zealand, pp. 460-465.
Boersma, P. & Weenink, D. 2005. Praat. Doing phonetics by computer.
Computer program]. Version 4.3.04 Retrieved March 31, 2005 from http://www.praat.org/
Bachan, J. & Gibbon, D.2006. "Close Copy Speech Synthesis for Speech Perception Testing"In: Investigationes Linguisticae, vol. 13, pp. 9--24.
L. Welling, L., Ney, H.1998. "Formant Estimation for Speech Recognition", IEEE Trans. On Speech and Audio Processing, Vol.6, No.1.
Fujisaki, H. 1996. "Recent Research towards Advanced Man-Machine Interface through Spoken Language", Elsevier Science.
Krishnamurthy, A.K.1992. "Glottal Source Estimation Using a Sum-of-Exponentials Model", IEEE Trans. On Signal Processing, Vol. 40, No. 3, March 1992.
Walker, J., Murphy, P.2007. A review of glottal waveform analysis. In: Progress in Nonlinear Speech Processing.
Fant G. 1986. "Glottal flow: models and interaction", Journal of Phonetics, 14, 393-399.
Milenkovic, P. 1986. "Glottal Inverse Filtering by Joint Estimation of an AR System with a Linear Input Model", IEEE Trans. On Acoustics, Speech, and Signal Processing, Vol.ASSP-34, No.1.
Jianhua, T., Yongguo, K. 2004 ." multi-source based acoustic model for speech synthesis", 5th ISCA Speech Synthesis Workshop Pittsburgh, PA, USA, 14-16.

Index Terms

Computer Science

Information Sciences

Keywords

Arabic speech synthesis Diphone Spectrum analysis Formant Pitch Timbre MBROLA Inverse filtering