Research Article

An Automatic MBROLA Tool For High Quality Arabic Speech Synthesis

by Abdelkader Chabchoub, Adnan Cherif
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 36 - Number 1
Year of Publication: 2011
Authors: Abdelkader Chabchoub, Adnan Cherif
DOI: 10.5120/4452-6228

Abdelkader Chabchoub, Adnan Cherif. An Automatic MBROLA Tool For High Quality Arabic Speech Synthesis. International Journal of Computer Applications. 36, 1 (December 2011), 1-5. DOI=10.5120/4452-6228

@article{ 10.5120/4452-6228,
author = { Abdelkader Chabchoub and Adnan Cherif },
title = { An Automatic MBROLA Tool For High Quality Arabic Speech Synthesis },
journal = { International Journal of Computer Applications },
issue_date = { December 2011 },
volume = { 36 },
number = { 1 },
month = { December },
year = { 2011 },
issn = { 0975-8887 },
pages = { 1-5 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume36/number1/4452-6228/ },
doi = { 10.5120/4452-6228 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:21:58.154815+05:30
%A Abdelkader Chabchoub
%A Adnan Cherif
%T An Automatic MBROLA Tool For High Quality Arabic Speech Synthesis
%J International Journal of Computer Applications
%@ 0975-8887
%V 36
%N 1
%P 1-5
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This work describes an Arabic text-to-speech (TTS) synthesis system. The system uses an automatic tool based on diphone concatenation with the MBROLA synthesizer. The quality of the synthesized speech is improved by analyzing in detail the spectral features of the voice source across various F0 ranges and timbres. Speech is generated from formant analysis and estimation, with the voice source classified into different types. The developed model improves the naturalness and intelligibility of synthesized speech in various speaking environments.
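
As an illustration of the diphone-concatenation input mentioned above, the following minimal sketch renders a phoneme sequence in the plain-text .pho format that MBROLA reads: one phoneme per line, followed by its duration in milliseconds and optional (position %, F0 Hz) pitch targets, with `_` marking silence. The function name `to_pho`, the phoneme symbols, and all durations and pitch values are hypothetical and chosen only for illustration; they are not taken from the paper, and the actual symbol inventory depends on the diphone database in use.

```python
from typing import List, Sequence, Tuple

# One segment: (phoneme symbol, duration in ms, pitch targets),
# where each pitch target is (position in % of the duration, F0 in Hz).
Segment = Tuple[str, int, Sequence[Tuple[int, int]]]


def to_pho(segments: Sequence[Segment]) -> str:
    """Render segments as MBROLA .pho text, one phoneme per line."""
    lines: List[str] = []
    for symbol, duration_ms, pitch_targets in segments:
        if duration_ms <= 0:
            raise ValueError(f"duration must be positive for {symbol!r}")
        fields = [symbol, str(duration_ms)]
        for position_pct, f0_hz in pitch_targets:
            if not 0 <= position_pct <= 100:
                raise ValueError(f"pitch position out of range for {symbol!r}")
            fields += [str(position_pct), str(f0_hz)]
        lines.append(" ".join(fields))
    return "\n".join(lines) + "\n"


# Hypothetical example: symbols and values are placeholders, not paper data.
example: List[Segment] = [
    ("_", 100, []),                      # leading silence
    ("s", 90, []),                       # unvoiced, so no pitch targets
    ("a", 120, [(0, 120), (100, 130)]),  # F0 rises from 120 Hz to 130 Hz
    ("l", 70, []),
    ("a:", 180, [(50, 125)]),            # single target at mid-phoneme
    ("m", 110, [(100, 100)]),            # F0 falls toward the end
    ("_", 100, []),                      # trailing silence
]

print(to_pho(example), end="")
```

A file produced this way would be given to the synthesizer together with a diphone database; the prosody (durations and F0 targets) is supplied entirely by the caller, which is where an automatic tool such as the one described would do its work.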

Index Terms

Computer Science
Information Sciences

Keywords

Arabic speech synthesis, Diphone, Spectrum analysis, Formant, Pitch, Timbre, MBROLA, Inverse filtering