CFP last date
20 December 2024
Reseach Article

BiCross : Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System

by Arun Soman, Sachin Kumar S., Hemanth V. K.
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 29 - Number 4
Year of Publication: 2011
Authors: Arun Soman, Sachin Kumar S., Hemanth V. K.
10.5120/3551-4871

Arun Soman, Sachin Kumar S., Hemanth V. K. . BiCross : Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System. International Journal of Computer Applications. 29, 4 ( September 2011), 41-46. DOI=10.5120/3551-4871

@article{ 10.5120/3551-4871,
author = { Arun Soman, Sachin Kumar S., Hemanth V. K. },
title = { BiCross : Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System },
journal = { International Journal of Computer Applications },
issue_date = { September 2011 },
volume = { 29 },
number = { 4 },
month = { September },
year = { 2011 },
issn = { 0975-8887 },
pages = { 41-46 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume29/number4/3551-4871/ },
doi = { 10.5120/3551-4871 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:14:56.119721+05:30
%A Arun Soman
%A Sachin Kumar S.
%A Hemanth V. K.
%T BiCross : Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System
%J International Journal of Computer Applications
%@ 0975-8887
%V 29
%N 4
%P 41-46
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In a text-to-speech system, spoken utterances are automatically produced from text. In this paper, we present a corpus-driven Malayalam text-to-speech (TTS) system based on the concatenative synthesis approach. The most important qualities of a synthesized speech are naturalness and intelligibility. In this system, words and syllables are used as the basic units for synthesis. Our corpus consists of speech waveforms that are collected for most frequently used words in different domains. The speaker is selected through subjective and objective evaluation of natural and synthesized waveform. The proposed Malayalam text-to-speech system is implemented in Java multimedia framework (JMF) and runs on both in Windows and Linux platforms. The proposed system provides utility to save the synthesized output. The output generated by the proposed Malayalam text-to-speech synthesis system resembles natural human voice. Our text to speech reader software converts a Malayalam text to speech wav file that has high rates of intelligibility and comprehensibility.

References
  1. Marian Macchi, Bellcore,”Issues in text-to-speech synthesis” In Proc. EEE International Joint Symposia on Intelligence and Systems, pp.318-325, 1998
  2. N.P. Narendra, K. Sreenivasa Rao ,Krishnendu Ghosh, Ramu Reddy Vempada, Sudhamay Maity “Development of syllable-based text to speech synthesis system in Bengali” DOI: 10.1007/s10772-011-9094-4
  3. C. Pornpanomchai, N. Soontharanont, C. Langla, N. Wongsawat, "A dictionary-based approach for Thai text to speech (TTTS)," icmtma, In Proc. Third Int. Conference on Measuring Technology and Mechatronics Automation, vol. 1, pp.40-43, 2011
  4. M. N. Rao, S. Thomas, T. Nagarajan, and H. A. Murthy, “Text-to-speech synthesis using syllable like units”. In National conference on communication, IIT Kharagpur, pp. 227–280, 2005.
  5. D. H. Klatt, “Review of text-to-speech conversion for English”. The Journal of the Acoustical Society of America, 82, 737–793, 1987
  6. M. Sreekanth, & A.G. Ramakrishnan, “Festival based maiden tts system for Tamil language”. In Proc. 3rd language and technology conf., Poznan, Poland, October pp. 187–191, 2007.
  7. S. P. Kishore and A. W. Black, “Unit size in unit selection speech synthesis,” in Proc. Eurospeech 2003, Sept. 2003.
  8. N.S. Krishna, P.P. Talukdar, K. Bali, A.G. Ramakrishnan, “Duration modeling for Hindi text-to-speech synthesis system”, in Proc. Of Int. Conference on Spoken Language Processing (ICSLP’04), Korea, 2004
  9. A. Hunt, & A. Black, “Unit selection in a concatenative speech synthesis system using a large speech database”, In Proc. of EEE int. Conference acoust, speech, and signal processing, vol. 1, pp. 373–376, 1996
  10. N. Sridhar Krishna, Hema A.Murthy and Timothy A. Gonsalves, “Text-to-Speech (TTS) in Indian Languages”, Int. Conference on Natural Language Processing, ICON-2002, Mumbai, pp. 317.326, 2002.
  11. Robert J. Utama, Ann K. Syrdal, Alistair Conkie,” Six approaches to limited domain concatenative speech synthesis”. INTERSPEECH, ICSLP, 2006
  12. S.D. Shirbahadurkar, D.S. Bormane, R.L. Kazi, “Subjective and spectrogram analysis of speech synthesizer for Marathi tts using concatenative synthesis”. Recent Trends in Information, Telecommunication and Computing (ITC), 2010
  13. K. P. Mohanan, T. Mohanan “Lexical phonology of the consonant system in Malayalam” Linguistic Inquiry The MIT Press, volume 15, 1984.
  14. K. Panchapagesan, P.P Talukdar, N.S. Krishna, K. Bali and A.G. Ramakrishnan, “Hindi text normalization”, Fifth International Conference on Knowledge Based Computer Systems (KBCS), Hyderabad, India, 2004.
  15. T.Charoenporn,A.Chotimongkol,V.Sornlertlamvanich, “Automatic romanization for Thai”, In Proc. of the 2nd Int. Workshop on East-Asian Language Resources and Evaluation, 1999
  16. M. Cernak, M. Rusko, “An evaluation of synthetic speech using the PESQ measure”, In Proc. Forum Acusticum, Budapest, 2725-2728, 2005
Index Terms

Computer Science
Information Sciences

Keywords

Text-to-Speech Concatenation Speech Synthesis Text Normalization Romanization