BiCross : Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System

Arun Soman; Sachin Kumar S.; Hemanth V. K.

Call for Paper

April Edition

IJCA solicits high quality original research papers for the upcoming April edition of the journal. The last date of research paper submission is 20 March 2026

Submit your paper

Know more

The week's pick

Explainable Hybrid Deep Learning for Automated Diagnosis of Canine Mammary Tumors

Elham Shawky Salama Heba Askr Ashraf Darwish Aboul Ella Hassanien

Random Articles

Reseach Article

BiCross : Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System

by Arun Soman, Sachin Kumar S., Hemanth V. K.

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 29 - Number 4

Year of Publication: 2011

Authors: Arun Soman, Sachin Kumar S., Hemanth V. K.

10.5120/3551-4871

Arun Soman, Sachin Kumar S., Hemanth V. K. . BiCross : Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System. International Journal of Computer Applications. 29, 4 ( September 2011), 41-46. DOI=10.5120/3551-4871

@article{ 10.5120/3551-4871,

author = { Arun Soman, Sachin Kumar S., Hemanth V. K. },

title = { BiCross : Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System },

journal = { International Journal of Computer Applications },

issue_date = { September 2011 },

volume = { 29 },

number = { 4 },

month = { September },

year = { 2011 },

issn = { 0975-8887 },

pages = { 41-46 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume29/number4/3551-4871/ },

doi = { 10.5120/3551-4871 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T20:14:56.119721+05:30

%A Arun Soman

%A Sachin Kumar S.

%A Hemanth V. K.

%T BiCross : Corpus Driven Malayalam Text-to-Speech Synthesis for Interactive Voice Response System

%J International Journal of Computer Applications

%@ 0975-8887

%V 29

%N 4

%P 41-46

%D 2011

%I Foundation of Computer Science (FCS), NY, USA

Abstract

In a text-to-speech system, spoken utterances are automatically produced from text. In this paper, we present a corpus-driven Malayalam text-to-speech (TTS) system based on the concatenative synthesis approach. The most important qualities of a synthesized speech are naturalness and intelligibility. In this system, words and syllables are used as the basic units for synthesis. Our corpus consists of speech waveforms that are collected for most frequently used words in different domains. The speaker is selected through subjective and objective evaluation of natural and synthesized waveform. The proposed Malayalam text-to-speech system is implemented in Java multimedia framework (JMF) and runs on both in Windows and Linux platforms. The proposed system provides utility to save the synthesized output. The output generated by the proposed Malayalam text-to-speech synthesis system resembles natural human voice. Our text to speech reader software converts a Malayalam text to speech wav file that has high rates of intelligibility and comprehensibility.

References

Marian Macchi, Bellcore,”Issues in text-to-speech synthesis” In Proc. EEE International Joint Symposia on Intelligence and Systems, pp.318-325, 1998
N.P. Narendra, K. Sreenivasa Rao ,Krishnendu Ghosh, Ramu Reddy Vempada, Sudhamay Maity “Development of syllable-based text to speech synthesis system in Bengali” DOI: 10.1007/s10772-011-9094-4
C. Pornpanomchai, N. Soontharanont, C. Langla, N. Wongsawat, "A dictionary-based approach for Thai text to speech (TTTS)," icmtma, In Proc. Third Int. Conference on Measuring Technology and Mechatronics Automation, vol. 1, pp.40-43, 2011
M. N. Rao, S. Thomas, T. Nagarajan, and H. A. Murthy, “Text-to-speech synthesis using syllable like units”. In National conference on communication, IIT Kharagpur, pp. 227–280, 2005.
D. H. Klatt, “Review of text-to-speech conversion for English”. The Journal of the Acoustical Society of America, 82, 737–793, 1987
M. Sreekanth, & A.G. Ramakrishnan, “Festival based maiden tts system for Tamil language”. In Proc. 3rd language and technology conf., Poznan, Poland, October pp. 187–191, 2007.
S. P. Kishore and A. W. Black, “Unit size in unit selection speech synthesis,” in Proc. Eurospeech 2003, Sept. 2003.
N.S. Krishna, P.P. Talukdar, K. Bali, A.G. Ramakrishnan, “Duration modeling for Hindi text-to-speech synthesis system”, in Proc. Of Int. Conference on Spoken Language Processing (ICSLP’04), Korea, 2004
A. Hunt, & A. Black, “Unit selection in a concatenative speech synthesis system using a large speech database”, In Proc. of EEE int. Conference acoust, speech, and signal processing, vol. 1, pp. 373–376, 1996
N. Sridhar Krishna, Hema A.Murthy and Timothy A. Gonsalves, “Text-to-Speech (TTS) in Indian Languages”, Int. Conference on Natural Language Processing, ICON-2002, Mumbai, pp. 317.326, 2002.
Robert J. Utama, Ann K. Syrdal, Alistair Conkie,” Six approaches to limited domain concatenative speech synthesis”. INTERSPEECH, ICSLP, 2006
S.D. Shirbahadurkar, D.S. Bormane, R.L. Kazi, “Subjective and spectrogram analysis of speech synthesizer for Marathi tts using concatenative synthesis”. Recent Trends in Information, Telecommunication and Computing (ITC), 2010
K. P. Mohanan, T. Mohanan “Lexical phonology of the consonant system in Malayalam” Linguistic Inquiry The MIT Press, volume 15, 1984.
K. Panchapagesan, P.P Talukdar, N.S. Krishna, K. Bali and A.G. Ramakrishnan, “Hindi text normalization”, Fifth International Conference on Knowledge Based Computer Systems (KBCS), Hyderabad, India, 2004.
T.Charoenporn,A.Chotimongkol,V.Sornlertlamvanich, “Automatic romanization for Thai”, In Proc. of the 2nd Int. Workshop on East-Asian Language Resources and Evaluation, 1999
M. Cernak, M. Rusko, “An evaluation of synthetic speech using the PESQ measure”, In Proc. Forum Acusticum, Budapest, 2725-2728, 2005

Index Terms

Computer Science

Information Sciences

Keywords

Text-to-Speech Concatenation Speech Synthesis Text Normalization Romanization