State of the Art Research for Bangla Text to Speech on Android Platform

Sheikh Abujar; M. S. I. Shahin; Anisur Rahman; Abdus Sattar

Call for Paper

September Edition

IJCA solicits high quality original research papers for the upcoming September edition of the journal. The last date of research paper submission is 20 August 2025

Submit your paper

Know more

The week's pick

Assessing LLMs as Cognitive Interpreters of Student Prompts: A Typological Framework

Tadeu da Ponte Matevz Vremec Matej Mertik

Random Articles

Reseach Article

State of the Art Research for Bangla Text to Speech on Android Platform

by Sheikh Abujar, M. S. I. Shahin, Anisur Rahman, Abdus Sattar

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 170 - Number 1

Year of Publication: 2017

Authors: Sheikh Abujar, M. S. I. Shahin, Anisur Rahman, Abdus Sattar

10.5120/ijca2017914650

Sheikh Abujar, M. S. I. Shahin, Anisur Rahman, Abdus Sattar . State of the Art Research for Bangla Text to Speech on Android Platform. International Journal of Computer Applications. 170, 1 ( Jul 2017), 19-23. DOI=10.5120/ijca2017914650

@article{ 10.5120/ijca2017914650,

author = { Sheikh Abujar, M. S. I. Shahin, Anisur Rahman, Abdus Sattar },

title = { State of the Art Research for Bangla Text to Speech on Android Platform },

journal = { International Journal of Computer Applications },

issue_date = { Jul 2017 },

volume = { 170 },

number = { 1 },

month = { Jul },

year = { 2017 },

issn = { 0975-8887 },

pages = { 19-23 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume170/number1/28034-2017914650/ },

doi = { 10.5120/ijca2017914650 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-07T00:17:19.199470+05:30

%A Sheikh Abujar

%A M. S. I. Shahin

%A Anisur Rahman

%A Abdus Sattar

%T State of the Art Research for Bangla Text to Speech on Android Platform

%J International Journal of Computer Applications

%@ 0975-8887

%V 170

%N 1

%P 19-23

%D 2017

%I Foundation of Computer Science (FCS), NY, USA

Abstract

There are different kinds of TTS (Text to Speech) systems are already available for Personal computers and web applications. In the Platform of Smart Phone, few of TTS systems are available for Bangla Language. Nowadays android is a popular platform considering Smartphone. There are few Bangla TTS Systems are Available with different kind of Mechanisms and techniques, various kind of tools were used. Here we tried to introduce all mechanisms together and proving a summary above all existing system.

References

Frances Alias, Xavier Servillano, Joan Claudi socoro and Xavier Gonzalvo “Towards High-Quality Next Generation Text-to-Speech Synthesis:A multi domain Approach by Automatic Domain Classification”,IEEE Transactions on AUDIO,SPEECH AND LANGUAG PROCESSING, VOL16,NO,7 september 2008.
Qing Guo, Jie Zhang, Nobuyuki Katae, Hao Yu , “High –Quality Prosody Generation in Mandrain Text-to-Speech system”, FujiTSu Sci.Tech,J., vol.46, No.1,pp.40-46 ,2010.
Gopalakrishna anumanchipalli, Rahul Chitturi, Sachin Joshi, Rohit Kumar, Satinder Pal Singh, R.n.v Sitaram, D.P.Kishore, “Development of Indian Language Speech Databases for Large Vocabulary Speech Recognition System”,
A.Black, H.Zen and K.Tokuda “Statistical parametric speech synthesis”, in proc.ICASSP, Honolulu, HI 2007, vol IV, PP 1229-1232.
G.Bailly, N.Campbell and b.Mobius, “ISCA special session: Hot topics in speech synthesis”, in proc.Eurospeech,Genea, Switzerland, 2003, pp 37-40.
M.Ostendorf and I.Bulyko, “The impact of speech recognition on speech synthesis”, in proc, IEEE Workshop Speech Synthesis, Santa Monica,2002,pp. 99-106.
Text To Speech Synthesis - a knol by Jaibatrik Dutta .
Silvio Ferreia,Celina Thillou, Bernaud Gosselin, “From Picture to Speech: an Innovative Application for Embedded Environment”,
M.Nageshwara Rao, Samuel Thomas, T.Nagarajan and Hema A.Muthy, “Text-to-Speech Syntheis using syllable line units”
Jindrich Matousek, Josef Psutks, Jiri Krita, “Design of speech Corpus for Text-to-Speech Synthesis”. Beckman M. and Elam G. “Guidelines for ToBI Labeling”. Manuscript, version 3, 1997.
Corrigan G., Massey N., and Karaali O. “Generating Segment Durations in a Text-to-Speech System: A Hybrid Rule-Based/Neural Network Approach”. Proc. Eurospeech ’97, Rhodes, September 1997.
Gerson I., Karaali O., Corrigan G., and Massey N. “Neural Network Speech Synthesis”. Speech Science and Technology (SST-96), Australia, 1996.
Karaali O., Corrigan G., and Gerson I. “Speech Synthesis with Neural Networks”. Invited paper, World Congress on Neural Networks (WCNN-96), San Diego, September 1996.
Karaali O., Corrigan G., Gerson I., and Massey N. “Text-to- Speech Conversion with Neural Networks: A Recurrent TDNN Approach”. Proc. Eurospeech ’97, September 1997.
Kiparsky P. “Lexical phonology and morphology”. Linguistics in the morning calm, ed. by I.S. Yang. Seoul: Hanshin, 1982.
Kruskal J. “An overview of sequence comparison”. Time Warps, String Edits, and Macromolecules, edited by Joseph Kruskal and David Sankoff. Reading, MA: Addison- Wesley, 1983.
Linguistic Data Consortium. COMLEX English pronouncing lexicon. Trustees of the University of Pennsylvania, version 0.2, 1995.
Miller C., Karaali O., and Massey N. “Variation and Synthetic Speech”. NWAVE 26, Quebec, October 1997.
Nusbaum H., Francis A., and Luks T. “Comparative valuation of the quality of synthetic speech produced at Motorola”. Research report, Spoken Language Research Laboratory, University of Chicago, 1995.
O’Shaughnessy, D. “Modeling fundamental frequency, and its relationship to syntax, semantics, and phonetics”. Ph.D. thesis, M.I.T., 1976.
Sejnowski T. and Rosenberg C. “NETtalk: a parallel network that learns to pronounce English text”. Complex Systems 1.145-168, 1987.
Seneff S. and Zue V. “Transcription and alignment of the TIMIT database”. M.I.T., 1988.
Tuerk C. and Robinson T. “Speech Synthesis using Artificial Neural Networks Trained on Cepstral Coefficients”. Proc. Eurospeech ’93, Berlin, September 1993.
Ward G. Moby Pronunciator II, 1996.
Weide R. The Carnegie Mellon Pronouncing Dictionary. cmudict.0.4, 1995.

Index Terms

Computer Science

Information Sciences

Keywords

TTS Speech Synthesis Bangla.