An Intelligent Text to Speech System for Windows based Systems and Mobile Devices

Abhishek Srivastava; Akshay Sharma; Neelu Jain

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 21 July 2025

Submit your paper

Know more

The week's pick

Navigating the Future of Cybersecurity: A Strategic Approach to Crypto Agility for Modern Enterprises

Aditya Gupta

Random Articles

Characterization of Angular Error in Magnetic Head Tracking

July

2013

Design and Implementation of a Wireless Gesture Controlled Robotic Arm with Vision

October

2013

A Survey on Security in Medical Image Communication

September

2011

Application Specific Cache Simulation Analysis for Application Specific Instructionset Processor

March

2014

Reseach Article

An Intelligent Text to Speech System for Windows based Systems and Mobile Devices

by Abhishek Srivastava, Akshay Sharma, Neelu Jain

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 90 - Number 16

Year of Publication: 2014

Authors: Abhishek Srivastava, Akshay Sharma, Neelu Jain

10.5120/15801-4625

Abhishek Srivastava, Akshay Sharma, Neelu Jain . An Intelligent Text to Speech System for Windows based Systems and Mobile Devices. International Journal of Computer Applications. 90, 16 ( March 2014), 1-5. DOI=10.5120/15801-4625

@article{ 10.5120/15801-4625,

author = { Abhishek Srivastava, Akshay Sharma, Neelu Jain },

title = { An Intelligent Text to Speech System for Windows based Systems and Mobile Devices },

journal = { International Journal of Computer Applications },

issue_date = { March 2014 },

volume = { 90 },

number = { 16 },

month = { March },

year = { 2014 },

issn = { 0975-8887 },

pages = { 1-5 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume90/number16/15801-4625/ },

doi = { 10.5120/15801-4625 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T22:11:10.638166+05:30

%A Abhishek Srivastava

%A Akshay Sharma

%A Neelu Jain

%T An Intelligent Text to Speech System for Windows based Systems and Mobile Devices

%J International Journal of Computer Applications

%@ 0975-8887

%V 90

%N 16

%P 1-5

%D 2014

%I Foundation of Computer Science (FCS), NY, USA

Abstract

TTS (Text-to-speech) systems are used invariably as part of our daily lives and have come a long way. In this paper TTS system using Concatenative synthesis based on the SDK (Software Development Kit) platform has been presented. This system is compatible with both computer and mobile devices. It has a user friendly GUI (graphical user interface) to control various speech parameters. Speech signal produced can be saved and listened to whenever required. Signal analysis of the output speech can also be done using TTS System. The results of these signal analysis along with the stored speech signal can be used for further applications depending upon the requirements. It is an intelligent system and is able to overcome various normalization problems.

References

Tokuda et al," Speech Synthesis Based on Hidden Markov Models",Proceedings of the IEEE | Vol. 101, No. 5,pp. 1234-1252 May 2013
J. Hamzabegovic*, D. Kalpi? "A Proposal for Development of Software to SupportSpecific Learning Difficulties",12th International Conference on Telecommunications - ConTEL 2013,pp. 207-214,ISBN: 978-953-184-180-1, Zagreb, Croatia
JuergenSchroeter AT&T Laboratories
A. G. Ramakrishnan, Lakshmish N Kaushik, LaxmiNarayana. M, "Natural Language Processing for Tamil TTS", Proc. 3rd Language and Technology Conference, Poznan, Poland, October 5-7, 2007
Chen, G. L. , Yue, D. J. , Zu, Y. Q. , Yu, Z. L. , "An embedded English synthesis approach based on speech concatenation and smoothing", ISCSLP2004, pp. 157-160, Hong Kong, Dec. 2004
T. Dutoit, "An Introduction to Text-to-Speech Synthesis"Dordrecht/Boston/London: Kluwer Academic Publishers, 1997.
T. Styger and E. Keller, Fundamentals ofSpeech Synthesis and Speech Recognition: Basic Concepts, State of the Art, and Future Challenges Formant synthesis, In Keller E. (ed. ), 109-128, Chichester: John Wiley, 1994. , 4,5
13. D. H. Klatt, ''Software for a cascade/parallel formant synthesizer,'' J. Acoust. Soc. Am. , vol. 67, no. 3,971–995, 1980.
J. Allen, M. S. Hunnicutt, and D. Klatt, From Text to Speech, The MITalk System, Cambridge: CambridgeUniversity Press, 1987
Moulines, E. , Charpentier, F. "Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones", Speech Communication, Vol. 9, pp. 453-468, 1990
Sproat, R. , Hirschberg, J. , Yarowsky, D. , "A corpus-based synthesizer", ICSLP1992, pp. 563-566, Alberta, Canada, Oct. 1992
Van Santen, J. , Sproat, R. , Olive, J. , Hirshberg, J. , editors, Progress in Speech Synthesis, Springer Verlag, New York, 1995
IngmundBjørkan,Speech Generation and Modification in Concatenative Speech Synthesis Ph D Thesis,Norwegian University of Science and Technology . Faculty of Information Technology, Mathematics and Electrical Engineering, Department of Electronics and Telecommunications 2010
Sproat, R. and Oliver, J. "An Approach to Text-to-Speech Synthesis". Chapter 17 in book "Speech Coding and Synthesis", Elsevier, 1995
S. Nakajima and H. Hamada, "Automatic generation of Synthesis Units based on context oriented clustering", Proc. ICASSP 1988, pp. 659-662, (New York, USA), 1988].
R. E. Donovan and E. M. Eide, ''The IBM trainable speech synthesis system,'' in Proc. Int. Conf. Spoken Lang. Process. , 1998, pp. 1703–1706.
B. Beutnagel, A. Conkie, J. Schroeter, Y. Stylianou, and A. Syrdal, ''The AT&T Next-Gen TTS system,'' in Proc. Joint ASA/EAA/DAEA Meeting, 1999,pp. 15–19.
G. Coorman, J. Fackrell, P. Rutten, and B. Coile, ''Segment selection in the L&H realspeak laboratory TTS system,'' in Proc. Int. Conf. Spoken Lang. Process. , 2000,pp. 395–398. ]
http://msdn. microsoft. com/en-us/library/ms720151(v=vs. 85). aspx
http://msdn. microsoft. com/library/windowsphone/develop/ff402529(v=vs. 105). aspx
Zeng et a," Speech dynamic range for cochlear implants" . J. Acoust. Soc. Am. , Vol. 111, No. 1, Pt. 1, Jan. 2002.

Index Terms

Computer Science

Information Sciences

Keywords

TTS SDK Concatenative synthesis GUI