Performance Evaluation of Speech Synthesis Techniques for Marathi Language

Sangramsing Kayte; Monica Mundada; Charansing Kayte

Call for Paper

September Edition

IJCA solicits high quality original research papers for the upcoming September edition of the journal. The last date of research paper submission is 20 August 2025

Submit your paper

Know more

The week's pick

The Incorporation Of Register Capping To The Model Of The Rename Register File Using Markov Chain

An Do Wei-Ming Lin

Random Articles

Reseach Article

Performance Evaluation of Speech Synthesis Techniques for Marathi Language

by Sangramsing Kayte, Monica Mundada, Charansing Kayte

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 130 - Number 3

Year of Publication: 2015

Authors: Sangramsing Kayte, Monica Mundada, Charansing Kayte

10.5120/ijca2015907023

Sangramsing Kayte, Monica Mundada, Charansing Kayte . Performance Evaluation of Speech Synthesis Techniques for Marathi Language. International Journal of Computer Applications. 130, 3 ( November 2015), 45-50. DOI=10.5120/ijca2015907023

@article{ 10.5120/ijca2015907023,

author = { Sangramsing Kayte, Monica Mundada, Charansing Kayte },

title = { Performance Evaluation of Speech Synthesis Techniques for Marathi Language },

journal = { International Journal of Computer Applications },

issue_date = { November 2015 },

volume = { 130 },

number = { 3 },

month = { November },

year = { 2015 },

issn = { 0975-8887 },

pages = { 45-50 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume130/number3/23193-2015907023/ },

doi = { 10.5120/ijca2015907023 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T23:24:14.812248+05:30

%A Sangramsing Kayte

%A Monica Mundada

%A Charansing Kayte

%T Performance Evaluation of Speech Synthesis Techniques for Marathi Language

%J International Journal of Computer Applications

%@ 0975-8887

%V 130

%N 3

%P 45-50

%D 2015

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Text to speech synthesis (TTS) is the production of artificial speech by a machine for the given text as input. The speech synthesis can be achieved by concatenation and Hidden Markov Model techniques. The voice synthesized by these techniques should be evaluated for quality. The study extends towards the comparative analysis for quality of speech synthesis using hidden markov model and unit selection approach. The quality of synthesized speech is analyzed for subjective measurement using mean opinion score and objective measurement based on mean square score and peak signal-to-noise ratio (PSNR). The quality is also accessed by Mel-frequency cepstral coefficient features for synthesized speech. The experimental analysis shows that unit selection method results in better synthesized voice than hidden markov model.

References

Mohammed Waseem, C.N Sujatha, “Speech Synthesis System for Indian Accent using Festvox”, International journal of Scientific Engineering and Technology Research, ISSN 2319-8885 Vol.03,Issue.34 November-2014, Pages:6903-6911
Sangramsing Kayte, Kavita waghmare ,Dr. Bharti Gawali “Marathi Speech Synthesis: A review” International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169 Volume: 3 Issue: 6 3708 – 3711.
S. Martincic- Ipsic and I. Ipsic, “Croatian HMM Based Speech Synthesis,” 28th Int. Conf. Information Technology Interfaces ITI 2006, pp.19-22, 2006, Cavtat, Croatia.
Alex Acero, “Formant Analysis and Synthesis using Hidden Markov Models”. Proceedings of Eurospeech conference. September 1999
R.sproat, J. Hirschberg, and D. Yarowsky, “A corpus-based synthesizer”, Proc. ICSLP, pp.563-566, 1992.
T.Yoshimura, K.Tokuda, T. Masuko, T. Kobayashi and T. Kitamura,“Simultaneous Modeling of Spectrum, Pitch and Duration in HMM-Based Speech Synthesis”In Proc. of ICASSP 2000, vol 3, pp.1315-1318, June 2000.
A. Black, P. Taylor, and R. Caley, “The Festival Speech Synthesis System System documentation Edition 1.4, for Festival Version 1.4.3 27th December 2002.
Series P: Telephone Transmission Quality “Methods for objective and subjective assessment of quality "- Methods for Subjective Determination of Transmission Quality ITU-T Recommendation P.800.
ITU-T P.830, Subjective performance assessment of telephone-band and wideband digital codecs
Lehmann, E. L.; Casella, George. “Theory of Point Estimation (2nd ed.). New York: Springer. ISBN 0-387-98502-6. MR 1639875
Huynh-Thu, Q.; Ghanbari, M. (2008). "Scope of validity of PSNR in image/video quality assessment". Electronics Letters 44 (13): 800. doi:10.1049/el:20080522
SR Quackenbush, TP Barnwell, MA Clements, Objective Measures of Speech Quality(Prentice-Hall, New York, NY, USA, 1988)
AW Rix, MP Hollier, AP Hekstra, JG Beerends, PESQ, the new ITU standard for objective measurement of perceived speech quality—part 1: time alignment. Journal of the Audio Engineering Society 50, 755–764 (2002)
JG Beerends, AP Hekstra, AW Rix, MP Hollier, PESQ, the new ITU standard for objective measurement of perceived speech quality—part II: perceptual model. Journal of the Audio Engineering Society 50, 765–778 (2002)
ITU-T P.862, Perceptual evaluation of speech quality: an objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech.codecs 2001
Monica Mundada, Bharti Gawali, Sangramsing Kayte "Recognition and classification of speech and its related fluency disorders" Monica Mundada et al, / (IJCSIT) nternational Journal of Computer Science and Information Technologies, Vol. 5 (5) , 2014, 6764-6767
Monica Mundada, Sangramsing Kayte, Dr. Bharti Gawali "Classification of Fluent and Dysfluent Speech Using KNN Classifier" International Journal of Advanced Research in Computer Science and Software Engineering Volume 4,Issue 9, September 2014
Sangramsing N.kayte “Marathi Isolated-Word Automatic Speech Recognition System based on Vector Quantization (VQ) approach” 101th Indian Science Congress Jammu University 03th Feb to 07 Feb 2014.
Sangramsing Kayte, Monica Mundada "Study of Marathi Phones for Synthesis of Marathi Speech from Text" International Journal of Emerging Research in Management &Technology ISSN: 2278-9359 (Volume-4, Issue-10) October 2015
Sangramsing Kayte, Monica Mundada "Study of Marathi Phones for Synthesis of Marathi Speech from Text" International Journal of Emerging Research in Management &Technology ISSN: 2278-9359 (Volume-4, Issue-10) October 2015

Index Terms

Computer Science

Information Sciences

Keywords

Keyword TTS MOS HMM Unit Selection Mean Variance.