We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 November 2024
Reseach Article

Robust Automatic Continuous Speech Segmentation for Indian Languages to Improve Speech to Speech Translation

by J. Sangeetha, S. Jothilakshmi
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 53 - Number 15
Year of Publication: 2012
Authors: J. Sangeetha, S. Jothilakshmi
10.5120/8496-2444

J. Sangeetha, S. Jothilakshmi . Robust Automatic Continuous Speech Segmentation for Indian Languages to Improve Speech to Speech Translation. International Journal of Computer Applications. 53, 15 ( September 2012), 13-16. DOI=10.5120/8496-2444

@article{ 10.5120/8496-2444,
author = { J. Sangeetha, S. Jothilakshmi },
title = { Robust Automatic Continuous Speech Segmentation for Indian Languages to Improve Speech to Speech Translation },
journal = { International Journal of Computer Applications },
issue_date = { September 2012 },
volume = { 53 },
number = { 15 },
month = { September },
year = { 2012 },
issn = { 0975-8887 },
pages = { 13-16 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume53/number15/8496-2444/ },
doi = { 10.5120/8496-2444 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:54:10.026036+05:30
%A J. Sangeetha
%A S. Jothilakshmi
%T Robust Automatic Continuous Speech Segmentation for Indian Languages to Improve Speech to Speech Translation
%J International Journal of Computer Applications
%@ 0975-8887
%V 53
%N 15
%P 13-16
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper provides an analysis of phrase and word boundary detection in a background of noise, which occurs in the context of Automatic Recognition System (ASR) and Text-To-Speech (TTS) synthesis systems for Indian languages. ASR and TTS are the major components in Speech To Speech Translation (STST) system. Both are always need a speech signal to be segmented into some basic units like phrases, words, phonemes and syllables. Normal speech is a continuous sequence of sounds with no specific pause to indicate word boundaries. Hence to convert speech into corresponding text, it is necessary to identify the boundaries and phrases present in the continuous speech signal. In this work a robust algorithm for automatic continuous speech segmentation for Indian languages using short time energy and zero crossing rates has been proposed. This proposed method has been tested on various speakers' speech in four different Indian languages such as Tamil, Telugu, Hindi and Malayalam. The results shown to be computationally efficient for real time applications and it performs better than conventional methods for speech samples collected from noisy as well as noise free environment.

References
  1. Jayasankar. T, Dr. R. Thangarajan, Dr. Arputha Vijayaselvi . J " Automatic continuous speech segmentation to improve Tamil text to speech synthesis system", International Journal of Computer Applications (0975 – 8887) Volume 25 No. 1, July 2011.
  2. Er. Amanpreet Kaur and Er. Tarandeep Singh, "Segmentation of Continuous PunjabiSpeech signal into syllables", Proceedings of the World Congress on Engineering and Computer Science 2010 Vol IWCECS 2010, October 20-22, 2010, SanFrancisco,USA.
  3. G. Lakshmisarada, A. Lakshmi, Hema A Moorthy and T. Nagarajan, "Automatic transcription of continuous speech into syllable-like units for Indian languages", Sadhana Vol. 34, Part 2, April 2009, pp. 221–233. © Printed in India.
  4. T. Nagarajan, H. A. Murthy "Subband –Based Group Delay Segmentation Spontaneous Speech into Syllable like Units" EURASIP JOURNAL on Applied signalprocessing 2004.
  5. Deller J. R. Jr. , Hansen J. L. H. and Proakis J. G. : "Discrete Time Processing of Speech Signals", IEEE Press, NJ, 2000.
  6. K. Ishizaka and J. L Flanagan, "Synthesis of voiced Sounds from a Two-Mass Model of the Vocal Chords," Bell System Technical J. , 50(6): 1233-1268, July-Aug. , 1972.
  7. Atal, B. ; Rabiner, L. , "A pattern recognition approach to voiced-unvoiced-silence Classification with applications to speech recognition" Acoustics, Speech, and Signal Processing [see also IEEE Transactions on Signal Processing], IEEE Transactions on , Volume: 24 , Issue: 3 , Jun 1976, Pages: 201 - 212.
  8. G. Childers, M. Hand, J. M. Larar, "Silent and Voiced/Unvoiced/ Mixed Excitation(Four-Way), Classification of Speech", IEEE Transaction on ASSP, Vol-37, No-11, pp. 1771-74, Nov 1989. 9.
  9. http://www. nowpublishers. com/product. aspx?product=SIG&doi=2000000001&section=x1-56r1
Index Terms

Computer Science
Information Sciences

Keywords

Automatic Segmentation Indian languages Short Time Energy Zero Crossing Rate