Robust Automatic Continuous Speech Segmentation for Indian Languages to Improve Speech to Speech Translation

J. Sangeetha; S. Jothilakshmi

Call for Paper

May Edition

IJCA solicits high quality original research papers for the upcoming May edition of the journal. The last date of research paper submission is 20 April 2026

Submit your paper

Know more

The week's pick

A Unified NIST SP 800-90B Validation Framework for CMOS True Random Number Generators and Quantum Random Number Generators

Che-Ping Lin

Random Articles

Reseach Article

Robust Automatic Continuous Speech Segmentation for Indian Languages to Improve Speech to Speech Translation

by J. Sangeetha, S. Jothilakshmi

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 53 - Number 15

Year of Publication: 2012

Authors: J. Sangeetha, S. Jothilakshmi

10.5120/8496-2444

J. Sangeetha, S. Jothilakshmi . Robust Automatic Continuous Speech Segmentation for Indian Languages to Improve Speech to Speech Translation. International Journal of Computer Applications. 53, 15 ( September 2012), 13-16. DOI=10.5120/8496-2444

@article{ 10.5120/8496-2444,

author = { J. Sangeetha, S. Jothilakshmi },

title = { Robust Automatic Continuous Speech Segmentation for Indian Languages to Improve Speech to Speech Translation },

journal = { International Journal of Computer Applications },

issue_date = { September 2012 },

volume = { 53 },

number = { 15 },

month = { September },

year = { 2012 },

issn = { 0975-8887 },

pages = { 13-16 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume53/number15/8496-2444/ },

doi = { 10.5120/8496-2444 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T20:54:10.026036+05:30

%A J. Sangeetha

%A S. Jothilakshmi

%T Robust Automatic Continuous Speech Segmentation for Indian Languages to Improve Speech to Speech Translation

%J International Journal of Computer Applications

%@ 0975-8887

%V 53

%N 15

%P 13-16

%D 2012

%I Foundation of Computer Science (FCS), NY, USA

Abstract

This paper provides an analysis of phrase and word boundary detection in a background of noise, which occurs in the context of Automatic Recognition System (ASR) and Text-To-Speech (TTS) synthesis systems for Indian languages. ASR and TTS are the major components in Speech To Speech Translation (STST) system. Both are always need a speech signal to be segmented into some basic units like phrases, words, phonemes and syllables. Normal speech is a continuous sequence of sounds with no specific pause to indicate word boundaries. Hence to convert speech into corresponding text, it is necessary to identify the boundaries and phrases present in the continuous speech signal. In this work a robust algorithm for automatic continuous speech segmentation for Indian languages using short time energy and zero crossing rates has been proposed. This proposed method has been tested on various speakers' speech in four different Indian languages such as Tamil, Telugu, Hindi and Malayalam. The results shown to be computationally efficient for real time applications and it performs better than conventional methods for speech samples collected from noisy as well as noise free environment.

References

Jayasankar. T, Dr. R. Thangarajan, Dr. Arputha Vijayaselvi . J " Automatic continuous speech segmentation to improve Tamil text to speech synthesis system", International Journal of Computer Applications (0975 – 8887) Volume 25 No. 1, July 2011.
Er. Amanpreet Kaur and Er. Tarandeep Singh, "Segmentation of Continuous PunjabiSpeech signal into syllables", Proceedings of the World Congress on Engineering and Computer Science 2010 Vol IWCECS 2010, October 20-22, 2010, SanFrancisco,USA.
G. Lakshmisarada, A. Lakshmi, Hema A Moorthy and T. Nagarajan, "Automatic transcription of continuous speech into syllable-like units for Indian languages", Sadhana Vol. 34, Part 2, April 2009, pp. 221–233. © Printed in India.
T. Nagarajan, H. A. Murthy "Subband –Based Group Delay Segmentation Spontaneous Speech into Syllable like Units" EURASIP JOURNAL on Applied signalprocessing 2004.
Deller J. R. Jr. , Hansen J. L. H. and Proakis J. G. : "Discrete Time Processing of Speech Signals", IEEE Press, NJ, 2000.
K. Ishizaka and J. L Flanagan, "Synthesis of voiced Sounds from a Two-Mass Model of the Vocal Chords," Bell System Technical J. , 50(6): 1233-1268, July-Aug. , 1972.
Atal, B. ; Rabiner, L. , "A pattern recognition approach to voiced-unvoiced-silence Classification with applications to speech recognition" Acoustics, Speech, and Signal Processing [see also IEEE Transactions on Signal Processing], IEEE Transactions on , Volume: 24 , Issue: 3 , Jun 1976, Pages: 201 - 212.
G. Childers, M. Hand, J. M. Larar, "Silent and Voiced/Unvoiced/ Mixed Excitation(Four-Way), Classification of Speech", IEEE Transaction on ASSP, Vol-37, No-11, pp. 1771-74, Nov 1989. 9.
http://www. nowpublishers. com/product. aspx?product=SIG&doi=2000000001&section=x1-56r1

Index Terms

Computer Science

Information Sciences

Keywords

Automatic Segmentation Indian languages Short Time Energy Zero Crossing Rate