A Review on Acoustic Phonetic Approach for Marathi Speech Recognition

Rohini B. Shinde; V. P. Pawar

Research Article

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 59 - Number 2

Year of Publication: 2012

10.5120/9523-3934

Rohini B. Shinde, V. P. Pawar. A Review on Acoustic Phonetic Approach for Marathi Speech Recognition. International Journal of Computer Applications. 59, 2 (December 2012), 40-44. DOI=10.5120/9523-3934

@article{ 10.5120/9523-3934,
author = { Rohini B. Shinde, V. P. Pawar },
title = { A Review on Acoustic Phonetic Approach for Marathi Speech Recognition },
journal = { International Journal of Computer Applications },
issue_date = { December 2012 },
volume = { 59 },
number = { 2 },
month = { December },
year = { 2012 },
issn = { 0975-8887 },
pages = { 40-44 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume59/number2/9523-3934/ },
doi = { 10.5120/9523-3934 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}

Abstract

This paper discusses the phoneme, as used in the Marathi language, as a possible basic unit of speech recognition. There is some empirical psychoacoustic support for this choice in the case of humans, and some engineering justification in the case of machines that strive to imitate human abilities. For the purposes of the research described in this paper, a basic unit of speech recognition is the intermediate form of speech information around which much of the recognition processing is organized, whether by human beings or by machines. The general opinion of phoneticians and psycholinguists is that such a unit does exist and that it has relatively few distinct types [1]. For this research, a basic unit is ideally an output of the acoustic-phonetic processing stage and an input to the lexical processing stages.
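
The role of the basic unit as an interface between the two stages can be illustrated with a minimal sketch. This is not the authors' system: the romanized phoneme symbols, the three-word lexicon, and the function names are all hypothetical, and the acoustic-phonetic stage is reduced to collapsing repeated per-frame labels, which a real front end would derive from acoustic features.

```python
from typing import Dict, List, Optional, Tuple

Phonemes = Tuple[str, ...]

# Hypothetical toy lexicon: romanized phoneme sequences mapped to word labels.
# The symbols and entries are illustrative assumptions, not taken from the paper.
LEXICON: Dict[Phonemes, str] = {
    ("aa", "ii"): "aai",
    ("p", "aa", "n", "ii"): "paani",
    ("gh", "a", "r"): "ghar",
}


def acoustic_phonetic_stage(frame_labels: List[str]) -> Phonemes:
    """Stand-in for the front end: collapse runs of identical per-frame
    labels into one phoneme each. A simplification: genuinely doubled
    phonemes would be merged as well."""
    phonemes: List[str] = []
    for label in frame_labels:
        if not phonemes or phonemes[-1] != label:
            phonemes.append(label)
    return tuple(phonemes)


def lexical_stage(phonemes: Phonemes) -> Optional[str]:
    """Look the phoneme sequence up in the lexicon; None if it is unknown."""
    return LEXICON.get(phonemes)


def recognize(frame_labels: List[str]) -> Optional[str]:
    """Chain the stages; the phoneme sequence is the only data passed between them."""
    return lexical_stage(acoustic_phonetic_stage(frame_labels))
```

Because the lexical stage sees only phoneme symbols, either stage could be replaced independently, which is the engineering appeal of organizing recognition around a small inventory of units.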

References

Takayuki Arai and Steven Greenberg. "The temporal properties of spoken Japanese are similar to those of English." In Eurospeech, Rhodes, Greece, September 1997. ESCA.
Richard Schwartz, Jack Klovstad, John Makhoul, and John Sorensen. "A preliminary design of a phonetic vocoder based on a diphone model." In ICASSP, volume 1, pages 32-35, Denver, Colorado, April 1980. IEEE.
M. Cravero, R. Pieraccini, and F. Raineri. "Definition and evaluation of phonetic units for speech recognition by hidden Markov models." In ICASSP, volume 3, pages 2235-2238, Tokyo, Japan, April 1986. IEEE.
N. Rex Dixon and Harvey F. Silverman. "The 1976 modular acoustic processor." IEEE Transactions on Acoustics, Speech, and Signal Processing, ASSP-25(5):367-379, October 1977.
Nelson Morgan, Herve Bourlard, Steve Greenberg, and Hynek Hermansky. "Stochastic perceptual auditory-event-based models for speech recognition." In ICSLP, pages 1943-1946, Yokohama, Japan, September 1994.
L. Bahl, P. Cohen, A. Cole, F. Jelinek, B. Lewis, and R. Mercer. "Further results on the recognition of a continuously read natural corpus." In ICASSP, volume 3, pages 872-875, Denver, Colorado, April 1980. IEEE.
Osamu Fujimura. "Syllable as concatenated demisyllables and affixes." Journal of the Acoustical Society of America, 59(suppl. 1):S55, Spring 1976.
Osamu Fujimura. "Syllable as a unit of speech recognition." IEEE Transactions on Acoustics, Speech, and Signal Processing, ASSP-23(1):82-87, February 1975.
Gopalakrishna Anumanchipalli and Rahul Chitturi. "Development of Indian Language Speech Databases for Large Vocabulary Speech Recognition Systems."
P. Ramesh Babu. "Digital Signal Processing." Scitech Publications (India) Pvt. Ltd.
Lawrence Rabiner and Biing-Hwang Juang. "Fundamentals of Speech Recognition." Pearson Education (Singapore) Pte. Ltd., Indian Branch.
Vinay K. Ingle and John G. Proakis. "Digital Signal Processing: A MATLAB Based Approach."
John G. Proakis and Dimitris G. Manolakis. "Digital Signal Processing: Principles, Algorithms and Applications."
Farooq Husain. "Digital Signal Processing."
Shripad Bhagwat. "Marathi Grammar Book."
J. Sethi and P. V. Dhamija. "A Course in Phonetics and Spoken English."

Index Terms

Computer Science

Information Sciences

Keywords

ANN (Artificial Neural Network); Discrete Cosine Transform (DCT); Fast Fourier Transform (FFT); Linear Predictor Coefficients (LPC); Swara (Vowels in Marathi); Vyanjana (Consonants in Marathi); MLSR (Marathi Language Speech Recognition)