We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 November 2024
Call for Paper
December Edition
IJCA solicits high quality original research papers for the upcoming December edition of the journal. The last date of research paper submission is 20 November 2024

Submit your paper
Know more
Reseach Article

A Review on Acoustic Phonetic Approach for Marathi Speech Recognition

by Rohini B. Shinde, V. P. Pawar
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 59 - Number 2
Year of Publication: 2012
Authors: Rohini B. Shinde, V. P. Pawar
10.5120/9523-3934

Rohini B. Shinde, V. P. Pawar . A Review on Acoustic Phonetic Approach for Marathi Speech Recognition. International Journal of Computer Applications. 59, 2 ( December 2012), 40-44. DOI=10.5120/9523-3934

@article{ 10.5120/9523-3934,
author = { Rohini B. Shinde, V. P. Pawar },
title = { A Review on Acoustic Phonetic Approach for Marathi Speech Recognition },
journal = { International Journal of Computer Applications },
issue_date = { December 2012 },
volume = { 59 },
number = { 2 },
month = { December },
year = { 2012 },
issn = { 0975-8887 },
pages = { 40-44 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume59/number2/9523-3934/ },
doi = { 10.5120/9523-3934 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:05:05.157541+05:30
%A Rohini B. Shinde
%A V. P. Pawar
%T A Review on Acoustic Phonetic Approach for Marathi Speech Recognition
%J International Journal of Computer Applications
%@ 0975-8887
%V 59
%N 2
%P 40-44
%D 2012
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper discusses the phoneme used in Marathi language as a possible basic unit of speech recognition, for which there is some empirical psychoacoustic support in the case of human and some engineering justification in the case of machines striving to imitate human abilities. For the purpose of the research described in this paper, a basic unit of speech recognition is the intermediate form of speech information around which much of the recognition processing is organized for human beings or for machines. The general opinion of phonetician and psycholinguists is that there is indeed such a unit with relatively few distinct types1. For this research a basic unit is ideally an output of acoustic-phonetic processing and an input to the lexical processing stages.

References
  1. Takayuki Arai and Steven Greenberg. "The temporal properties of spoken Japanese are similar to those of English. " Published in Eurospeech, Rhodes, Greece September 1997. ESCA.
  2. Richard Schwartz, Jack Klovstad, John Makhoul, and John Sorensen. A Preliminary design of a phonetic vocoder based on a diphone model. In ICASSP, Volume 1, Pages 32-35, Denver, Colorado, April 1980 IEEE
  3. M. Cravero, R. Pieraccini. And F. Raineri. "Definition and evaluation of phonetic units for speech recognition by hidden Markov Models. " In ICASSP, volume 3, pages 2235-2238, Tokyo, Japan, April 1986. IEEE.
  4. N. Rex Dixon and Harvey F. Silverman. "The 1976 modular acoustic processor. " IEEE Transactions of Acoustics, Speech and signal processing, ASSP-25(5):367-379, October 1977.
  5. Nelson Morgan, Herve Bourlard, Steve Greenberg, and hynek Hermansky. Stochastic Perceptual auditory-event-based models for speech recognition. In ICSLP, Pages 1943-1946, Yokohama, Japan, September 1994.
  6. L. Bahl, P. Cohen, A. Cole, F. Jelinek, B. Lewis, and R. mercer. "Further results on the recognition of a contineously read natural corpus. " In ICASSP, Volume 3, pages 872-875. Denver, Colorado, April 1980. IEEE.
  7. Osamu Fujimura. "Syllable as concatenated demisyllables and affixes. " Journal of the Acoustical Society of America, 59 (suppl. 1):S55,Spring 1976.
  8. Osamu Fujimura. "Syllable as a unit of speech recognition. " IEEE Transactions on Acoustics, Speech and signal processing, ASSP-23(1):82-87, February 1975.
  9. Gopalakrishna Anumanchipalli, Rahul Chitturi, "Development of Indian Language Speech Databases for Large Vocabulary Speech Recognition Systems"
  10. "Digital Signal Processing" By-P. Ramesh Babu Scitech Publications (India) PVT, LTD.
  11. "Fundamental of Speech Recognition" By-Lawrence Rabiner , Biing-Hwang Juang, Published by Pearson Education (Singapore) Pte. Ltd. Indian Branch.
  12. "Digital Signal Processing" A MATLAB based approach. By- Vinay K. Ingle, John G. Proakis.
  13. "Digital Signal Processing-Principles, Algorithms and Applications" John G. Proakis. , Dimitris G. Manolakis.
  14. "Digital Signal Processing" by Farooq Husain.
  15. "Marathi Grammar Book" by Shripad Bhagwat.
  16. "A course in Phonetics and Spoken English"-J. Sethi, P. V. Dhamija
Index Terms

Computer Science
Information Sciences

Keywords

ANN (Artificial Neural Network) Discrete Cosine Transform (DCT) Fast Fourier Transform (FFT) Linear Predictor Coefficients (LPC) Swara (Vowels in Marathi) Vyanjana (Consonants in Marathi) MLSR (Marathi Language Speech Recognition)