CFP last date
20 December 2024
Reseach Article

Machine Recognition of Emotion Rich Hindi Digits

Published on May 2012 by Shweta Sinha, Anurag Jain
National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011
Foundation of Computer Science USA
RTMC - Number 3
May 2012
Authors: Shweta Sinha, Anurag Jain
6dc3fe9f-0ca6-424d-9718-9398dd66dc38

Shweta Sinha, Anurag Jain . Machine Recognition of Emotion Rich Hindi Digits. National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011. RTMC, 3 (May 2012), 26-30.

@article{
author = { Shweta Sinha, Anurag Jain },
title = { Machine Recognition of Emotion Rich Hindi Digits },
journal = { National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011 },
issue_date = { May 2012 },
volume = { RTMC },
number = { 3 },
month = { May },
year = { 2012 },
issn = 0975-8887,
pages = { 26-30 },
numpages = 5,
url = { /proceedings/rtmc/number3/6639-1022/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011
%A Shweta Sinha
%A Anurag Jain
%T Machine Recognition of Emotion Rich Hindi Digits
%J National Workshop-Cum-Conference on Recent Trends in Mathematics and Computing 2011
%@ 0975-8887
%V RTMC
%N 3
%P 26-30
%D 2012
%I International Journal of Computer Applications
Abstract

This paper discusses the recognition of Hindi digits based on emotion rich small vocabulary. A feed forward multilayer neural network is trained by Back propagation method for speaker independent isolated word recognition. Mel Frequency Cepstral Coefficients(MFCC) are extracted as speech features. These features are used to train the Multi Layer Feed Forward network(MLFFN) Network . The same routine is applied to signals during recognition stage and unknown test patterns are classified to the nearest pattern. Analysis based on varying number of hidden neurons in the network and variation in number of speech features is presented here. The network is trained with input waves captured in neutral emotion and is tested against data in sad and surprise emotion. It has been observed that the MLFFN works as good classifier for test data and number of speech features extracted plays a very important role in recognition of isolated Hindi digits through machine.

References
  1. Lawrence Rabiner, Biing-Hwang Juang , "Fundamental of speech Recognition", Pearson Publications,Second Edition.
  2. H. S. Li, J. Liu, R. S. Liu, "High Performance Mandarin Digit Speech Recognition". Journal of Tsinghua University (Science and Technology), 2000.
  3. H. S. Li, M. J. Yang, R. S. Liu, "Mandarin Digital Speech Recognition Adaptive Algorithm". Journal of Circuits and Systems, Vol. 4, No. 2, 1999.
  4. BinLu, Jing-Jing Su," Research on Isolated Word Speech Recognition Based On Biomimetic Pattern Recognition", International Conference on Artificial Intelligence and Computational Intelligence, IEEE Computer Society, pp. 436-439 , 2009
  5. J. Chen, K. K. Paliwal, S. Makamura, "Cepstrum derived from differentiated power spectrum for robust speech recognition," Speech communication, vol. 41, pp. 469- 484, 2003
  6. Mike Schuster and Kuldip K. Paliwal , "Bidirectional Recurrent NeuralNetworks" ,IEEE Transaction on Signal Processing , Vol. 45, No. 11, pp. 2673- 2681, November 1997
  7. K B Khanchandani and Moiz A Hussain , "Emotion Recognition using Multilayer Perceptron and Generalized Feed Forward Neural Network",Journal of Scientific & Industrial Research , Vol. 68, pp. 367-371, May 2009
  8. Eric H . C . Choi " On Compensating The Mel-Frequency Cepstral Coefficients for Noisy Speech Recognition", 29th Australian Computer Science Conference , 2006
  9. Lecture Notes of Summer School on ASR-10 (2010), 5th-9th Sep 2010, Osmania University, Organized by IIIT Hyderabad.
Index Terms

Computer Science
Information Sciences

Keywords

"gpu Nvidia Cuda Ann Classifier training Pattern Recognition. "