CFP last date
20 January 2025
Reseach Article

A Review on Improvising Robustness of Speaker Recognition System

Published on February 2015 by Kailashnath J K, Rathnakara. S
Advanced Computing and Communication Techniques for High Performance Applications
Foundation of Computer Science USA
ICACCTHPA2014 - Number 5
February 2015
Authors: Kailashnath J K, Rathnakara. S
bf640251-2623-4a89-879d-d5d73bd90687

Kailashnath J K, Rathnakara. S . A Review on Improvising Robustness of Speaker Recognition System. Advanced Computing and Communication Techniques for High Performance Applications. ICACCTHPA2014, 5 (February 2015), 30-33.

@article{
author = { Kailashnath J K, Rathnakara. S },
title = { A Review on Improvising Robustness of Speaker Recognition System },
journal = { Advanced Computing and Communication Techniques for High Performance Applications },
issue_date = { February 2015 },
volume = { ICACCTHPA2014 },
number = { 5 },
month = { February },
year = { 2015 },
issn = 0975-8887,
pages = { 30-33 },
numpages = 4,
url = { /proceedings/icaccthpa2014/number5/19497-6058/ },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Proceeding Article
%1 Advanced Computing and Communication Techniques for High Performance Applications
%A Kailashnath J K
%A Rathnakara. S
%T A Review on Improvising Robustness of Speaker Recognition System
%J Advanced Computing and Communication Techniques for High Performance Applications
%@ 0975-8887
%V ICACCTHPA2014
%N 5
%P 30-33
%D 2015
%I International Journal of Computer Applications
Abstract

Speaker Recognition is a process by which a machine authenticates the claimed of a person from voice characteristics. A Major application includes biometric identification and security. Speaker recognition consists of the process to convert a speech waveform into features that are useful for further processing. A direct analysis and Synthesizing the complex voice signal is due to too much information contained in the signal .Therefore the digital signal processes such as Feature Extraction and Feature Matching are introduced to represent the voice signal .There are many algorithms and techniques such as Linear Predictive Coding (LPC), Hidden Markov Model (HMM), Artificial Neural Networks (ANN) and etc. Firstly, human voice is converted into digital signal form to produce digital data representing each level of signal at every discrete time step. The digitized speech samples are then processed using MFCC to produce voice features. After that, the coefficient of voice features can go through ANN to select the pattern that matches the database and input frame in order to minimize the resulting error between them .This paper present the speaker recognition system with modification in the Computation Phases of Mel Frequency Cepstral Coefficients (MFCC) during Feature Extraction and Artificial Neural Networks for Feature matching for designing an accurate/Robust Speaker recognition.

References
  1. Lindasalwa Muda, Mumtaj Begaum and I.Elamvazuthi Voice Recognition Algorithms using Mel Frequency Cepstral (MFCC) and Dynamic Time Wrapping(DTW) Technique ,university Teknologi PETRONAS,Tronoh, Perak
  2. Anand Vardhan Bhalla, Shailesh Kharparkar, Mudit Ratna Bhalla , Performance Improvement of Speaker Recognition system,http://www.ijarcsse.com/ docs/ papers/March2012/volume_2_Issue_3/V2I30050..
  3. Bansood, N.S Seema Kawathekar and Dabhade S.B, Review of Different techniques for speaker Recognition System, Dept of CS & IT, Dr Babashaheb Ambedkar Marathwada University, Aurangabad, MH, India, 2012.
  4. Jamal Price, sophomore student, Design an automatic speech recognition system Using Malta, University of Maryland Eastern Shore Princess Anne.
  5. Douglas A. Reynolds, Member, IEEE, and Richard C. Rose, Member, IEEE, “Robust Text- Independent Speaker Identification Using Gaussian Mixture Speaker Models”, TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995
  6. Sujit kumar Behera, Jetendra, Speaker verification using Mel frequency cepstral coefficient and artificial neural ,network NIT ,Rourkela. http://ethesis.nitrkl.ac.in /3745/1/final_yr_project__thesis.pdf
  7. Speaker Recognition System, minhdo, teaching/speaker recognition, DSP mini Project.
  8. Hui Kong, Xuchun Li, Lei Wang, Earn Khwang Teoh, Jian-Gang Wang, Venkateswarlu.R “Generalized 2D principal component analysis”,Proc. 2005 IEEE International Joint on Volume 1, Aug. 2005.
  9. Geoffrey Hinton, Li Deng, Dong Yu, George Dahl, Abdel-Rahman Mohamed et.al. “Deep Neural Networks For Acoustic Modeling In Speech Recognition”, IEEE Signal Processing Magazine, November 2012.
  10. Zaidi Razak,Noor Jamilah Ibrahim, Emran mohd tamil,mohd Yamani Idna Idris, Mohd yaakob Yusoff,Quranic verse recitation feature extraction using Mel frequency costrel coefficient (MFCC),Universiti Malaya.
  11. Eko Riyanto ,Suryono ,Informatics Engineering STMIK HIMSYA, Semarang, Indonesia
  12. Adjoudj Reda ,Boukelif Aoued ,Evolutionary Engineering and Distributed Information System Laboratory, EEDIS, Computer Science Department, University of sidi Bel- Abbes, Algeria
Index Terms

Computer Science
Information Sciences

Keywords

ANN MFCC Speaker recognition system windowing.