A Review on Improvising Robustness of Speaker Recognition System

Research Article

Published in February 2015 by Kailashnath J K, Rathnakara. S

Advanced Computing and Communication Techniques for High Performance Applications

Foundation of Computer Science USA

ICACCTHPA2014 - Number 5

February 2015

Authors: Kailashnath J K, Rathnakara. S

Kailashnath J K, Rathnakara. S. A Review on Improvising Robustness of Speaker Recognition System. Advanced Computing and Communication Techniques for High Performance Applications. ICACCTHPA2014, 5 (February 2015), 30-33.

@article{
  author = { Kailashnath J K, Rathnakara. S },
  title = { A Review on Improvising Robustness of Speaker Recognition System },
  journal = { Advanced Computing and Communication Techniques for High Performance Applications },
  issue_date = { February 2015 },
  volume = { ICACCTHPA2014 },
  number = { 5 },
  month = { February },
  year = { 2015 },
  issn = { 0975-8887 },
  pages = { 30-33 },
  numpages = 4,
  url = { /proceedings/icaccthpa2014/number5/19497-6058/ },
  publisher = {Foundation of Computer Science (FCS), NY, USA},
  address = {New York, USA}
}

%0 Proceeding Article
%1 Advanced Computing and Communication Techniques for High Performance Applications
%A Kailashnath J K
%A Rathnakara. S
%T A Review on Improvising Robustness of Speaker Recognition System
%J Advanced Computing and Communication Techniques for High Performance Applications
%@ 0975-8887
%V ICACCTHPA2014
%N 5
%P 30-33
%D 2015
%I International Journal of Computer Applications

Abstract

Speaker recognition is the process by which a machine authenticates the claimed identity of a person from voice characteristics. Major applications include biometric identification and security. Speaker recognition involves converting a speech waveform into features that are useful for further processing. Direct analysis and synthesis of the complex voice signal is difficult because the signal contains too much information. Therefore, digital signal processing steps such as feature extraction and feature matching are introduced to represent the voice signal. Many algorithms and techniques are available, such as Linear Predictive Coding (LPC), Hidden Markov Models (HMM), and Artificial Neural Networks (ANN). First, the human voice is converted into digital form, producing data that represent the signal level at every discrete time step. The digitized speech samples are then processed into Mel Frequency Cepstral Coefficients (MFCC) to produce voice features. The resulting feature coefficients are then passed through an ANN, which selects the pattern in the database that best matches the input frame so as to minimize the error between them. This paper presents a speaker recognition system with modifications to the computation phases of MFCC during feature extraction, and with an ANN for feature matching, with the aim of designing an accurate and robust speaker recognition system.
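The feature extraction stage described above can be illustrated with a minimal sketch of the standard textbook MFCC pipeline: framing, Hamming windowing, power spectrum, mel filterbank, logarithm, and discrete cosine transform. This is not the modified computation proposed in the paper, which the abstract does not detail. All function names and parameter values (frame length 256, hop 128, 20 filters, 13 coefficients, 8 kHz sampling rate) are assumptions chosen for illustration, and a naive DFT is used so that the example needs only the Python standard library.

```python
import cmath
import math


def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def frame_signal(signal, frame_len, hop):
    """Split the signal into overlapping frames; a trailing partial frame is dropped."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]


def hamming(n):
    return [0.54 - 0.46 * math.cos(2.0 * math.pi * i / (n - 1)) for i in range(n)]


def power_spectrum(frame):
    """Naive O(N^2) DFT power spectrum for bins 0..N/2 (illustrative, not fast)."""
    n = len(frame)
    spec = []
    for k in range(n // 2 + 1):
        s = sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        spec.append(abs(s) ** 2 / n)
    return spec


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters with centres spaced evenly on the mel scale."""
    n_bins = n_fft // 2 + 1
    mel_max = hz_to_mel(sample_rate / 2.0)
    points = [mel_to_hz(mel_max * i / (n_filters + 1)) for i in range(n_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * p / sample_rate)) for p in points]
    bank = []
    for m in range(1, n_filters + 1):
        lo, mid, hi = bins[m - 1], bins[m], bins[m + 1]
        row = [0.0] * n_bins
        for k in range(lo, mid):      # rising edge of the triangle
            row[k] = (k - lo) / (mid - lo)
        for k in range(mid, hi):      # falling edge of the triangle
            row[k] = (hi - k) / (hi - mid)
        bank.append(row)
    return bank


def dct2(values, n_coeffs):
    """Unnormalised DCT-II, keeping the first n_coeffs coefficients."""
    n = len(values)
    return [sum(v * math.cos(math.pi * c * (i + 0.5) / n) for i, v in enumerate(values))
            for c in range(n_coeffs)]


def mfcc(signal, sample_rate, frame_len=256, hop=128, n_filters=20, n_coeffs=13):
    """Return one list of n_coeffs cepstral coefficients per frame."""
    window = hamming(frame_len)
    bank = mel_filterbank(n_filters, frame_len, sample_rate)
    features = []
    for frame in frame_signal(signal, frame_len, hop):
        windowed = [x * w for x, w in zip(frame, window)]
        power = power_spectrum(windowed)
        # Small constant avoids log(0) when a filter captures no energy.
        log_energies = [math.log(sum(p * h for p, h in zip(power, row)) + 1e-10)
                        for row in bank]
        features.append(dct2(log_energies, n_coeffs))
    return features


# Synthetic 440 Hz tone standing in for digitized speech (assumed test input).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 440 * t / sample_rate) for t in range(1024)]
features = mfcc(tone, sample_rate)
print(len(features), len(features[0]))  # prints "7 13"
```

Each row of `features` is the kind of per-frame coefficient vector that a feature matching stage, such as the ANN mentioned in the abstract, would take as input. In practice a pre-emphasis filter and an FFT-based spectrum would typically replace the simplified steps used here.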

Index Terms

Computer Science

Information Sciences

Keywords

ANN, MFCC, speaker recognition system, windowing