A Novel Feature Extraction Technique for Speaker Identification

Amita Dev

Research Article

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 16 - Number 6

Year of Publication: 2011

Authors: Amita Dev

10.5120/2016-2720

Amita Dev. A Novel Feature Extraction Technique for Speaker Identification. International Journal of Computer Applications. 16, 6 (February 2011), 25-28. DOI=10.5120/2016-2720

@article{ 10.5120/2016-2720,
  author = { Amita Dev },
  title = { A Novel Feature Extraction Technique for Speaker Identification },
  journal = { International Journal of Computer Applications },
  issue_date = { February 2011 },
  volume = { 16 },
  number = { 6 },
  month = { February },
  year = { 2011 },
  issn = { 0975-8887 },
  pages = { 25-28 },
  numpages = {9},
  url = { https://ijcaonline.org/archives/volume16/number6/2016-2720/ },
  doi = { 10.5120/2016-2720 },
  publisher = {Foundation of Computer Science (FCS), NY, USA},
  address = {New York, USA}
}

%0 Journal Article
%1 2024-02-06T20:04:10.678516+05:30
%A Amita Dev
%T A Novel Feature Extraction Technique for Speaker Identification
%J International Journal of Computer Applications
%@ 0975-8887
%V 16
%N 6
%P 25-28
%D 2011
%I Foundation of Computer Science (FCS), NY, USA

Abstract

This paper presents a novel feature extraction approach for speaker identification when speech is corrupted by additive noise. Environmental mismatch between training and testing data degrades the performance of a speaker identification system. The degradation is primarily due to background noise present when trying to match a given speaker against the set of known speakers in a database. Mel frequency cepstral coefficients (MFCCs) are perhaps the most widely used front end in state-of-the-art speaker identification systems, but they are very sensitive to additive noise. To overcome this bottleneck, a temporal filtering procedure on the autocorrelation sequence is proposed. The proposed feature, called Relative Autocorrelation Mel Frequency Cepstral Coefficients (A-MFCC), is derived by filtering the temporal trajectories of the short-time one-sided autocorrelation sequence, which minimizes the effect of additive noise. No prior knowledge of the noise characteristics is required, and the additive noise may be colored. For speaker identification, a Hindi database was constructed from speech samples of each known speaker. Feature vectors (MFCCs and A-MFCCs) were extracted from the samples by short-term spectral analysis and processed further by vector quantization to locate the clusters in the feature space. Experimental results indicated that A-MFCCs significantly improved the performance of the speaker identification system in noisy environments.
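The temporal filtering step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the filter, frame sizes, or number of lags, so everything below is an assumption: the function names (`frame_signal`, `one_sided_autocorr`, `temporal_delta_filter`) are hypothetical, and a simple regression (delta) filter stands in for whatever filter the paper actually uses. The idea it shows is that if additive noise is roughly stationary, its contribution to each autocorrelation lag is roughly constant across frames, so a filter along the time axis whose taps sum to zero suppresses it. The subsequent mel filterbank, cepstral, and vector quantization stages are omitted.

```python
import numpy as np


def frame_signal(x, frame_len, hop):
    """Split a 1-D signal into overlapping frames, one frame per row."""
    n_frames = 1 + (len(x) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]


def one_sided_autocorr(frames, n_lags):
    """Biased short-time autocorrelation for lags 0..n_lags-1 of each frame."""
    n = frames.shape[1]
    out = np.empty((frames.shape[0], n_lags))
    for k in range(n_lags):
        out[:, k] = np.sum(frames[:, : n - k] * frames[:, k:], axis=1) / n
    return out


def temporal_delta_filter(traj, half_width=2):
    """Filter each lag trajectory along the frame (time) axis.

    Assumed filter: a regression/delta FIR filter. Its taps sum to zero,
    so any component that is constant over frames is removed.
    """
    taps = np.arange(-half_width, half_width + 1, dtype=float)
    taps /= np.sum(taps ** 2)
    # Repeat edge frames so the output has as many frames as the input.
    padded = np.pad(traj, ((half_width, half_width), (0, 0)), mode="edge")
    out = np.zeros(traj.shape, dtype=float)
    for i, t in enumerate(taps):
        out += t * padded[i : i + traj.shape[0]]
    return out


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    signal = rng.standard_normal(4000)          # stand-in for a speech signal
    frames = frame_signal(signal, frame_len=256, hop=128)
    acorr = one_sided_autocorr(frames, n_lags=13)
    filtered = temporal_delta_filter(acorr)
    print(filtered.shape)
```

In this sketch, adding the same offset to every frame's autocorrelation (an idealized stationary noise term) leaves the filtered output unchanged, which is the property the abstract relies on. Real noise is only approximately stationary, so the suppression would be approximate in practice.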

Index Terms

Computer Science

Information Sciences

Keywords

Speaker identification, vector quantization, relative autocorrelation