CFP last date
20 January 2025
Reseach Article

A Novel Feature Extraction Technique for Speaker Identification

by Amita Dev
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 16 - Number 6
Year of Publication: 2011
Authors: Amita Dev
10.5120/2016-2720

Amita Dev . A Novel Feature Extraction Technique for Speaker Identification. International Journal of Computer Applications. 16, 6 ( February 2011), 25-28. DOI=10.5120/2016-2720

@article{ 10.5120/2016-2720,
author = { Amita Dev },
title = { A Novel Feature Extraction Technique for Speaker Identification },
journal = { International Journal of Computer Applications },
issue_date = { February 2011 },
volume = { 16 },
number = { 6 },
month = { February },
year = { 2011 },
issn = { 0975-8887 },
pages = { 25-28 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume16/number6/2016-2720/ },
doi = { 10.5120/2016-2720 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T20:04:10.678516+05:30
%A Amita Dev
%T A Novel Feature Extraction Technique for Speaker Identification
%J International Journal of Computer Applications
%@ 0975-8887
%V 16
%N 6
%P 25-28
%D 2011
%I Foundation of Computer Science (FCS), NY, USA
Abstract

This paper presents a novel feature extraction approach for speaker identification when the speech is corrupted by additive noise. The environmental mismatch between training and testing data degrades the performance of speaker identification system. The performance degradation is primarily due to presence of background noise when try to match a given speaker to the set of known speakers in a database. Mel frequency cepstral coefficients (MFCCs) are perhaps the most widely used front ends in the state of the art speaker identification systems. One of the major issues with MFCCs is that they are very sensitive to additive noise. To overcome this bottleneck, a temporal filtering procedure on the autocorrelation sequence is proposed to minimize the effect of additive noise. The proposed feature is called Relative Autocorrelation Mel Frequency Cepstral Coefficients (A-MFCC) which is derived based on filtering the temporal trajectories of short time one sided autocorrelation sequence. This filtering process minimizes the effect of additive noise. No prior knowledge of noise characteristics is required. The additive noise can be a colored noise. For speaker identification, Hindi database was constructed from the speech samples of each known speaker. Feature vectors (MFCCs and A-MFCCs) were extracted from the samples by short-term spectral analysis, and processed further by vector quantization for locating the clusters in the feature space. Experimental results indicated that A-MFCCs significantly improved the performance of speaker identification system in noisy environment.

References
  1. Y. GONG, Speech recognition in noisy environments: A survey. Speech Communication, Vol. 16 (1995), pp. 261–291.
  2. S. F. BOLL, Suppression of acoustic noise in speech using spectral subtraction. IEEE Transactions on Acoustic Speech and Signal Processing, 27 (2), (1979), pp. 113–120.
  3. HERMANSKY AND MORGAN, RASTA processing of speech. Speech Communication, Vol. 41, (2003), pp. 469–484.
  4. J. HERNANDO AND C. NADEU, Linear prediction of the one-sided autocorrelation sequence for noisy speech recognition. IEEE Trans. Speech Audio Processing, Vol. 2, No. 5, (1994) pp. 578-586.
  5. A. VARGA AND H. J. M. STEENEKEN, Assessment for automatic speech recognition: II. NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition systems. Speech Communication, Vol. 12, (1993), pp. 247–251.
Index Terms

Computer Science
Information Sciences

Keywords

Speaker identification vector quantization relative autocorrelation