International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 50 - Number 20
Year of Publication: 2012
Authors: H. B. Kekre, Vaishali Kulkarni, Prashant Gaikar, Nishant Gupta
DOI: 10.5120/7921-1228
H. B. Kekre, Vaishali Kulkarni, Prashant Gaikar, Nishant Gupta. Speaker Identification using Spectrograms of Varying Frame Sizes. International Journal of Computer Applications. 50, 20 (July 2012), 27-33. DOI=10.5120/7921-1228
In this paper, a text-dependent speaker recognition algorithm based on spectrograms is proposed. The spectrograms are generated using the Discrete Fourier Transform for varying frame sizes, with 25% and 50% overlap between speech frames. Feature vectors are extracted as the row mean vector of each spectrogram. For feature matching, two distance measures are used: Euclidean distance and Manhattan distance. Results are computed on two databases: a locally created database and the CSLU speaker recognition database. The maximum accuracy is 92.52%, obtained with 50% overlap between speech frames and Manhattan distance as the similarity measure.
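
The pipeline described in the abstract (framing with overlap, DFT, row mean of the spectrogram, nearest-match by distance) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the default frame size of 256 samples, the absence of a window function, and the use of only the non-redundant half of the spectrum are all assumptions made for illustration.

```python
import numpy as np

def row_mean_features(signal, frame_size=256, overlap=0.5):
    """Row-mean vector of a magnitude spectrogram (one value per frequency bin).

    frame_size and the lack of windowing are illustrative assumptions.
    """
    hop = int(frame_size * (1.0 - overlap))  # samples between frame starts
    n_frames = 1 + (len(signal) - frame_size) // hop
    # Stack overlapping frames as columns: shape (frame_size, n_frames).
    frames = np.stack(
        [signal[i * hop : i * hop + frame_size] for i in range(n_frames)], axis=1
    )
    # DFT magnitude of each frame; rfft keeps the non-redundant half (assumption).
    spectrogram = np.abs(np.fft.rfft(frames, axis=0))
    # Average each row (frequency bin) across all frames.
    return spectrogram.mean(axis=1)

def euclidean(a, b):
    return float(np.sqrt(np.sum((a - b) ** 2)))

def manhattan(a, b):
    return float(np.sum(np.abs(a - b)))

def identify(test_features, enrolled, distance=manhattan):
    """Return the enrolled speaker label whose features are closest."""
    return min(enrolled, key=lambda name: distance(test_features, enrolled[name]))
```

Here `enrolled` is a hypothetical dictionary mapping speaker labels to feature vectors computed from reference utterances; passing `overlap=0.25` or `overlap=0.5` corresponds to the two overlap settings mentioned in the abstract.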