2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013)
Foundation of Computer Science USA
NCIPET - Number 1
March 2012
Authors: N. P. Jawarkar, R. S. Holambe, T. K. Basu
N. P. Jawarkar, R. S. Holambe, T. K. Basu. Audio Segmentation using Line Spectral Pairs. 2nd National Conference on Innovative Paradigms in Engineering and Technology (NCIPET 2013). NCIPET, 1 (March 2012), 1-5.
This paper describes a technique for unsupervised audio segmentation. The main objective of the work is to study the performance of an audio segmentation system based on a metric-based method. The system first classifies the audio signal into speech and nonspeech using the variance of the zero-crossing rate. Line spectral pair (LSP) features are then used to detect speaker change points automatically. In the first stage, the Hotelling T² distance metric is used for coarse speaker change detection. In the second stage, the Bayesian information criterion (BIC) is used to validate the potential speaker change points found by the coarse segmentation, which reduces the false alarm rate. A database of four files is used for segmentation; each file contains speech recorded from a different combination of male and female speakers, mixed with nonspeech signals such as music and environmental sound. The file with one male and one female speaker gives the best performance, with an F1 measure of 0.9474.
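
The speech/nonspeech step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the frame length, hop size, 8 kHz sample rate, decision threshold, and the rule that a higher zero-crossing-rate variance indicates speech are all assumptions chosen for illustration, and the function names are hypothetical.

```python
import math
from statistics import pvariance


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def zcr_variance(samples, frame_len=256, hop=128):
    """Variance of the per-frame zero-crossing rate over one window.

    frame_len and hop are assumed values, not taken from the paper.
    """
    rates = [
        zero_crossing_rate(samples[i:i + frame_len])
        for i in range(0, len(samples) - frame_len + 1, hop)
    ]
    return pvariance(rates) if len(rates) > 1 else 0.0


def is_speech(samples, threshold=0.005):
    """Label a window as speech when its ZCR variance is high.

    ASSUMPTION: both the threshold and the direction of the test are
    illustrative; a real system would tune them on labelled data.
    """
    return zcr_variance(samples) > threshold


# Synthetic test signals at an assumed 8 kHz sample rate.
SAMPLE_RATE = 8000

# A steady 440 Hz tone: the ZCR is nearly the same in every frame.
tone = [math.sin(2 * math.pi * 440 * n / SAMPLE_RATE) for n in range(4096)]

# A crude stand-in for alternating voiced/unvoiced sounds: the signal
# switches between a 200 Hz and a 3000 Hz sine every 512 samples,
# so the per-frame ZCR swings widely.
mixed = [
    math.sin(2 * math.pi * (200 if (n // 512) % 2 == 0 else 3000)
             * n / SAMPLE_RATE)
    for n in range(4096)
]

print("tone  ZCR variance:", zcr_variance(tone), "speech?", is_speech(tone))
print("mixed ZCR variance:", zcr_variance(mixed), "speech?", is_speech(mixed))
```

In this sketch the steady tone yields a ZCR variance close to zero, while the alternating signal yields a much larger one. The coarse Hotelling T² stage and the BIC validation stage would then operate only on the windows labelled as speech.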