International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 153 - Number 9 |
Year of Publication: 2016 |
Authors: A. Adewusi, K. A. Amusa, A. R. Zubair |
10.5120/ijca2016912112 |
A. Adewusi, K. A. Amusa, A. R. Zubair . Itakura-Saito Divergence Non Negative Matrix Factorization with Application to Monaural Speech Separation. International Journal of Computer Applications. 153, 9 ( Nov 2016), 17-22. DOI=10.5120/ijca2016912112
Monaural source separation is an interesting area that has received much attention in the signal processing community as it is a pre-processing step in many applications. However, many solutions have been developed to achieve clean separation based on Non-Negative Matrix Factorization (NMF). In this work, we proposed a variant of Itakura-Saito Divergence NMF based on source filter model that captures the temporal continuity of speech signal. The algorithm shows a very good separation results for mixture of two speech sources in terms of artifacts reduction. Besides that, Source to distortion ratio (SDR) and Source to Artifact Ratio (SAR) were found to be higher when compared with NMF algorithms with Kullback-Leibler and Euclidean divergences.