We apologize for a recent technical issue with our email system, which temporarily affected account activations. Accounts have now been activated. Authors may proceed with paper submissions. PhDFocusTM
CFP last date
20 December 2024
Reseach Article

Incorporating Auditory Masking Properties for Speech Enhancement in presence of Near-end Noise

by Premananda B. S., Uma B. V.
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 106 - Number 15
Year of Publication: 2014
Authors: Premananda B. S., Uma B. V.
10.5120/18593-9835

Premananda B. S., Uma B. V. . Incorporating Auditory Masking Properties for Speech Enhancement in presence of Near-end Noise. International Journal of Computer Applications. 106, 15 ( November 2014), 1-6. DOI=10.5120/18593-9835

@article{ 10.5120/18593-9835,
author = { Premananda B. S., Uma B. V. },
title = { Incorporating Auditory Masking Properties for Speech Enhancement in presence of Near-end Noise },
journal = { International Journal of Computer Applications },
issue_date = { November 2014 },
volume = { 106 },
number = { 15 },
month = { November },
year = { 2014 },
issn = { 0975-8887 },
pages = { 1-6 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume106/number15/18593-9835/ },
doi = { 10.5120/18593-9835 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T22:39:26.695186+05:30
%A Premananda B. S.
%A Uma B. V.
%T Incorporating Auditory Masking Properties for Speech Enhancement in presence of Near-end Noise
%J International Journal of Computer Applications
%@ 0975-8887
%V 106
%N 15
%P 1-6
%D 2014
%I Foundation of Computer Science (FCS), NY, USA
Abstract

In mobile devices, perceived speech signal degrades significantly in the presence of background noise as it reaches directly at the listener's ears. There is a need to improve the intelligibility and quality of the received speech signal in noisy environments by incorporating speech enhancement algorithms. This paper focuses on speech enhancement method including auditory masking properties of the human ear to improve the intelligibility and quality of the speech signal in the presence of near-end noise. Implemented by dynamically enhancing the speech signal when the near-end noise dominates. Intelligibility and quality of enhanced speech signal are measured using SII and PESQ. Experimental results show improvement in the intelligibility and quality of the enhanced speech signal with the proposed approach over the unprocessed speech signal. This particular approach is far more efficient in overcoming the degradation of speech signals in noisy environments.

References
  1. Premananda B. S. , and Uma B. V. , "Speech Enhancement Algorithm to Reduce the Effect of Background Noise in Mobile Phones", International Journal of Wireless and Mobile Networks (IJWMN), Vol. 5, No. 1, pp. 177 - 189, Feb. 2013.
  2. Premananda B. S. , and Uma B. V. , "Low Complexity Speech Enhancement Algorithm for Improved Perception in Mobile Devices", International Workshop on Wireless and Mobile Networks, WiMoNe-2012, Lecture Notes in Electrical Engineering, Vol. 131, Springer, pp. 699 - 707, Feb. 2013.
  3. Premananda B. S. , Manoj and Uma B. V. , "Near-End Perception Enhancement using Dominant Frequency Extraction", International Journal of Advanced engineering and Research Development (IJAERD), Vol. 1, No. 6, pp. 351-358, June 2014.
  4. Bastian Sauert and Peter Vary, "Near End Listening Enhancement Considering Thermal Limit of Mobile Phone Loudspeakers," Proceedings of Elektronische Sprach Signal Verarbeitun (ESSV), Vol. 61, Germany, pp. 333–340, Sept. 2011.
  5. Bastian Sauert and Peter Vary, "Near End Listening Enhancement: Speech Intelligibility Improvement in Noisy Environments", Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, France, pp. 493 - 496, 2006.
  6. Bastian Sauert and Peter Vary, "Near-End Listening Enhancement Optimized with respect to Speech Intelligibility Index and Audio Power Limitations", Proceedings of European Signal Processing Conference, Aalborg, Denmark, pp. 1919 - 1923, August 2010.
  7. Taal C. H. , Jensen J. , and Leijon A. , "On Optimal Linear Filtering of Speech for Near-End Listening Enhancement", IEEE Signal Processing Letters, Vol. 20, No. 3, pp. 225 - 228, March 2013.
  8. Taal C. H. , and Jesper Jensen, "SII-based Speech Pre-processing for Intelligibility Improvement in Noise", Proceeding of Interspeech, Lyon, France, pp. 3582 - 3586, August 2013.
  9. Jeon Yu-young and Lee Sang-min, "A Speech Enhancement Algorithm to Reduce Noise and Compensate for Partial Masking Effect", Journal of Central South University of Technology, Vol. 18, issue 4, pp. 1121 - 1127, August 2011.
  10. Gunawan T. S. , and Ambikairajah E. , "Speech Enhancement using Temporal Masking and Fractional Bark Gammatone Filters", Proceedings of the 10th Australian International Conference on Speech Science & Technology, Sydney, pp. 420 - 425, 2004.
  11. Gunawan T. S. , Khalifa O. O. , and Ambikairajah E. , "Forward Masking Threshold Estimation using Neural Networks and its Application to Parallel Speech Enhancement", in International Conference on Computer and Communication Engineering, Vol. 11, No 1, pp. 15 - 26, 2010.
  12. Yi Hu and Philipos C. Loizou, "Incorporating a Psychoacoustic Model in Frequency Domain Speech Enhancement", IEEE signal processing letters, Vol. 11, No. 2, pp. 270 - 273, February 2004.
  13. Eberhard Zwicker and Hugo Fastl, Psychoacoustics, Facts and Models. New York: Springer, 2007.
  14. American National Standard. Methods for the Calculation of the Speech Intelligibility Index. ANSI S3. 5-1997, 1997.
  15. "PESQ: An Introduction", Psytechnics Limited, White paper, September 2001.
  16. Rix A. W. , Beerends J. G. , Hollier M. P. , and Hekstra A. P. , "Perceptual Evaluation of Speech Quality (PESQ)-A New Method for Speech Quality Assessment of Telephone Networks and Codecs", IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol. 2, pp. 749 - 752, May 2001.
  17. ITU-T P. 862, Perceptual evaluation of speech quality, and objective method for end-to-end speech quality assessment of narrowband telephone networks and speech codecs, ITU-T Recommendation P. 862, 2000.
  18. Ephraim Y. , and Malah D. , "Speech Enhancement using a Minimum Mean Square Error Short-Time Spectral Amplitude Estimator", IEEE Transaction on Acoustic, Speech, and Signal Processing, Vol. ASSP-32, pp. 1109 - 1121, December 1984.
  19. Malihe Hassani and Karami Mollaei M. R. , "Speech Enhancement based on Spectral Subtraction in Wavelet Domain", Seventh IEEE International Colloquium on Signal Processing and its Applications, pp. 366 - 370, March 2011.
Index Terms

Computer Science
Information Sciences

Keywords

Gain Masking Near-end noise Speech enhancement Speech intelligibility Speech quality