International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 186 - Number 69 |
Year of Publication: 2025 |
Authors: Nataliya Boyko, Petro Slobodian |
![]() |
Nataliya Boyko, Petro Slobodian . Application of Machine Learning Methods for Enhancing the Quality of Medical Audio Recordings: Comparative Analysis of Classical and Modern Approaches. International Journal of Computer Applications. 186, 69 ( Mar 2025), 31-43. DOI=10.5120/ijca2025924502
The aim of the study is to solve the problem of noise in audio recordings and improve sound quality using existing machine learning methods; compare different existing methods. In order to test, analyze and compare methods of machine learning based on sound processing problem, it is proposed to use several different approaches. The work will use both classical methods of audio signal processing, such as the wiener filter and spectral subtraction, and more modern ones, namely convolutional neural networks. Each of these methods has its own pros and cons that will be analyzed during experiments, in order to determine in which case which method will be useful. Using these methods will allow for in-depth analysis and comprehensive results for audio processing. Based on the research, it was determined that Spectral subtraction performs slightly better than the Wiener filter. This is evidenced by both the PESQ scores for the two methods and the audiovisual analysis. Among all the selected methods, convolutional neural networks perform the best, and based on the metrics, conclusion was made that the best results for CNN’s can be achieved using L1/L2 regularization and Dropout. Further research may include investigating new CNN architectures for audio de-noising, exploring the possibilities of using other types of neural networks such as Recurrent Neural Networks, Generative Adversarial Networks for audio de-noising.