CFP last date
20 January 2025
Reseach Article

Style Transfer for Audio using Convolutional Neural Networks

by Bhaumik Choksi, Alisha Sawant, Swati Mali
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 175 - Number 8
Year of Publication: 2017
Authors: Bhaumik Choksi, Alisha Sawant, Swati Mali
10.5120/ijca2017915612

Bhaumik Choksi, Alisha Sawant, Swati Mali . Style Transfer for Audio using Convolutional Neural Networks. International Journal of Computer Applications. 175, 8 ( Oct 2017), 17-20. DOI=10.5120/ijca2017915612

@article{ 10.5120/ijca2017915612,
author = { Bhaumik Choksi, Alisha Sawant, Swati Mali },
title = { Style Transfer for Audio using Convolutional Neural Networks },
journal = { International Journal of Computer Applications },
issue_date = { Oct 2017 },
volume = { 175 },
number = { 8 },
month = { Oct },
year = { 2017 },
issn = { 0975-8887 },
pages = { 17-20 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume175/number8/28508-2017915612/ },
doi = { 10.5120/ijca2017915612 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-07T00:24:31.169210+05:30
%A Bhaumik Choksi
%A Alisha Sawant
%A Swati Mali
%T Style Transfer for Audio using Convolutional Neural Networks
%J International Journal of Computer Applications
%@ 0975-8887
%V 175
%N 8
%P 17-20
%D 2017
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Convolutional neural networks have recently become extremely popular in various deep learning applications. One such application is style transfer for images. Following this trend, this paper explores how this technique can be applied to audio data. The technique discussed involves combining the content features of one audio sample with the style features of another audio sample. The results produced show how a Convolutional Neural Network can be used to extract features from audio signals. The paper also discusses the various modifications made in the algorithm used for image style transfer in order to apply it to audio signals.

References
  1. Gatys, Leon A., Alexander S. Ecker, and Matthias Bethge. "Image style transfer using convolutional neural networks." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2016.
  2. Neumann, László, and Attila Neumann. "Color style transfer techniques using hue, lightness and saturation histogram matching." Computational Aesthetics. 2005.
  3. Ruder, Manuel, Alexey Dosovitskiy, and Thomas Brox. "Artistic style transfer for videos." German Conference on Pattern Recognition. Springer International Publishing, 2016.
  4. Bruckner, Stefan, and M. Eduard Gröller. "Style transfer functions for illustrative volume rendering." Computer Graphics Forum. Vol. 26. No. 3. Blackwell Publishing Ltd, 2007.
  5. Johnson, Justin, Alexandre Alahi, and Li Fei-Fei. "Perceptual losses for real-time style transfer and super-resolution." European Conference on Computer Vision. Springer International Publishing, 2016.
  6. Selim, Ahmed, Mohamed Elgharib, and Linda Doyle. "Painting style transfer for head portraits using convolutional neural networks." ACM Transactions on Graphics (ToG) 35.4 (2016): 129.
  7. Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton. "Imagenet classification with deep convolutional neural networks." Advances in neural information processing systems. 2012.
  8. Drineas, Petros, and Michael W. Mahoney. "On the Nyström method for approximating a Gram matrix for improved kernel-based learning." journal of machine learning research 6.Dec (2005): 2153-2175.
  9. Li, Yanghao, et al. "Demystifying neural style transfer." arXiv preprint arXiv:1701.01036 (2017).
  10. Griffin, Daniel, and Jae Lim. "Signal estimation from modified short-time Fourier transform." IEEE Transactions on Acoustics, Speech, and Signal Processing 32.2 (1984): 236-243.
Index Terms

Computer Science
Information Sciences

Keywords

Convolutional Neural Network Deep learning Style Transfer Gram Matrix.