Most music production nowadays is carried out using software tools: for this reason, the market demands faithful audio effect simulations. Traditional methods for modeling nonlinear systems are effect-specific or labor-intensive; however, recent works yielded promising results by black-box simulation of these effects using neural networks. This work aims to explore two models of distortion effects based on autoencoders: one makes use of fully-connected layers only, and the other employs convolutional layers. Both models were trained using clean sounds as input and distorted sounds as target, thus, the learning method was not self-supervised, as it is mostly the case when dealing with autoencoders. The networks were then tested with visual inspection of the output spectrograms, as well as with an informal listening test, and performed well in reconstructing the distorted signal spectra, however a fair amount of noise was also introduced.
|Title of host publication||Intelligent Technologies for Interactive Entertainment : 12th EAI International Conference, INTETAIN 2020|
|Publication status||Published - 2021|
|Event||EAI Intetain 2020 – 12th EAI International Conference on Intelligent Technologies for Interactive Entertainment - Online, Santa Clara, United States|
Duration: 12 Dec 2020 → 14 Dec 2020
|Conference||EAI Intetain 2020 – 12th EAI International Conference on Intelligent Technologies for Interactive Entertainment|
|Period||12/12/2020 → 14/12/2020|
|Series||Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering|