TY - GEN
T1 - Fusion of Classical Digital Signal Processing and Deep Learning Methods (FTCAPPS)
AU - Gomez, Angel
AU - Sánchez, Victoria E.
AU - Peinado, Antonio
AU - Martín-Doñas, Juan M.
AU - Gómez-Alanís, Alejandro
AU - Villegas-Morcillo, Amelia
AU - Rosello, Eros
AU - Chica, Manuel
AU - Garcia, Celia
AU - Espejo, Ivan Lopez
PY - 2022/11
Y1 - 2022/11
N2 - The use of deep learning approaches in Signal Processing is finally showing a trend towards a rational use. After an effervescent period where research activity seemed to focus on seeking old problems to apply solutions entirely based on neural networks, we have reached a more mature stage where integrative approaches are on the rise. These approaches gather the best from each paradigm: on the one hand, the knowledge and elegance of classical signal processing and, on the other, the great ability to model and learn from data which is inherent to deep learning methods. In this project we aim towards a new signal processing paradigm where classical and deep learning techniques not only collaborate, but fuse themselves. In particular, we focus on two objectives: 1) the development of deep learning architectures based on or inspired by signal processing schemes, and 2) the improvement of current deep learning training methods by means of classical techniques and algorithms, particularly, by exploiting the knowledge legacy they treasure. These innovations will be applied to two socially and scientifically relevant topics in which our research group has been working for years. The first one is the enhancement of speech signal acquired under acoustic adverse conditions (e.g., noise, reverberation, other speakers, ...). The second one is the development of anti-fraud measures for biometric voice authentication, in which banking corporations and other large companies are strongly interested.
AB - The use of deep learning approaches in Signal Processing is finally showing a trend towards a rational use. After an effervescent period where research activity seemed to focus on seeking old problems to apply solutions entirely based on neural networks, we have reached a more mature stage where integrative approaches are on the rise. These approaches gather the best from each paradigm: on the one hand, the knowledge and elegance of classical signal processing and, on the other, the great ability to model and learn from data which is inherent to deep learning methods. In this project we aim towards a new signal processing paradigm where classical and deep learning techniques not only collaborate, but fuse themselves. In particular, we focus on two objectives: 1) the development of deep learning architectures based on or inspired by signal processing schemes, and 2) the improvement of current deep learning training methods by means of classical techniques and algorithms, particularly, by exploiting the knowledge legacy they treasure. These innovations will be applied to two socially and scientifically relevant topics in which our research group has been working for years. The first one is the enhancement of speech signal acquired under acoustic adverse conditions (e.g., noise, reverberation, other speakers, ...). The second one is the development of anti-fraud measures for biometric voice authentication, in which banking corporations and other large companies are strongly interested.
KW - Machine Learning
KW - Deep Neural Networks
KW - Speech Enhancement
KW - Multichannel speech processing
KW - Voice Anti-spoofing
U2 - 10.21437/IberSPEECH.2022-48
DO - 10.21437/IberSPEECH.2022-48
M3 - Article in proceeding
SP - 237
EP - 240
BT - Proc. IberSPEECH 2022
T2 - IberSPEECH 2022
Y2 - 14 November 2022 through 16 November 2022
ER -