Fusion of Classical Digital Signal Processing and Deep Learning Methods (FTCAPPS)

Angel Gomez; Victoria E. Sánchez; Antonio Peinado; Juan M. Martín-Doñas; Alejandro Gómez-Alanís; Amelia Villegas-Morcillo; Eros Rosello; Manuel Chica; Celia Garcia; Ivan Lopez Espejo

doi:10.21437/IberSPEECH.2022-48

Fusion of Classical Digital Signal Processing and Deep Learning Methods (FTCAPPS)

Angel Gomez, Victoria E. Sánchez, Antonio Peinado, Juan M. Martín-Doñas, Alejandro Gómez-Alanís, Amelia Villegas-Morcillo, Eros Rosello, Manuel Chica, Celia Garcia, Ivan Lopez Espejo

Publikation: Bidrag til bog/antologi/rapport/konference proceeding › Konferenceartikel i proceeding › Forskning › peer review

35 Downloads (Pure)

Abstract

The use of deep learning approaches in Signal Processing is finally showing a trend towards a rational use. After an effervescent period where research activity seemed to focus on seeking old problems to apply solutions entirely based on neural networks, we have reached a more mature stage where integrative approaches are on the rise. These approaches gather the best from each paradigm: on the one hand, the knowledge and elegance of classical signal processing and, on the other, the great ability to model and learn from data which is inherent to deep learning methods. In this project we aim towards a new signal processing paradigm where classical and deep learning techniques not only collaborate, but fuse themselves. In particular, we focus on two objectives: 1) the development of deep learning architectures based on or inspired by signal processing schemes, and 2) the improvement of current deep learning training methods by means of classical techniques and algorithms, particularly, by exploiting the knowledge legacy they treasure. These innovations will be applied to two socially and scientifically relevant topics in which our research group has been working for years. The first one is the enhancement of speech signal acquired under acoustic adverse conditions (e.g., noise, reverberation, other speakers, ...). The second one is the development of anti-fraud measures for biometric voice authentication, in which banking corporations and other large companies are strongly interested.

Originalsprog	Engelsk
Titel	Proc. IberSPEECH 2022
Publikationsdato	nov. 2022
Sider	237-240
DOI	https://doi.org/10.21437/IberSPEECH.2022-48
Status	Udgivet - nov. 2022
Begivenhed	IberSPEECH 2022 - Granada, Spanien Varighed: 14 nov. 2022 → 16 nov. 2022

Konference

Konference	IberSPEECH 2022
Land/Område	Spanien
By	Granada
Periode	14/11/2022 → 16/11/2022

Adgang til dokumentet

10.21437/IberSPEECH.2022-48

gomez22_iberspeechForlagets udgivne version, 1,26 MB

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

Citationsformater

@inproceedings{43460efdb59049268989f90fba857b4e,

title = "Fusion of Classical Digital Signal Processing and Deep Learning Methods (FTCAPPS)",

abstract = "The use of deep learning approaches in Signal Processing is finally showing a trend towards a rational use. After an effervescent period where research activity seemed to focus on seeking old problems to apply solutions entirely based on neural networks, we have reached a more mature stage where integrative approaches are on the rise. These approaches gather the best from each paradigm: on the one hand, the knowledge and elegance of classical signal processing and, on the other, the great ability to model and learn from data which is inherent to deep learning methods. In this project we aim towards a new signal processing paradigm where classical and deep learning techniques not only collaborate, but fuse themselves. In particular, we focus on two objectives: 1) the development of deep learning architectures based on or inspired by signal processing schemes, and 2) the improvement of current deep learning training methods by means of classical techniques and algorithms, particularly, by exploiting the knowledge legacy they treasure. These innovations will be applied to two socially and scientifically relevant topics in which our research group has been working for years. The first one is the enhancement of speech signal acquired under acoustic adverse conditions (e.g., noise, reverberation, other speakers, ...). The second one is the development of anti-fraud measures for biometric voice authentication, in which banking corporations and other large companies are strongly interested.",

keywords = "Machine Learning, Deep Neural Networks, Speech Enhancement, Multichannel speech processing, Voice Anti-spoofing",

author = "Angel Gomez and S{\'a}nchez, {Victoria E.} and Antonio Peinado and Mart{\'i}n-Do{\~n}as, {Juan M.} and Alejandro G{\'o}mez-Alan{\'i}s and Amelia Villegas-Morcillo and Eros Rosello and Manuel Chica and Celia Garcia and Espejo, {Ivan Lopez}",

year = "2022",

month = nov,

doi = "10.21437/IberSPEECH.2022-48",

language = "English",

pages = "237--240",

booktitle = "Proc. IberSPEECH 2022",

note = "IberSPEECH 2022 ; Conference date: 14-11-2022 Through 16-11-2022",

}

TY - GEN

T1 - Fusion of Classical Digital Signal Processing and Deep Learning Methods (FTCAPPS)

AU - Gomez, Angel

AU - Sánchez, Victoria E.

AU - Peinado, Antonio

AU - Martín-Doñas, Juan M.

AU - Gómez-Alanís, Alejandro

AU - Villegas-Morcillo, Amelia

AU - Rosello, Eros

AU - Chica, Manuel

AU - Garcia, Celia

AU - Espejo, Ivan Lopez

PY - 2022/11

Y1 - 2022/11

N2 - The use of deep learning approaches in Signal Processing is finally showing a trend towards a rational use. After an effervescent period where research activity seemed to focus on seeking old problems to apply solutions entirely based on neural networks, we have reached a more mature stage where integrative approaches are on the rise. These approaches gather the best from each paradigm: on the one hand, the knowledge and elegance of classical signal processing and, on the other, the great ability to model and learn from data which is inherent to deep learning methods. In this project we aim towards a new signal processing paradigm where classical and deep learning techniques not only collaborate, but fuse themselves. In particular, we focus on two objectives: 1) the development of deep learning architectures based on or inspired by signal processing schemes, and 2) the improvement of current deep learning training methods by means of classical techniques and algorithms, particularly, by exploiting the knowledge legacy they treasure. These innovations will be applied to two socially and scientifically relevant topics in which our research group has been working for years. The first one is the enhancement of speech signal acquired under acoustic adverse conditions (e.g., noise, reverberation, other speakers, ...). The second one is the development of anti-fraud measures for biometric voice authentication, in which banking corporations and other large companies are strongly interested.

AB - The use of deep learning approaches in Signal Processing is finally showing a trend towards a rational use. After an effervescent period where research activity seemed to focus on seeking old problems to apply solutions entirely based on neural networks, we have reached a more mature stage where integrative approaches are on the rise. These approaches gather the best from each paradigm: on the one hand, the knowledge and elegance of classical signal processing and, on the other, the great ability to model and learn from data which is inherent to deep learning methods. In this project we aim towards a new signal processing paradigm where classical and deep learning techniques not only collaborate, but fuse themselves. In particular, we focus on two objectives: 1) the development of deep learning architectures based on or inspired by signal processing schemes, and 2) the improvement of current deep learning training methods by means of classical techniques and algorithms, particularly, by exploiting the knowledge legacy they treasure. These innovations will be applied to two socially and scientifically relevant topics in which our research group has been working for years. The first one is the enhancement of speech signal acquired under acoustic adverse conditions (e.g., noise, reverberation, other speakers, ...). The second one is the development of anti-fraud measures for biometric voice authentication, in which banking corporations and other large companies are strongly interested.

KW - Machine Learning

KW - Deep Neural Networks

KW - Speech Enhancement

KW - Multichannel speech processing

KW - Voice Anti-spoofing

U2 - 10.21437/IberSPEECH.2022-48

DO - 10.21437/IberSPEECH.2022-48

M3 - Article in proceeding

SP - 237

EP - 240

BT - Proc. IberSPEECH 2022

T2 - IberSPEECH 2022

Y2 - 14 November 2022 through 16 November 2022

ER -

Fusion of Classical Digital Signal Processing and Deep Learning Methods (FTCAPPS)

Abstract

Konference

Adgang til dokumentet

AUB Link

Fingeraftryk

Citationsformater