Abstract
In this work, we propose a neural network approach for speech reconstruction from mel spectrograms, a crucial task in achieving high-quality data after processing speech signals in the time-frequency domain. Specifically, we propose a two-stage deep learning approach based on an overcomplete deep autoencoder (DAE) for the mel filter bank inversion coupled with the deep version of the Griffin-Lim (DeGLI) algorithm for the phase information recovery. After the pre-training of both parts of the architecture, a final fine-tuning on the whole system is performed. Some numerical results, evaluated on the well-known TIMIT dataset, demonstrate the effectiveness of the proposed idea by obtaining a PESQ of 3.996, a STOI equal to 0.994, and a mean opinion score evaluated as 4.15.
Originalsprog | Engelsk |
---|---|
Titel | Advanced Neural Artificial Intelligence : Theories and Applications |
Redaktører | Anna Esposito, Marcos Faundez-Zanuy, Francesco C. Morabito, Eros Pasero, Gennaro Cordasco |
Antal sider | 12 |
Forlag | Springer |
Publikationsdato | maj 2025 |
Sider | 267-278 |
ISBN (Trykt) | 978-981-96-0993-2, 978-981-96-0996-3 |
ISBN (Elektronisk) | 978-981-96-0994-9 |
DOI | |
Status | Udgivet - maj 2025 |
Udgivet eksternt | Ja |
Begivenhed | 30th International Workshops on Neural Network, WIRN 2023 - Vietri sul Mare, Italien Varighed: 7 jun. 2023 → 9 jun. 2023 |
Konference
Konference | 30th International Workshops on Neural Network, WIRN 2023 |
---|---|
Land/Område | Italien |
By | Vietri sul Mare |
Periode | 07/06/2023 → 09/06/2023 |
Sponsor | International Institute for Advanced Scientific Studies (IIASS)# Department of Psychology, Università della Campania “Luigi Vanvitelli”, IT# Provincia di Salerno# Comune di Vietri sul Mare# International Neural Network Society (INNS)# Università Mediterranea di Reggio Calabria# Società Italiana Reti Neuroniche (SIREN)# |
Navn | Smart Innovation, Systems and Technologies |
---|---|
Vol/bind | 428 |
ISSN | 2190-3018 |
Bibliografisk note
Publisher Copyright:© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.