Abstract
Supervised non-negative matrix factorization (NMF) is effective in speech enhancement through training spectral models of speech and noise signals. However, the enhancement quality reduces when the models are trained on data that is not highly relevant to a speech signal and a noise signal in a noisy observation. In this paper, we propose to train a classifier in order to overcome such poor characterization of the signals through the trained models. The main idea is to decompose the noisy observation into parts and the enhanced signal is reconstructed by combining the less-corrupted ones which are identified in the cepstral domain using the trained classifier. We apply unsupervised NMF followed by Wiener filtering for the decomposition, and use a support vector machine trained on the mel-frequency cepstral coefficients of the parts of training speech and noise signals for the classification. The results show the effectiveness of the proposed method compared with the supervised NMF.
Original language | English |
---|---|
Title of host publication | 26th European Signal Processing Conference (EUSIPCO) |
Number of pages | 5 |
Publisher | IEEE |
Publication date | 2018 |
Article number | 8553123 |
ISBN (Electronic) | 978-9-0827-9701-5 |
DOIs | |
Publication status | Published - 2018 |
Event | 26th European Signal Processing Conference (EUSIPCO 2018) - Rome, Italy Duration: 3 Sept 2018 → 7 Sept 2018 Conference number: 26 http://www.eusipco2018.org |
Conference
Conference | 26th European Signal Processing Conference (EUSIPCO 2018) |
---|---|
Number | 26 |
Country/Territory | Italy |
City | Rome |
Period | 03/09/2018 → 07/09/2018 |
Internet address |
Series | Proceedings of the European Signal Processing Conference |
---|---|
ISSN | 2076-1465 |