Speech Enhancement by Classification of Noisy Signals Decomposed Using NMF and Wiener Filtering

Mahmoud Fakhry, Amir Hossein Poorjam, Mads Græsbøll Christensen

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

10 Citations (Scopus)
318 Downloads (Pure)

Abstract

Supervised non-negative matrix factorization (NMF) is effective in speech enhancement through training spectral models of speech and noise signals. However, the enhancement quality reduces when the models are trained on data that is not highly relevant to a speech signal and a noise signal in a noisy observation. In this paper, we propose to train a classifier in order to overcome such poor characterization of the signals through the trained models. The main idea is to decompose the noisy observation into parts and the enhanced signal is reconstructed by combining the less-corrupted ones which are identified in the cepstral domain using the trained classifier. We apply unsupervised NMF followed by Wiener filtering for the decomposition, and use a support vector machine trained on the mel-frequency cepstral coefficients of the parts of training speech and noise signals for the classification. The results show the effectiveness of the proposed method compared with the supervised NMF.
Original languageEnglish
Title of host publication26th European Signal Processing Conference (EUSIPCO)
Number of pages5
PublisherIEEE
Publication date2018
Article number8553123
ISBN (Electronic)978-9-0827-9701-5
DOIs
Publication statusPublished - 2018
Event26th European Signal Processing Conference (EUSIPCO 2018) - Rome, Italy
Duration: 3 Sept 20187 Sept 2018
Conference number: 26
http://www.eusipco2018.org

Conference

Conference26th European Signal Processing Conference (EUSIPCO 2018)
Number26
Country/TerritoryItaly
CityRome
Period03/09/201807/09/2018
Internet address
SeriesProceedings of the European Signal Processing Conference
ISSN2076-1465

Fingerprint

Dive into the research topics of 'Speech Enhancement by Classification of Noisy Signals Decomposed Using NMF and Wiener Filtering'. Together they form a unique fingerprint.

Cite this