A novel NMF-HMM speech enhancement algorithm based on Poisson mixture model

Yang Xiang; Liming Shi; Jesper  Lisby Højvang; Morten  Højfeldt Rasmussen; Mads Græsbøll Christensen

doi:10.1109/ICASSP39728.2021.9414620

A novel NMF-HMM speech enhancement algorithm based on Poisson mixture model

Yang Xiang, Liming Shi, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

6 Citations (Scopus)

58 Downloads (Pure)

Abstract

In this paper, we propose a novel non-negative matrix factorization (NMF) and hidden Markov model (NMF-HMM) based speech enhancement algorithm, which employs a Poisson mixture model (PMM). {Compared to} the previously proposed NMF-HMM method, the new algorithm, termed PMM-NMF-HMM, {uses} the Poisson mixture distribution for the state conditional likelihood function for a HMM rather than the single Poisson distribution. {This means that there are the more basis matrices that can be used to model the speech and noise signals, so more signal information can be captured by the resulting model. The proposed method is supervised and thus includes a training and an enhancement stage. It is shown that, in the training stage, the proposed method can be implemented efficiently using multiplicative update (MU) for the model parameters, much like the NMF-HMM algorithm. In the speech enhancement stage, which can be performed online, a novel PMM-NMF-HMM minimum mean-square error (MMSE) estimator is developed. The experimental results indicate that the PMM-NMF-HMM method can obtain higher short-time objective intelligibility (STOI) and perceptual evaluation of speech quality (PESQ) score than NMF-HMM. Additionally, the {method also outperforms other state-of-the-art NMF-based supervised speech enhancement algorithms.

Original language	English
Title of host publication	ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Number of pages	5
Publisher	IEEE
Publication date	11 Jun 2021
Pages	721-725
Article number	9414620
ISBN (Print)	978-1-7281-7606-2
ISBN (Electronic)	978-1-7281-7605-5
DOIs	https://doi.org/10.1109/ICASSP39728.2021.9414620
Publication status	Published - 11 Jun 2021
Event	ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Toronto, Canada Duration: 6 Jun 2021 → 11 Jun 2021

Conference

Conference	ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Country/Territory	Canada
City	Toronto
Period	06/06/2021 → 11/06/2021

Series	I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
ISSN	1520-6149

Keywords

Hidden Markov model (HMM)
Minimum mean-square error (MMSE)
Non-negative matrix factorization (NMF)
Poisson mixture model (PMM)
Speech enhancement

Access to Document

10.1109/ICASSP39728.2021.9414620

Accepted ManuscriptAccepted author manuscript, 276 KBLicence: CC BY-NC-ND 4.0

AUB Link

Search for the material in Aalborg University Library's search engine

Cite this

@inproceedings{8e55175bf5b74102af48326b75330e52,

title = "A novel NMF-HMM speech enhancement algorithm based on Poisson mixture model",

abstract = "In this paper, we propose a novel non-negative matrix factorization (NMF) and hidden Markov model (NMF-HMM) based speech enhancement algorithm, which employs a Poisson mixture model (PMM). {Compared to} the previously proposed NMF-HMM method, the new algorithm, termed PMM-NMF-HMM, {uses} the Poisson mixture distribution for the state conditional likelihood function for a HMM rather than the single Poisson distribution. {This means that there are the more basis matrices that can be used to model the speech and noise signals, so more signal information can be captured by the resulting model. The proposed method is supervised and thus includes a training and an enhancement stage. It is shown that, in the training stage, the proposed method can be implemented efficiently using multiplicative update (MU) for the model parameters, much like the NMF-HMM algorithm. In the speech enhancement stage, which can be performed online, a novel PMM-NMF-HMM minimum mean-square error (MMSE) estimator is developed. The experimental results indicate that the PMM-NMF-HMM method can obtain higher short-time objective intelligibility (STOI) and perceptual evaluation of speech quality (PESQ) score than NMF-HMM. Additionally, the {method also outperforms other state-of-the-art NMF-based supervised speech enhancement algorithms.",

keywords = "Hidden Markov model (HMM), Minimum mean-square error (MMSE), Non-negative matrix factorization (NMF), Poisson mixture model (PMM), Speech enhancement",

author = "Yang Xiang and Liming Shi and {Lisby H{\o}jvang}, Jesper and {H{\o}jfeldt Rasmussen}, Morten and Christensen, {Mads Gr{\ae}sb{\o}ll}",

year = "2021",

month = jun,

day = "11",

doi = "10.1109/ICASSP39728.2021.9414620",

language = "English",

isbn = "978-1-7281-7606-2",

series = "I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings",

publisher = "IEEE",

pages = "721--725",

booktitle = "ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)",

address = "United States",

note = " ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ; Conference date: 06-06-2021 Through 11-06-2021",

}

Xiang, Y, Shi, L, Lisby Højvang, J, Højfeldt Rasmussen, M & Christensen, MG 2021, A novel NMF-HMM speech enhancement algorithm based on Poisson mixture model. in ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)., 9414620, IEEE, I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings, pp. 721-725, ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, Ontario, Canada, 06/06/2021. https://doi.org/10.1109/ICASSP39728.2021.9414620

A novel NMF-HMM speech enhancement algorithm based on Poisson mixture model. / Xiang, Yang; Shi, Liming; Lisby Højvang, Jesper et al.
ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2021. p. 721-725 9414620 (I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings).

Research output: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

TY - GEN

T1 - A novel NMF-HMM speech enhancement algorithm based on Poisson mixture model

AU - Xiang, Yang

AU - Shi, Liming

AU - Lisby Højvang, Jesper

AU - Højfeldt Rasmussen, Morten

AU - Christensen, Mads Græsbøll

PY - 2021/6/11

Y1 - 2021/6/11

N2 - In this paper, we propose a novel non-negative matrix factorization (NMF) and hidden Markov model (NMF-HMM) based speech enhancement algorithm, which employs a Poisson mixture model (PMM). {Compared to} the previously proposed NMF-HMM method, the new algorithm, termed PMM-NMF-HMM, {uses} the Poisson mixture distribution for the state conditional likelihood function for a HMM rather than the single Poisson distribution. {This means that there are the more basis matrices that can be used to model the speech and noise signals, so more signal information can be captured by the resulting model. The proposed method is supervised and thus includes a training and an enhancement stage. It is shown that, in the training stage, the proposed method can be implemented efficiently using multiplicative update (MU) for the model parameters, much like the NMF-HMM algorithm. In the speech enhancement stage, which can be performed online, a novel PMM-NMF-HMM minimum mean-square error (MMSE) estimator is developed. The experimental results indicate that the PMM-NMF-HMM method can obtain higher short-time objective intelligibility (STOI) and perceptual evaluation of speech quality (PESQ) score than NMF-HMM. Additionally, the {method also outperforms other state-of-the-art NMF-based supervised speech enhancement algorithms.

AB - In this paper, we propose a novel non-negative matrix factorization (NMF) and hidden Markov model (NMF-HMM) based speech enhancement algorithm, which employs a Poisson mixture model (PMM). {Compared to} the previously proposed NMF-HMM method, the new algorithm, termed PMM-NMF-HMM, {uses} the Poisson mixture distribution for the state conditional likelihood function for a HMM rather than the single Poisson distribution. {This means that there are the more basis matrices that can be used to model the speech and noise signals, so more signal information can be captured by the resulting model. The proposed method is supervised and thus includes a training and an enhancement stage. It is shown that, in the training stage, the proposed method can be implemented efficiently using multiplicative update (MU) for the model parameters, much like the NMF-HMM algorithm. In the speech enhancement stage, which can be performed online, a novel PMM-NMF-HMM minimum mean-square error (MMSE) estimator is developed. The experimental results indicate that the PMM-NMF-HMM method can obtain higher short-time objective intelligibility (STOI) and perceptual evaluation of speech quality (PESQ) score than NMF-HMM. Additionally, the {method also outperforms other state-of-the-art NMF-based supervised speech enhancement algorithms.

KW - Hidden Markov model (HMM)

KW - Minimum mean-square error (MMSE)

KW - Non-negative matrix factorization (NMF)

KW - Poisson mixture model (PMM)

KW - Speech enhancement

UR - http://www.scopus.com/inward/record.url?scp=85115136673&partnerID=8YFLogxK

U2 - 10.1109/ICASSP39728.2021.9414620

DO - 10.1109/ICASSP39728.2021.9414620

M3 - Article in proceeding

SN - 978-1-7281-7606-2

T3 - I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings

SP - 721

EP - 725

BT - ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

PB - IEEE

T2 - ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Y2 - 6 June 2021 through 11 June 2021

ER -

Xiang Y, Shi L, Lisby Højvang J, Højfeldt Rasmussen M, Christensen MG. A novel NMF-HMM speech enhancement algorithm based on Poisson mixture model. In ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE. 2021. p. 721-725. 9414620. (I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings). doi: 10.1109/ICASSP39728.2021.9414620

A novel NMF-HMM speech enhancement algorithm based on Poisson mixture model

Abstract

Conference

Keywords

Access to Document

AUB Link

Other files and links

Fingerprint

Cite this