An NMF-HMM Speech Enhancement Method based on Kullback-Leibler Divergence

Yang Xiang; Liming Shi; Jesper Lisby Højvang; Morten Højfeldt Rasmussen; Mads Græsbøll Christensen

Publication: Contribution to book/anthology/report/conference proceeding › Article in proceeding › Research › peer-review

11 Citations (Scopus)

200 Downloads (Pure)

Abstract

In this paper, we present a novel supervised Non-negative Matrix
Factorization (NMF) speech enhancement method, which is based on a
Hidden Markov Model (HMM) and the Kullback-Leibler (KL) divergence
(NMF-HMM). Our algorithm applies the HMM to capture timing information,
so that, in contrast to the traditional NMF-based speech enhancement
method, the temporal dynamics of the speech signal are taken into
account. More specifically, a sum-of-Poisson model, which leads to the
KL divergence measure, is used as the observation model for each state
of the HMM. This ensures that the parameter update rule of the proposed
algorithm is identical to the multiplicative update rule, which is
quick and efficient. In the training stage, this update rule is applied
to train the NMF-HMM model. In the online enhancement stage, a novel
minimum mean-square error (MMSE) estimator that incorporates the
NMF-HMM is proposed to conduct speech enhancement. The performance of
the proposed algorithm is evaluated by perceptual evaluation of speech
quality (PESQ) and short-time objective intelligibility (STOI). The
experimental results indicate that the proposed strategy outperforms
current state-of-the-art NMF-based speech enhancement methods by 7% in
STOI score.
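
The multiplicative update rule mentioned in the abstract can be illustrated with plain KL-divergence NMF. The following is a minimal sketch in Python with NumPy, not the authors' NMF-HMM: it omits the HMM states and the MMSE estimator entirely. The function names (`kl_nmf`, `kl_divergence`), the rank, the iteration count and the random initialisation are assumptions chosen for illustration only.

```python
import numpy as np


def kl_divergence(V, V_hat, eps=1e-12):
    """Generalized KL divergence D(V || V_hat), summed over all entries."""
    return float(np.sum(V * np.log((V + eps) / (V_hat + eps)) - V + V_hat))


def kl_nmf(V, rank, n_iter=200, seed=0, eps=1e-12):
    """Factorize a non-negative matrix V (bins x frames) as W @ H.

    Uses the standard multiplicative updates for the KL divergence.
    This is generic KL-NMF, not the NMF-HMM model of the paper.
    """
    rng = np.random.default_rng(seed)
    n_bins, n_frames = V.shape
    # Random non-negative initialisation (an illustrative choice).
    W = rng.random((n_bins, rank)) + eps
    H = rng.random((rank, n_frames)) + eps
    history = []
    for _ in range(n_iter):
        # Update activations: H <- H * (W^T (V / WH)) / (W^T 1)
        ratio = V / (W @ H + eps)
        H *= (W.T @ ratio) / (W.sum(axis=0)[:, None] + eps)
        # Update basis: W <- W * ((V / WH) H^T) / (1 H^T)
        ratio = V / (W @ H + eps)
        W *= (ratio @ H.T) / (H.sum(axis=1)[None, :] + eps)
        history.append(kl_divergence(V, W @ H))
    return W, H, history
```

Given a non-negative magnitude spectrogram `V`, a call such as `W, H, history = kl_nmf(V, rank=3)` returns a basis matrix, an activation matrix and the KL divergence after each iteration. Because the updates only multiply by non-negative ratios, `W` and `H` stay non-negative without any explicit projection step.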

Original language	English
Title	Interspeech
Number of pages	5
Publication date	22 Oct 2020
Pages	2667-2671
Status	Published - 22 Oct 2020
Event	Interspeech 2020 - Shanghai, China Duration: 25 Oct 2020 → 29 Oct 2020

Conference

Conference	Interspeech 2020
Country/Territory	China
City	Shanghai
Period	25/10/2020 → 29/10/2020

Access to the document

https://indico2.conference4me.psnc.pl/event/35/contributions/3537/attachments/1043/1084/Wed-2-5-2.pdf

Citation formats

@inproceedings{4f30e2af47f84391a33fc3fdb40033b9,

title = "An NMF-HMM Speech Enhancement Method based on Kullback-Leibler Divergence",

abstract = "In this paper, we present a novel supervised Non-negative Matrix Factorization (NMF) speech enhancement method, which is based on a Hidden Markov Model (HMM) and the Kullback-Leibler (KL) divergence (NMF-HMM). Our algorithm applies the HMM to capture timing information, so that, in contrast to the traditional NMF-based speech enhancement method, the temporal dynamics of the speech signal are taken into account. More specifically, a sum-of-Poisson model, which leads to the KL divergence measure, is used as the observation model for each state of the HMM. This ensures that the parameter update rule of the proposed algorithm is identical to the multiplicative update rule, which is quick and efficient. In the training stage, this update rule is applied to train the NMF-HMM model. In the online enhancement stage, a novel minimum mean-square error (MMSE) estimator that incorporates the NMF-HMM is proposed to conduct speech enhancement. The performance of the proposed algorithm is evaluated by perceptual evaluation of speech quality (PESQ) and short-time objective intelligibility (STOI). The experimental results indicate that the proposed strategy outperforms current state-of-the-art NMF-based speech enhancement methods by 7% in STOI score.",

keywords = "Speech Enhancement, minimum mean-square error, Non-negative matrix factorization (NMF), Hidden Markov Model",

author = "Yang Xiang and Liming Shi and {Lisby H{\o}jvang}, Jesper and {H{\o}jfeldt Rasmussen}, Morten and Christensen, {Mads Gr{\ae}sb{\o}ll}",

year = "2020",

month = oct,

day = "22",

language = "English",

pages = "2667--2671",

booktitle = "Interspeech",

note = "Interspeech 2020 ; Conference date: 25-10-2020 Through 29-10-2020",

}

TY - GEN

T1 - An NMF-HMM Speech Enhancement Method based on Kullback-Leibler Divergence

AU - Xiang, Yang

AU - Shi, Liming

AU - Lisby Højvang, Jesper

AU - Højfeldt Rasmussen, Morten

AU - Christensen, Mads Græsbøll

PY - 2020/10/22

Y1 - 2020/10/22

N2 - In this paper, we present a novel supervised Non-negative Matrix Factorization (NMF) speech enhancement method, which is based on a Hidden Markov Model (HMM) and the Kullback-Leibler (KL) divergence (NMF-HMM). Our algorithm applies the HMM to capture timing information, so that, in contrast to the traditional NMF-based speech enhancement method, the temporal dynamics of the speech signal are taken into account. More specifically, a sum-of-Poisson model, which leads to the KL divergence measure, is used as the observation model for each state of the HMM. This ensures that the parameter update rule of the proposed algorithm is identical to the multiplicative update rule, which is quick and efficient. In the training stage, this update rule is applied to train the NMF-HMM model. In the online enhancement stage, a novel minimum mean-square error (MMSE) estimator that incorporates the NMF-HMM is proposed to conduct speech enhancement. The performance of the proposed algorithm is evaluated by perceptual evaluation of speech quality (PESQ) and short-time objective intelligibility (STOI). The experimental results indicate that the proposed strategy outperforms current state-of-the-art NMF-based speech enhancement methods by 7% in STOI score.

AB - In this paper, we present a novel supervised Non-negative Matrix Factorization (NMF) speech enhancement method, which is based on a Hidden Markov Model (HMM) and the Kullback-Leibler (KL) divergence (NMF-HMM). Our algorithm applies the HMM to capture timing information, so that, in contrast to the traditional NMF-based speech enhancement method, the temporal dynamics of the speech signal are taken into account. More specifically, a sum-of-Poisson model, which leads to the KL divergence measure, is used as the observation model for each state of the HMM. This ensures that the parameter update rule of the proposed algorithm is identical to the multiplicative update rule, which is quick and efficient. In the training stage, this update rule is applied to train the NMF-HMM model. In the online enhancement stage, a novel minimum mean-square error (MMSE) estimator that incorporates the NMF-HMM is proposed to conduct speech enhancement. The performance of the proposed algorithm is evaluated by perceptual evaluation of speech quality (PESQ) and short-time objective intelligibility (STOI). The experimental results indicate that the proposed strategy outperforms current state-of-the-art NMF-based speech enhancement methods by 7% in STOI score.

KW - Speech Enhancement

KW - minimum mean-square error

KW - Non-negative matrix factorization (NMF)

KW - Hidden Markov Model

UR - https://indico2.conference4me.psnc.pl/event/35/contributions/3537/attachments/1043/1084/Wed-2-5-2.pdf

M3 - Article in proceeding

SP - 2667

EP - 2671

BT - Interspeech

T2 - Interspeech 2020

Y2 - 25 October 2020 through 29 October 2020

ER -
