A Perceptually Motivated LP Residual Estimator in Noisy and Reverberant Environments

Renhua Peng; Zheng-Hua Tan; Xiaodong Li; Chengshi Zheng

doi:10.1016/j.specom.2017.12.004

A Perceptually Motivated LP Residual Estimator in Noisy and Reverberant Environments

Renhua Peng, Zheng-Hua Tan, Xiaodong Li, Chengshi Zheng

Publikation: Bidrag til tidsskrift › Tidsskriftartikel › Forskning › peer review

7 Citationer (Scopus)

Abstract

Both reverberation and additive noise can degrade the quality of recorded speech and thus should be suppressed simultaneously. Previous studies have shown that the generalized singular value decomposition (GSVD) has the capability of suppressing the additive noise effectively, but it is not often applied for speech dereverberation since reverberation is considered to be convolutive as well as colored noise. Recently, we revealed that late reverberation is also additive and relatively white interference component in the linear prediction (LP) residual domain. To suppress both late reverberation and additive noise, we have proposed an optimal filter for LP residual estimator (LPRE) based on a constrained minimum mean square error (CMMSE) by using GSVD in single channel speech enhancement, where the algorithm is referred as CMMSE-GSVD-LPRE. Experimental results have shown a better performance of the CMMSE-GSVD-LPRE than spectral subtraction methods, but some residual noise and reverberation components are still audible and annoying. To solve this problem, this paper incorporates the masking properties of the human auditory system in the LP residual domain to further suppress these residual noise and reverberation components while reducing speech distortion at the same time. Various simulation experiments are conducted, and the results show an improved performance of the proposed algorithm. Experimental results with speech recorded in noisy and reverberant environments further confirm the effectiveness of the proposed algorithm in real-world environments.

Originalsprog	Engelsk
Tidsskrift	Speech Communication
Vol/bind	96
Sider (fra-til)	129-141
Antal sider	13
ISSN	0167-6393
DOI	https://doi.org/10.1016/j.specom.2017.12.004
Status	Udgivet - feb. 2018

Adgang til dokumentet

10.1016/j.specom.2017.12.004

AUB Link

Søg efter materialet i Aalborg Universitetsbiblioteks søgemaskine

Andre filer og links

http://www.scopus.com/inward/record.url?scp=85038031269&partnerID=8YFLogxK

Citationsformater

@article{8ee6d24012c24f23a249cc6b2f13b061,

title = "A Perceptually Motivated LP Residual Estimator in Noisy and Reverberant Environments",

abstract = "Both reverberation and additive noise can degrade the quality of recorded speech and thus should be suppressed simultaneously. Previous studies have shown that the generalized singular value decomposition (GSVD) has the capability of suppressing the additive noise effectively, but it is not often applied for speech dereverberation since reverberation is considered to be convolutive as well as colored noise. Recently, we revealed that late reverberation is also additive and relatively white interference component in the linear prediction (LP) residual domain. To suppress both late reverberation and additive noise, we have proposed an optimal filter for LP residual estimator (LPRE) based on a constrained minimum mean square error (CMMSE) by using GSVD in single channel speech enhancement, where the algorithm is referred as CMMSE-GSVD-LPRE. Experimental results have shown a better performance of the CMMSE-GSVD-LPRE than spectral subtraction methods, but some residual noise and reverberation components are still audible and annoying. To solve this problem, this paper incorporates the masking properties of the human auditory system in the LP residual domain to further suppress these residual noise and reverberation components while reducing speech distortion at the same time. Various simulation experiments are conducted, and the results show an improved performance of the proposed algorithm. Experimental results with speech recorded in noisy and reverberant environments further confirm the effectiveness of the proposed algorithm in real-world environments.",

keywords = "Auditory masking, Generalized singular value decomposition, Linear prediction residual, Minimum mean square error, Speech dereverberation",

author = "Renhua Peng and Zheng-Hua Tan and Xiaodong Li and Chengshi Zheng",

year = "2018",

month = feb,

doi = "10.1016/j.specom.2017.12.004",

language = "English",

volume = "96",

pages = "129--141",

journal = "Speech Communication",

issn = "0167-6393",

publisher = "Elsevier",

}

TY - JOUR

T1 - A Perceptually Motivated LP Residual Estimator in Noisy and Reverberant Environments

AU - Peng, Renhua

AU - Tan, Zheng-Hua

AU - Li, Xiaodong

AU - Zheng, Chengshi

PY - 2018/2

Y1 - 2018/2

N2 - Both reverberation and additive noise can degrade the quality of recorded speech and thus should be suppressed simultaneously. Previous studies have shown that the generalized singular value decomposition (GSVD) has the capability of suppressing the additive noise effectively, but it is not often applied for speech dereverberation since reverberation is considered to be convolutive as well as colored noise. Recently, we revealed that late reverberation is also additive and relatively white interference component in the linear prediction (LP) residual domain. To suppress both late reverberation and additive noise, we have proposed an optimal filter for LP residual estimator (LPRE) based on a constrained minimum mean square error (CMMSE) by using GSVD in single channel speech enhancement, where the algorithm is referred as CMMSE-GSVD-LPRE. Experimental results have shown a better performance of the CMMSE-GSVD-LPRE than spectral subtraction methods, but some residual noise and reverberation components are still audible and annoying. To solve this problem, this paper incorporates the masking properties of the human auditory system in the LP residual domain to further suppress these residual noise and reverberation components while reducing speech distortion at the same time. Various simulation experiments are conducted, and the results show an improved performance of the proposed algorithm. Experimental results with speech recorded in noisy and reverberant environments further confirm the effectiveness of the proposed algorithm in real-world environments.

AB - Both reverberation and additive noise can degrade the quality of recorded speech and thus should be suppressed simultaneously. Previous studies have shown that the generalized singular value decomposition (GSVD) has the capability of suppressing the additive noise effectively, but it is not often applied for speech dereverberation since reverberation is considered to be convolutive as well as colored noise. Recently, we revealed that late reverberation is also additive and relatively white interference component in the linear prediction (LP) residual domain. To suppress both late reverberation and additive noise, we have proposed an optimal filter for LP residual estimator (LPRE) based on a constrained minimum mean square error (CMMSE) by using GSVD in single channel speech enhancement, where the algorithm is referred as CMMSE-GSVD-LPRE. Experimental results have shown a better performance of the CMMSE-GSVD-LPRE than spectral subtraction methods, but some residual noise and reverberation components are still audible and annoying. To solve this problem, this paper incorporates the masking properties of the human auditory system in the LP residual domain to further suppress these residual noise and reverberation components while reducing speech distortion at the same time. Various simulation experiments are conducted, and the results show an improved performance of the proposed algorithm. Experimental results with speech recorded in noisy and reverberant environments further confirm the effectiveness of the proposed algorithm in real-world environments.

KW - Auditory masking

KW - Generalized singular value decomposition

KW - Linear prediction residual

KW - Minimum mean square error

KW - Speech dereverberation

UR - http://www.scopus.com/inward/record.url?scp=85038031269&partnerID=8YFLogxK

U2 - 10.1016/j.specom.2017.12.004

DO - 10.1016/j.specom.2017.12.004

M3 - Journal article

SN - 0167-6393

VL - 96

SP - 129

EP - 141

JO - Speech Communication

JF - Speech Communication

ER -

A Perceptually Motivated LP Residual Estimator in Noisy and Reverberant Environments

Abstract

Adgang til dokumentet

AUB Link

Andre filer og links

Fingeraftryk

Citationsformater