Variable Span Filters for Speech Enhancement

Jesper Rindom Jensen; Jacob Benesty; Mads Græsbøll Christensen

doi:10.1109/ICASSP.2016.7472930

Variable Span Filters for Speech Enhancement

Jesper Rindom Jensen, Jacob Benesty, Mads Græsbøll Christensen

Research output: Contribution to journal › Conference article in Journal › Research › peer-review

1 Citation (Scopus)

404 Downloads (Pure)

Abstract

In this work, we consider enhancement of multichannel speech recordings. Linear filtering and subspace approaches have been considered previously for solving the problem. The current linear filtering methods, although many variants exist, have limited control of noise reduction and speech distortion. Subspace approaches, on the other hand, can potentially yield better control by filtering in the eigen-domain, but traditionally these approaches have not been optimized explicitly for traditional noise reduction and signal distortion measures. Herein, we combine these approaches by deriving optimal filters using a joint diagonalization as a basis. This gives excellent control over the performance, as we can optimize for noise reduction or signal distortion performance. Results from real data experiments show that the proposed variable span filters can achieve better performance than existing filters. In terms of output SNR, the gain was more than 8~dB, and more than 0.1 in mean opinion score in the conducted experiments.

Original language	English
Journal	I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
Number of pages	5
ISSN	1520-6149
DOIs	https://doi.org/10.1109/ICASSP.2016.7472930
Publication status	Published - Mar 2016
Event	The 41st IEEE International Conference on Acoustics, Speech and Signal Processing - Shanghai, China Duration: 20 Mar 2016 → 25 Mar 2016 http://www.icassp2016.org/

Conference

Conference	The 41st IEEE International Conference on Acoustics, Speech and Signal Processing
Country/Territory	China
City	Shanghai
Period	20/03/2016 → 25/03/2016
Internet address	http://www.icassp2016.org/

Keywords

speech enhancement
joint diagonlization
optimal filtering
multichannel enhancement
tradeoff filter

Access to Document

10.1109/ICASSP.2016.7472930

Jensen_et_al_ICASSP2016_varSpan_revAccepted author manuscript, 221 KB

AUB Link

Search for the material in Aalborg University Library's search engine

Localization and Tracking of Speech - a Joint Audio-Visual Approach
Jensen, J. R.
01/10/2013 → 30/09/2016
Project: Research
Spatio-Temporal Filtering Methods for Enhancement and Separation of Speech Signals
Christensen, M. G., Nørholm, S. M., Karimian-Azari, S. & Jensen, J. R.
01/08/2012 → 30/06/2015
Project: Research

Cite this

@inproceedings{c0713686d024421687f962f03b6da195,

title = "Variable Span Filters for Speech Enhancement",

abstract = "In this work, we consider enhancement of multichannel speech recordings. Linear filtering and subspace approaches have been considered previously for solving the problem. The current linear filtering methods, although many variants exist, have limited control of noise reduction and speech distortion. Subspace approaches, on the other hand, can potentially yield better control by filtering in the eigen-domain, but traditionally these approaches have not been optimized explicitly for traditional noise reduction and signal distortion measures. Herein, we combine these approaches by deriving optimal filters using a joint diagonalization as a basis. This gives excellent control over the performance, as we can optimize for noise reduction or signal distortion performance. Results from real data experiments show that the proposed variable span filters can achieve better performance than existing filters. In terms of output SNR, the gain was more than 8~dB, and more than 0.1 in mean opinion score in the conducted experiments.",

keywords = "speech enhancement, joint diagonalization, optimal filtering, multichannel enhancement, tradeoff filter, speech enhancement, joint diagonlization, optimal filtering, multichannel enhancement, tradeoff filter",

author = "Jensen, {Jesper Rindom} and Jacob Benesty and Christensen, {Mads Gr{\ae}sb{\o}ll}",

year = "2016",

month = mar,

doi = "10.1109/ICASSP.2016.7472930",

language = "English",

journal = "I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings",

issn = "1520-6149",

publisher = "IEEE Signal Processing Society",

note = "The 41st IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2016 ; Conference date: 20-03-2016 Through 25-03-2016",

url = "http://www.icassp2016.org/",

}

TY - GEN

T1 - Variable Span Filters for Speech Enhancement

AU - Jensen, Jesper Rindom

AU - Benesty, Jacob

AU - Christensen, Mads Græsbøll

PY - 2016/3

Y1 - 2016/3

N2 - In this work, we consider enhancement of multichannel speech recordings. Linear filtering and subspace approaches have been considered previously for solving the problem. The current linear filtering methods, although many variants exist, have limited control of noise reduction and speech distortion. Subspace approaches, on the other hand, can potentially yield better control by filtering in the eigen-domain, but traditionally these approaches have not been optimized explicitly for traditional noise reduction and signal distortion measures. Herein, we combine these approaches by deriving optimal filters using a joint diagonalization as a basis. This gives excellent control over the performance, as we can optimize for noise reduction or signal distortion performance. Results from real data experiments show that the proposed variable span filters can achieve better performance than existing filters. In terms of output SNR, the gain was more than 8~dB, and more than 0.1 in mean opinion score in the conducted experiments.

AB - In this work, we consider enhancement of multichannel speech recordings. Linear filtering and subspace approaches have been considered previously for solving the problem. The current linear filtering methods, although many variants exist, have limited control of noise reduction and speech distortion. Subspace approaches, on the other hand, can potentially yield better control by filtering in the eigen-domain, but traditionally these approaches have not been optimized explicitly for traditional noise reduction and signal distortion measures. Herein, we combine these approaches by deriving optimal filters using a joint diagonalization as a basis. This gives excellent control over the performance, as we can optimize for noise reduction or signal distortion performance. Results from real data experiments show that the proposed variable span filters can achieve better performance than existing filters. In terms of output SNR, the gain was more than 8~dB, and more than 0.1 in mean opinion score in the conducted experiments.

KW - speech enhancement

KW - joint diagonalization

KW - optimal filtering

KW - multichannel enhancement

KW - tradeoff filter

KW - speech enhancement

KW - joint diagonlization

KW - optimal filtering

KW - multichannel enhancement

KW - tradeoff filter

U2 - 10.1109/ICASSP.2016.7472930

DO - 10.1109/ICASSP.2016.7472930

M3 - Conference article in Journal

SN - 1520-6149

JO - I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings

JF - I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings

T2 - The 41st IEEE International Conference on Acoustics, Speech and Signal Processing

Y2 - 20 March 2016 through 25 March 2016

ER -

Variable Span Filters for Speech Enhancement

Abstract

Conference

Keywords

Access to Document

AUB Link

Fingerprint

Projects

Localization and Tracking of Speech - a Joint Audio-Visual Approach

Spatio-Temporal Filtering Methods for Enhancement and Separation of Speech Signals

Cite this