Sinusoidal masks for single channel speech separation

Pejman Mowlaee; Mads Græsbøll Christensen; Søren Holdt Jensen

doi:10.1109/ICASSP.2010.5495679

Sinusoidal masks for single channel speech separation

Pejman Mowlaee, Mads Græsbøll Christensen, Søren Holdt Jensen

Research output: Contribution to journal › Conference article in Journal › Research › peer-review

8 Citations (Scopus)

484 Downloads (Pure)

Abstract

In this paper we present a new approach for binary and soft masks
used in single-channel speech separation. We present a novel approach
called the sinusoidal mask (binary mask and Wiener filter)
in a sinusoidal space. Theoretical analysis is presented for the proposed
method, and we show that the proposed method is able to minimize
the target speech distortion while suppressing the crosstalk to
a predetermined threshold. It is observed that compared to the STFTbased
masks, the proposed sinusoidal masks improve the separation
performance in terms of objective measures (SSNR and PESQ) and
are mostly preferred by listeners.

Original language	English
Journal	I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
Pages (from-to)	4262-4265
ISSN	1520-6149
DOIs	https://doi.org/10.1109/ICASSP.2010.5495679
Publication status	Published - 14 Mar 2010
Event	2010 IEEE International Conference on Acoustics, Speech, and Signal Processing - Dallas, United States Duration: 14 Mar 2010 → 17 Mar 2010

Conference

Conference	2010 IEEE International Conference on Acoustics, Speech, and Signal Processing
Country/Territory	United States
City	Dallas
Period	14/03/2010 → 17/03/2010

Keywords

Mask-based method
mixture estimator
sinusoidal mask
single-channel speech separation

Access to Document

10.1109/ICASSP.2010.5495679

Icassp2010bAccepted author manuscript, 177 KB

AUB Link

Search for the material in Aalborg University Library's search engine

Cite this

@inproceedings{a0d76a6489ae4f54af7cfdafa8efb4f4,

title = "Sinusoidal masks for single channel speech separation",

abstract = "In this paper we present a new approach for binary and soft masksused in single-channel speech separation. We present a novel approachcalled the sinusoidal mask (binary mask and Wiener filter)in a sinusoidal space. Theoretical analysis is presented for the proposedmethod, and we show that the proposed method is able to minimizethe target speech distortion while suppressing the crosstalk toa predetermined threshold. It is observed that compared to the STFTbasedmasks, the proposed sinusoidal masks improve the separationperformance in terms of objective measures (SSNR and PESQ) andare mostly preferred by listeners.",

keywords = "Mask-based method, mixture estimator, sinusoidal mask, single-channel speech separation",

author = "Pejman Mowlaee and Christensen, {Mads Gr{\ae}sb{\o}ll} and Jensen, {S{\o}ren Holdt}",

year = "2010",

month = mar,

day = "14",

doi = "10.1109/ICASSP.2010.5495679",

language = "English",

pages = "4262--4265",

journal = "I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings",

issn = "1520-6149",

publisher = "IEEE Signal Processing Society",

note = "2010 IEEE International Conference on Acoustics, Speech, and Signal Processing ; Conference date: 14-03-2010 Through 17-03-2010",

}

TY - GEN

T1 - Sinusoidal masks for single channel speech separation

AU - Mowlaee, Pejman

AU - Christensen, Mads Græsbøll

AU - Jensen, Søren Holdt

PY - 2010/3/14

Y1 - 2010/3/14

N2 - In this paper we present a new approach for binary and soft masksused in single-channel speech separation. We present a novel approachcalled the sinusoidal mask (binary mask and Wiener filter)in a sinusoidal space. Theoretical analysis is presented for the proposedmethod, and we show that the proposed method is able to minimizethe target speech distortion while suppressing the crosstalk toa predetermined threshold. It is observed that compared to the STFTbasedmasks, the proposed sinusoidal masks improve the separationperformance in terms of objective measures (SSNR and PESQ) andare mostly preferred by listeners.

AB - In this paper we present a new approach for binary and soft masksused in single-channel speech separation. We present a novel approachcalled the sinusoidal mask (binary mask and Wiener filter)in a sinusoidal space. Theoretical analysis is presented for the proposedmethod, and we show that the proposed method is able to minimizethe target speech distortion while suppressing the crosstalk toa predetermined threshold. It is observed that compared to the STFTbasedmasks, the proposed sinusoidal masks improve the separationperformance in terms of objective measures (SSNR and PESQ) andare mostly preferred by listeners.

KW - Mask-based method

KW - mixture estimator

KW - sinusoidal mask

KW - single-channel speech separation

U2 - 10.1109/ICASSP.2010.5495679

DO - 10.1109/ICASSP.2010.5495679

M3 - Conference article in Journal

SN - 1520-6149

SP - 4262

EP - 4265

JO - I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings

JF - I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings

T2 - 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing

Y2 - 14 March 2010 through 17 March 2010

ER -

Sinusoidal masks for single channel speech separation

Abstract

Conference

Keywords

Access to Document

AUB Link

Fingerprint

Cite this