Sinusoidal masks for single channel speech separation

Pejman Mowlaee, Mads Græsbøll Christensen, Søren Holdt Jensen

Research output: Contribution to journalConference article in JournalResearchpeer-review

8 Citations (Scopus)
484 Downloads (Pure)

Abstract

In this paper we present a new approach for binary and soft masks
used in single-channel speech separation. We present a novel approach
called the sinusoidal mask (binary mask and Wiener filter)
in a sinusoidal space. Theoretical analysis is presented for the proposed
method, and we show that the proposed method is able to minimize
the target speech distortion while suppressing the crosstalk to
a predetermined threshold. It is observed that compared to the STFTbased
masks, the proposed sinusoidal masks improve the separation
performance in terms of objective measures (SSNR and PESQ) and
are mostly preferred by listeners.
Original languageEnglish
JournalI E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
Pages (from-to)4262-4265
ISSN1520-6149
DOIs
Publication statusPublished - 14 Mar 2010
Event2010 IEEE International Conference on Acoustics, Speech, and Signal Processing - Dallas, United States
Duration: 14 Mar 201017 Mar 2010

Conference

Conference2010 IEEE International Conference on Acoustics, Speech, and Signal Processing
Country/TerritoryUnited States
CityDallas
Period14/03/201017/03/2010

Keywords

  • Mask-based method
  • mixture estimator
  • sinusoidal mask
  • single-channel speech separation

Fingerprint

Dive into the research topics of 'Sinusoidal masks for single channel speech separation'. Together they form a unique fingerprint.

Cite this