Sinusoidal masks for single channel speech separation

Pejman Mowlaee, Mads Græsbøll Christensen, Søren Holdt Jensen

Publication: Contribution to journal, Conference article in journal, Research, peer-reviewed

8 Citations (Scopus)
482 Downloads (Pure)

Abstract

In this paper, we present a new approach to the binary and soft masks
used in single-channel speech separation. The approach, called the
sinusoidal mask, applies the binary mask and the Wiener filter
in a sinusoidal space. A theoretical analysis of the proposed method
is presented, and we show that the method is able to minimize
the target speech distortion while suppressing the crosstalk to
a predetermined threshold. Compared with STFT-based masks, the
proposed sinusoidal masks improve the separation performance in terms
of objective measures (SSNR and PESQ) and are mostly preferred by
listeners.
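
For readers unfamiliar with the two mask types named in the abstract, the following is a minimal sketch of the conventional binary mask and Wiener-type soft mask, computed per time-frequency bin. It is not the paper's sinusoidal-space method, whose details are not given in this record. The function names, the example magnitudes, and the assumption that magnitudes add in the mixture are all hypothetical and chosen only for illustration.

```python
import numpy as np

def binary_mask(target_mag, interferer_mag):
    """Return 1.0 where the target magnitude is at least the interferer's, else 0.0."""
    return (np.asarray(target_mag) >= np.asarray(interferer_mag)).astype(float)

def wiener_mask(target_mag, interferer_mag, eps=1e-12):
    """Soft mask: target power divided by total power in each bin."""
    t = np.asarray(target_mag, dtype=float) ** 2
    i = np.asarray(interferer_mag, dtype=float) ** 2
    return t / (t + i + eps)  # eps avoids division by zero in silent bins

# Hypothetical magnitudes for four time-frequency bins (illustrative values only).
target = np.array([1.0, 0.2, 3.0, 0.0])
interferer = np.array([0.5, 0.8, 3.0, 1.0])
mixture = target + interferer  # crude assumption: magnitudes add in the mixture

bm = binary_mask(target, interferer)
wm = wiener_mask(target, interferer)

# Apply each mask to the mixture to estimate the target magnitude.
est_binary = bm * mixture
est_soft = wm * mixture
```

The binary mask keeps or discards each bin outright, while the soft mask scales each bin by the estimated share of target power. In practice the true source magnitudes are unknown and must be estimated.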

Original language: English
Journal: I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
Pages (from-to): 4262-4265
ISSN: 1520-6149
DOI
Status: Published - 14 Mar 2010
Event: 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing - Dallas, USA
Duration: 14 Mar 2010 to 17 Mar 2010

Conference

Conference: 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing
Country/Territory: USA
City: Dallas
Period: 14/03/2010 to 17/03/2010
