Improved single-channel speech separation using sinusoidal modeling

Pejman Mowlaee, Mads Græsbøll Christensen, Søren Holdt Jensen

Research output: Contribution to journalConference article in JournalResearchpeer-review

16 Citations (Scopus)
360 Downloads (Pure)

Abstract

We present a novel single-channel separation approach to improve the separation performance while recovering the signals from a mixture. The key idea in this research is to employ a mixture estimator based on unconstrained modified sinusoidal parameters. Compared to the mixmax (binary mask) and Wiener filter (softmask) approaches, the proposed approach works independently of pitch estimates. Furthermore, it is observed that it can achieve acceptable perceptual speech quality with less cross-talk at different signal-tosignal ratios while bringing down the complexity by replacing STFT with sinusoidal parameters. Improvementsmade by the proposed approach are demonstrated by employing PESQ as our objective measureand MUSHRA listening test as our subjective evaluation.
Original languageEnglish
JournalI E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
Volume2010
Pages (from-to)21-24
ISSN1520-6149
DOIs
Publication statusPublished - 14 Mar 2010
Event2010 IEEE International Conference on Acoustics, Speech, and Signal Processing - Dallas, United States
Duration: 14 Mar 201017 Mar 2010

Conference

Conference2010 IEEE International Conference on Acoustics, Speech, and Signal Processing
Country/TerritoryUnited States
CityDallas
Period14/03/201017/03/2010

Keywords

  • Mixture estimation
  • single-channel speech separation
  • mask-based methods
  • speaker codebook

Fingerprint

Dive into the research topics of 'Improved single-channel speech separation using sinusoidal modeling'. Together they form a unique fingerprint.

Cite this