Improved single-channel speech separation using sinusoidal modeling

Pejman Mowlaee; Mads Græsbøll Christensen; Søren Holdt Jensen

doi:10.1109/ICASSP.2010.5496263

Improved single-channel speech separation using sinusoidal modeling

Pejman Mowlaee, Mads Græsbøll Christensen, Søren Holdt Jensen

Research output: Contribution to journal › Conference article in Journal › Research › peer-review

16 Citations (Scopus)

360 Downloads (Pure)

Abstract

We present a novel single-channel separation approach to improve the separation performance while recovering the signals from a mixture. The key idea in this research is to employ a mixture estimator based on unconstrained modified sinusoidal parameters. Compared to the mixmax (binary mask) and Wiener filter (softmask) approaches, the proposed approach works independently of pitch estimates. Furthermore, it is observed that it can achieve acceptable perceptual speech quality with less cross-talk at different signal-tosignal ratios while bringing down the complexity by replacing STFT with sinusoidal parameters. Improvementsmade by the proposed approach are demonstrated by employing PESQ as our objective measureand MUSHRA listening test as our subjective evaluation.

Original language	English
Journal	I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
Volume	2010
Pages (from-to)	21-24
ISSN	1520-6149
DOIs	https://doi.org/10.1109/ICASSP.2010.5496263
Publication status	Published - 14 Mar 2010
Event	2010 IEEE International Conference on Acoustics, Speech, and Signal Processing - Dallas, United States Duration: 14 Mar 2010 → 17 Mar 2010

Conference

Conference	2010 IEEE International Conference on Acoustics, Speech, and Signal Processing
Country/Territory	United States
City	Dallas
Period	14/03/2010 → 17/03/2010

Keywords

Mixture estimation
single-channel speech separation
mask-based methods
speaker codebook

Access to Document

10.1109/ICASSP.2010.5496263

Icassp2010aAccepted author manuscript, 172 KB

AUB Link

Search for the material in Aalborg University Library's search engine

Cite this

@inproceedings{1e0a66c6e75e4c158b9212aa91a618d6,

title = "Improved single-channel speech separation using sinusoidal modeling",

abstract = "We present a novel single-channel separation approach to improve the separation performance while recovering the signals from a mixture. The key idea in this research is to employ a mixture estimator based on unconstrained modified sinusoidal parameters. Compared to the mixmax (binary mask) and Wiener filter (softmask) approaches, the proposed approach works independently of pitch estimates. Furthermore, it is observed that it can achieve acceptable perceptual speech quality with less cross-talk at different signal-tosignal ratios while bringing down the complexity by replacing STFT with sinusoidal parameters. Improvementsmade by the proposed approach are demonstrated by employing PESQ as our objective measureand MUSHRA listening test as our subjective evaluation.",

keywords = "Mixture estimation, single-channel speech separation, mask-based methods, speaker codebook",

author = "Pejman Mowlaee and Christensen, {Mads Gr{\ae}sb{\o}ll} and Jensen, {S{\o}ren Holdt}",

year = "2010",

month = mar,

day = "14",

doi = "10.1109/ICASSP.2010.5496263",

language = "English",

volume = "2010",

pages = "21--24",

journal = "I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings",

issn = "1520-6149",

publisher = "IEEE Signal Processing Society",

note = "2010 IEEE International Conference on Acoustics, Speech, and Signal Processing ; Conference date: 14-03-2010 Through 17-03-2010",

}

TY - GEN

T1 - Improved single-channel speech separation using sinusoidal modeling

AU - Mowlaee, Pejman

AU - Christensen, Mads Græsbøll

AU - Jensen, Søren Holdt

PY - 2010/3/14

Y1 - 2010/3/14

N2 - We present a novel single-channel separation approach to improve the separation performance while recovering the signals from a mixture. The key idea in this research is to employ a mixture estimator based on unconstrained modified sinusoidal parameters. Compared to the mixmax (binary mask) and Wiener filter (softmask) approaches, the proposed approach works independently of pitch estimates. Furthermore, it is observed that it can achieve acceptable perceptual speech quality with less cross-talk at different signal-tosignal ratios while bringing down the complexity by replacing STFT with sinusoidal parameters. Improvementsmade by the proposed approach are demonstrated by employing PESQ as our objective measureand MUSHRA listening test as our subjective evaluation.

AB - We present a novel single-channel separation approach to improve the separation performance while recovering the signals from a mixture. The key idea in this research is to employ a mixture estimator based on unconstrained modified sinusoidal parameters. Compared to the mixmax (binary mask) and Wiener filter (softmask) approaches, the proposed approach works independently of pitch estimates. Furthermore, it is observed that it can achieve acceptable perceptual speech quality with less cross-talk at different signal-tosignal ratios while bringing down the complexity by replacing STFT with sinusoidal parameters. Improvementsmade by the proposed approach are demonstrated by employing PESQ as our objective measureand MUSHRA listening test as our subjective evaluation.

KW - Mixture estimation

KW - single-channel speech separation

KW - mask-based methods

KW - speaker codebook

U2 - 10.1109/ICASSP.2010.5496263

DO - 10.1109/ICASSP.2010.5496263

M3 - Conference article in Journal

SN - 1520-6149

VL - 2010

SP - 21

EP - 24

JO - I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings

JF - I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings

T2 - 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing

Y2 - 14 March 2010 through 17 March 2010

ER -

Improved single-channel speech separation using sinusoidal modeling

Abstract

Conference

Keywords

Access to Document

AUB Link

Fingerprint

Cite this