A MAP Criterion for Detecting the Number of Speakers at frame level in Model-based Single-Channel Speech Separation

Pejman Mowlaee, Mads Græsbøll Christensen, Zheng-Hua Tan, Søren Holdt Jensen

Publikation: Bidrag til tidsskriftKonferenceartikel i tidsskriftForskningpeer review

7 Citationer (Scopus)
341 Downloads (Pure)

Abstract

The problem of detecting the number of speakers for a particular segment occurs in many dif-
ferent speech applications. In single channel speech separation, for example, this information is
often used to simplify the separation process, as the signal has to be treated differently depending
on the number of speakers. Inspired by the asymptotic maximum a posteriori rule proposed for
model selection, we pose the problem as a model selection problem. More specifically, we derive
a multiple hypotheses test for determining the number of speakers at a frame level in an observed
signal based on underlying parametric speaker models, trained a priori. The experimental results
indicate that the suggested method improves the quality of the separated signals in a single-channel
speech separation scenario at different signal-to-signal ratio levels.
OriginalsprogEngelsk
TidsskriftAsilomar Conference on Signals, Systems and Computers. Conference Record
Sider (fra-til)538 - 541
ISSN1058-6393
DOI
StatusUdgivet - 2010
Begivenhed44th Asilomar Conference on Signals, Systems and Computers - Pacific Grove, USA
Varighed: 7 nov. 201010 nov. 2010

Konference

Konference44th Asilomar Conference on Signals, Systems and Computers
Land/OmrådeUSA
ByPacific Grove
Periode07/11/201010/11/2010

Fingeraftryk

Dyk ned i forskningsemnerne om 'A MAP Criterion for Detecting the Number of Speakers at frame level in Model-based Single-Channel Speech Separation'. Sammen danner de et unikt fingeraftryk.

Citationsformater