Joint Single-Channel Speech Separation and Speaker Identification

Pejman Mowlaee; Rahim Saeidi; Zheng-Hua Tan; Mads Græsbøll Christensen; Pasi Fränti; Søren Holdt Jensen

doi:10.1109/ICASSP.2010.5495619

Publication: Contribution to journal › Conference article in journal › Research › peer-review

17 Citations (Scopus)

941 Downloads (Pure)

Abstract

In this paper, we propose a closed loop system to improve the performance
of single-channel speech separation in a speaker independent
scenario. The system is composed of two interconnected blocks: a
separation block and a speaker identification block. The improvement
is accomplished by incorporating the speaker identities found
by the speaker identification block as additional information for the
separation block, which converts the speaker-independent separation
problem to a speaker-dependent one where the speaker codebooks
are known. Simulation results show that the closed loop system enhances
the quality of the separated output signals. To assess the improvements,
the results are reported in terms of PESQ for both target
and masked signals.
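
The closed-loop idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the function names (`separate`, `identify`, `closed_loop`), the additive-spectrum mixing model, the pooled codebook used for the first pass, and the toy codebooks are all assumptions made for illustration. It only shows the control flow: a speaker-independent separation pass, speaker identification on the rough estimates, then a speaker-dependent pass with the identified speakers' codebooks.

```python
from itertools import product


def dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def separate(frame, book_a, book_b):
    """Pick the codeword pair whose element-wise sum best matches the frame."""
    return min(product(book_a, book_b),
               key=lambda pair: dist(frame, [p + q for p, q in zip(*pair)]))


def identify(frames, codebooks):
    """Return the speaker whose codebook quantizes the frames with least error."""
    def distortion(book):
        return sum(min(dist(f, c) for c in book) for f in frames)
    return min(codebooks, key=lambda spk: distortion(codebooks[spk]))


def closed_loop(mixture, codebooks):
    # Pass 1 (speaker-independent): search one codebook pooled over all speakers.
    pooled = [c for book in codebooks.values() for c in book]
    first = [separate(f, pooled, pooled) for f in mixture]
    # Identification block: label each rough estimate with its closest codebook.
    spk_1 = identify([a for a, _ in first], codebooks)
    spk_2 = identify([b for _, b in first], codebooks)
    # Pass 2 (speaker-dependent): search only the identified speakers' codebooks.
    second = [separate(f, codebooks[spk_1], codebooks[spk_2]) for f in mixture]
    return (spk_1, spk_2), second


# Hypothetical toy data: three speakers, each occupying one spectral bin.
toy_codebooks = {
    "alice": [[1, 0, 0], [2, 0, 0]],
    "bob": [[0, 1, 0], [0, 2, 0]],
    "carol": [[0, 0, 1], [0, 0, 2]],
}
toy_mixture = [[1, 0, 2], [2, 0, 1]]  # alice + carol, frame by frame

speakers, parts = closed_loop(toy_mixture, toy_codebooks)
print(speakers)  # prints ('alice', 'carol')
```

The toy data is trivially separable in both passes, so the sketch demonstrates the loop structure rather than any quality gain. In this sketch the second pass also searches a much smaller set of codeword pairs than the pooled first pass.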

Original language	English
Journal	I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
Volume	2010
Pages (from-to)	4430 - 4433
ISSN	1520-6149
DOI	https://doi.org/10.1109/ICASSP.2010.5495619
Status	Published - 2010
Event	2010 IEEE International Conference on Acoustics, Speech, and Signal Processing - Dallas, USA Duration: 14 Mar 2010 → 17 Mar 2010

Conference

Conference	2010 IEEE International Conference on Acoustics, Speech, and Signal Processing
Country/Territory	USA
City	Dallas
Period	14/03/2010 → 17/03/2010

Access to document

10.1109/ICASSP.2010.5495619

05597373 Submitted manuscript, 830 KB

Citation formats

@inproceedings{59729371a9224edc930b925e1074df9c,

title = "Joint Single-Channel Speech Separation and Speaker Identification",

abstract = "In this paper, we propose a closed loop system to improve the performance of single-channel speech separation in a speaker independent scenario. The system is composed of two interconnected blocks: a separation block and a speaker identification block. The improvement is accomplished by incorporating the speaker identities found by the speaker identification block as additional information for the separation block, which converts the speaker-independent separation problem to a speaker-dependent one where the speaker codebooks are known. Simulation results show that the closed loop system enhances the quality of the separated output signals. To assess the improvements, the results are reported in terms of PESQ for both target and masked signals.",

keywords = "Single-channel speech separation, speaker identification, sinusoidal mixture estimator, vector quantization, Gaussian mixture model",

author = "Pejman Mowlaee and Rahim Saeidi and Zheng-Hua Tan and Christensen, {Mads Gr{\ae}sb{\o}ll} and Pasi Fr{\"a}nti and Jensen, {S{\o}ren Holdt}",

year = "2010",

doi = "10.1109/ICASSP.2010.5495619",

language = "English",

volume = "2010",

pages = "4430--4433",

booktitle = "I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings",

issn = "1520-6149",

publisher = "IEEE Signal Processing Society",

note = "2010 IEEE International Conference on Acoustics, Speech, and Signal Processing ; Conference date: 14-03-2010 Through 17-03-2010",

}

TY - GEN

T1 - Joint Single-Channel Speech Separation and Speaker Identification

AU - Mowlaee, Pejman

AU - Saeidi, Rahim

AU - Tan, Zheng-Hua

AU - Christensen, Mads Græsbøll

AU - Fränti, Pasi

AU - Jensen, Søren Holdt

PY - 2010

Y1 - 2010

N2 - In this paper, we propose a closed loop system to improve the performance of single-channel speech separation in a speaker independent scenario. The system is composed of two interconnected blocks: a separation block and a speaker identification block. The improvement is accomplished by incorporating the speaker identities found by the speaker identification block as additional information for the separation block, which converts the speaker-independent separation problem to a speaker-dependent one where the speaker codebooks are known. Simulation results show that the closed loop system enhances the quality of the separated output signals. To assess the improvements, the results are reported in terms of PESQ for both target and masked signals.

AB - In this paper, we propose a closed loop system to improve the performance of single-channel speech separation in a speaker independent scenario. The system is composed of two interconnected blocks: a separation block and a speaker identification block. The improvement is accomplished by incorporating the speaker identities found by the speaker identification block as additional information for the separation block, which converts the speaker-independent separation problem to a speaker-dependent one where the speaker codebooks are known. Simulation results show that the closed loop system enhances the quality of the separated output signals. To assess the improvements, the results are reported in terms of PESQ for both target and masked signals.

KW - Single-channel speech separation

KW - speaker identification

KW - sinusoidal mixture estimator

KW - vector quantization

KW - Gaussian mixture model

U2 - 10.1109/ICASSP.2010.5495619

DO - 10.1109/ICASSP.2010.5495619

M3 - Conference article in Journal

SN - 1520-6149

VL - 2010

SP - 4430

EP - 4433

JO - I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings

JF - I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings

T2 - 2010 IEEE International Conference on Acoustics, Speech, and Signal Processing

Y2 - 14 March 2010 through 17 March 2010

ER -
