Signal-to-Signal Ratio Independent Speaker Identification for Co-channel Speech Signals

Rahim Saeidi, Pejman Mowlaee, Tomi Kinnunen, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen, Pasi Fränti

Publikation: Bidrag til bog/antologi/rapport/konference proceedingKonferenceartikel i proceedingForskningpeer review

19 Citationer (Scopus)
562 Downloads (Pure)

Abstract

In this paper, we consider speaker identification
for the co-channel scenario in which speech mixture from
speakers is recorded by one microphone only. The goal is to
identify both of the speakers from their mixed signal. High
recognition accuracies have already been reported when an
accurately estimated signal-to-signal ratio (SSR) is available. In
this paper, we approach the problem without estimating SSR.
We show that a simple method based on fusion of adapted
Gaussian mixture models and Kullback-Leibler divergence
calculated between models, achieves an accuracy of 97% and
93% when the two target speakers enlisted as three and two
most probable speakers, respectively.
OriginalsprogEngelsk
TitelIEEE International Conference on Pattern Recognition (ICPR 2010) : Proceedings
ForlagIEEE Press
Publikationsdato2010
Sider4565-4568
ISBN (Elektronisk)978-0-7695-4109-9
DOI
StatusUdgivet - 2010
Begivenhed20 th International Conference on Pattern Recognition (ICPR) - Istanbul, Tyrkiet
Varighed: 23 aug. 201026 aug. 2010

Konference

Konference20 th International Conference on Pattern Recognition (ICPR)
Land/OmrådeTyrkiet
ByIstanbul
Periode23/08/201026/08/2010
NavnProceeding IEEE International Conference on Pattern Recognition (ICPR)
ISSN1051-4651

Fingeraftryk

Dyk ned i forskningsemnerne om 'Signal-to-Signal Ratio Independent Speaker Identification for Co-channel Speech Signals'. Sammen danner de et unikt fingeraftryk.

Citationsformater