Signal-to-Signal Ratio Independent Speaker Identification for Co-channel Speech Signals

Rahim Saeidi, Pejman Mowlaee, Tomi Kinnunen, Zheng-Hua Tan, Mads Græsbøll Christensen, Søren Holdt Jensen, Pasi Fränti

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

19 Citations (Scopus)
562 Downloads (Pure)

Abstract

In this paper, we consider speaker identification
for the co-channel scenario in which speech mixture from
speakers is recorded by one microphone only. The goal is to
identify both of the speakers from their mixed signal. High
recognition accuracies have already been reported when an
accurately estimated signal-to-signal ratio (SSR) is available. In
this paper, we approach the problem without estimating SSR.
We show that a simple method based on fusion of adapted
Gaussian mixture models and Kullback-Leibler divergence
calculated between models, achieves an accuracy of 97% and
93% when the two target speakers enlisted as three and two
most probable speakers, respectively.
Original languageEnglish
Title of host publicationIEEE International Conference on Pattern Recognition (ICPR 2010) : Proceedings
PublisherIEEE Press
Publication date2010
Pages4565-4568
ISBN (Electronic)978-0-7695-4109-9
DOIs
Publication statusPublished - 2010
Event20 th International Conference on Pattern Recognition (ICPR) - Istanbul, Turkey
Duration: 23 Aug 201026 Aug 2010

Conference

Conference20 th International Conference on Pattern Recognition (ICPR)
Country/TerritoryTurkey
CityIstanbul
Period23/08/201026/08/2010
SeriesProceeding IEEE International Conference on Pattern Recognition (ICPR)
ISSN1051-4651

Fingerprint

Dive into the research topics of 'Signal-to-Signal Ratio Independent Speaker Identification for Co-channel Speech Signals'. Together they form a unique fingerprint.

Cite this