Abstract
This paper describes a novel approach to improve monoaural
speaker identification where two speakers are present in a
single-microphone recording. The goal is to identify both of
the underlying speakers in the given mixture. The proposed
approach is composed of a double-talk detector (DTD) as a preprocessor
and speaker identification back-end. We demonstrate
that including the double-talk detector improves the speaker
identification accuracy. Experiments on GRID corpus show that
including the DTD improves average recognition accuracy from
96.53% to 97.43%.
speaker identification where two speakers are present in a
single-microphone recording. The goal is to identify both of
the underlying speakers in the given mixture. The proposed
approach is composed of a double-talk detector (DTD) as a preprocessor
and speaker identification back-end. We demonstrate
that including the double-talk detector improves the speaker
identification accuracy. Experiments on GRID corpus show that
including the DTD improves average recognition accuracy from
96.53% to 97.43%.
Originalsprog | Engelsk |
---|---|
Tidsskrift | Proceedings of the International Conference on Spoken Language Processing |
Sider (fra-til) | 1069-1072 |
ISSN | 1990-9772 |
Status | Udgivet - 26 sep. 2010 |
Begivenhed | Interspeech 2010 - Makuhari, Japan Varighed: 26 sep. 2010 → 30 sep. 2010 |
Konference
Konference | Interspeech 2010 |
---|---|
Land/Område | Japan |
By | Makuhari |
Periode | 26/09/2010 → 30/09/2010 |