Abstract
In this work, we present the system description of the UIAI entry for the short-duration speaker verification (SdSV) challenge 2020. Our focus is on Task 1 dedicated to text-dependent speaker verification. We investigate different feature extraction and modeling approaches for automatic speaker verification (ASV) and utterance verification (UV). We have also studied different fusion strategies for combining UV and ASV modules. Our primary submission to the challenge is the fusion of seven subsystems which yields a normalized minimum detection cost function (minDCF) of 0.072 and an equal error rate (EER) of 2.14% on the evaluation set. The single system consisting of a pass-phrase identification based model with phone-discriminative bottleneck features gives a normalized minDCF of 0.118 and achieves 19% relative improvement over the state-of-the-art challenge baseline.
Original language | English |
---|---|
Title of host publication | 2021 IEEE Spoken Language Technology Workshop (SLT) |
Number of pages | 7 |
Publisher | IEEE (Institute of Electrical and Electronics Engineers) |
Publication date | 25 Mar 2021 |
Pages | 323-329 |
Article number | 9383596 |
ISBN (Print) | 978-1-7281-7067-1 |
ISBN (Electronic) | 978-1-7281-7066-4 |
DOIs | |
Publication status | Published - 25 Mar 2021 |
Event | 2021 IEEE Spoken Language Technology Workshop (SLT) - Shenzhen, China Duration: 19 Jan 2021 → 22 Jan 2021 |
Conference
Conference | 2021 IEEE Spoken Language Technology Workshop (SLT) |
---|---|
Country/Territory | China |
City | Shenzhen |
Period | 19/01/2021 → 22/01/2021 |
Keywords
- Bottleneck feature
- Fusion
- SdSV challenge 2020
- Text-dependent speaker verification
- Utterance verification