Multi-Task Adversarial Network Bottleneck Features for Noise-Robust Speaker Verification

Hong Yu, Tianrui Hu, Zhanyu Ma, Zheng-Hua Tan, Jun Guo

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

3 Citations (Scopus)

Abstract

Modern automatic speaker verification (ASV) systems need to be robust under various noisy conditions. Motivated by the success of generative adversarial networks (GANs), this paper proposes a multi-task adversarial network (MAN) for extracting noise-invariant bottleneck (BN) features. The MAN consists of three component networks, a feature encoding network (FEN), a speaker discriminative network (SDN) and a noise-domain adaptation network (NAN). The FEN aims to generate noise-robustness BN features, the SDN makes the features from the FEN more speaker-discriminative and the NAN guides the FEN to learn more noise-invariant feature representations. The MAN is trained using an adversarial method. When training FEN and SDN, speaker identities and the label of being clean speech are used as target labels, which can make BN features, extracted from noisy or clean speech, similar. When training NAN, on the contrary, noise types are used as training targets. We evaluate the newly proposed MAN-BN feature extraction method on a Gaussian mixture model-universal background model (GMM-UBM) based ASV system. The experimental results on the RSR2015 database show that the proposed MAN-BN feature can dramatically improve the accuracy of the ASV system under different noise-type and signal-to-noise-ratio conditions.

Original languageEnglish
Title of host publication2018 International Conference on Network Infrastructure and Digital Content (IC-NIDC)
Number of pages5
PublisherIEEE
Publication date6 Nov 2018
Pages165-169
Article number8525526
ISBN (Print)978-1-5386-6066-9
ISBN (Electronic)978-1-5386-6067-6
DOIs
Publication statusPublished - 6 Nov 2018
Event2018 International Conference on Network Infrastructure and Digital Content (IC-NIDC) - Guiyang, China
Duration: 22 Aug 201824 Aug 2018

Conference

Conference2018 International Conference on Network Infrastructure and Digital Content (IC-NIDC)
Country/TerritoryChina
CityGuiyang
Period22/08/201824/08/2018
SeriesInternational Conference on Network Infrastructure and Digital Content (IC-NIDC)
ISSN2575-4955

Keywords

  • Bottleneck Features
  • Multi-task Adversarial Training
  • Speaker Verification

Fingerprint

Dive into the research topics of 'Multi-Task Adversarial Network Bottleneck Features for Noise-Robust Speaker Verification'. Together they form a unique fingerprint.

Cite this