Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting

Ivan Lopez Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John Hansen

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

Abstract

In the context of keyword spotting (KWS), the replacement of handcrafted speech features by learnable features has not yielded superior KWS performance. In this study, we demonstrate that filterbank learning outperforms handcrafted speech features for KWS whenever the number of filterbank channels is severely decreased. Reducing the number of channels might yield certain KWS performance drop, but also a substantial energy consumption reduction, which is key when deploying common always-on KWS on low-resource devices. Experimental results on a noisy version of the Google Speech Commands Dataset show that filterbank learning adapts to noise characteristics to provide a higher degree of robustness to noise, especially when dropout is integrated. Thus, switching from typically used 40-channel log-Mel features to 8channel learned features leads to a relative KWS accuracy loss of only 3.5% while simultaneously achieving a 6.3× energy consumption reduction.
Original languageEnglish
Title of host publicationICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, Proceedings
Number of pages5
PublisherIEEE
Publication dateMay 2023
Article number10095436
ISBN (Print)978-1-7281-6328-4
ISBN (Electronic)978-1-7281-6327-7
DOIs
Publication statusPublished - May 2023
Event48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023 - Rhodes Island, Greece
Duration: 4 Jun 202310 Jun 2023

Conference

Conference48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023
Country/TerritoryGreece
CityRhodes Island
Period04/06/202310/06/2023
SponsorIEEE, IEEE Signal Processing Society
SeriesInternational Conference on Acoustics Speech and Signal Processing (ICASSP)
ISSN1520-6149

Keywords

  • Keyword spotting
  • end-to-end
  • filterbank learning
  • noise robustness
  • small footprint

Fingerprint

Dive into the research topics of 'Filterbank Learning for Noise-Robust Small-Footprint Keyword Spotting'. Together they form a unique fingerprint.

Cite this