Broadband DOA Estimation using Learning-based Optimal Statistics Estimates

Qinzheng Zhang, Haiyan Wang, Jesper Rindom Jensen, Yingying Zhu, Shuai Tao, Mads Græsbøll Christensen

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Abstract

For accurate Direction of Arrival (DOA) estimation in challenging high reverberation and low signal-to-noise ratio (SNR) scenarios, various deep learning (DL) techniques have been developed to incorporate and enhance existing algorithms. However, addressing the challenges of enhancing network manipulability, constructing efficient learning features, and integrating algorithms in a rational manner remains a set of significant hurdles. In this paper, we use DL to obtain the Speech Presence Probability (SPP) to construct the optimal statistics estimates, which are then combined with traditional algorithms to achieve accurate DOA estimation. Specifically, we explore the application of the a posteriori SPP in DOA estimation, design a reverberation separation model for practical scenarios, and derive and validate a new computational equation for SPP under this model. Besides, we propose a frequency bin-wise network structure to improve network fitting efficiency and construct input features accordingly. Moreover, by adopting a combined structure, we avoid full-angle network feature training and instead train on partial angles under deliberate subset classification. We then evaluate the DOA estimation performance for the entire direction range with fine resolution using this approach. Simulation results demonstrate that the proposed method requires smaller data sets compared to end-to-end deep learning algorithms. Furthermore, the results validate that the proposed method outperforms both DL-based end-to-end approaches and traditional full-band approaches in terms of accuracy and error rate across various reverberation and signal-to-noise ratio conditions.
OriginalsprogEngelsk
Artikelnummer10776021
TidsskriftIEEE Sensors Journal
ISSN2379-9153
DOI
StatusE-pub ahead of print - 3 dec. 2024

Emneord

  • Accuracy
  • Covariance matrices
  • Deep learning
  • Direction-of-arrival estimation
  • Estimation
  • Feature extraction
  • Frequency estimation
  • Reverberation
  • Sensors
  • Training

Fingeraftryk

Dyk ned i forskningsemnerne om 'Broadband DOA Estimation using Learning-based Optimal Statistics Estimates'. Sammen danner de et unikt fingeraftryk.

Citationsformater