Model-Based Distributed Node Clustering and Multi-Speaker Speech Presence Probability Estimation in Wireless Acoustic Sensor Networks

Yingke Zhao, Jesper Kjær Nielsen, Jingdong Chen, Mads Græsbøll Christensen

Research output: Contribution to journalJournal articleResearchpeer-review

13 Citations (Scopus)
122 Downloads (Pure)

Abstract

The knowledge of speech presence probability (SPP) plays an essential role in noise estimation and speech enhancement. Single channel SPP estimation and centralized multi-channel SPP estimation have been well studied. However, how to estimate SPP in wireless acoustic sensor networks (WASNs) remains a great challenge and few efforts can be found in this topic, particularly for WASN applications with multiple speakers. Accordingly, this paper is devoted to the problem of SPP estimation in WASNs and it presents a distributed model-based SPP estimation method for multi-speaker detection, which does not need any fusion center. A distributed k-means clustering method is first used to cluster the nodes into subnetworks, which detect different speakers. For each node in the subnetwork, the speech and noise power spectral densities are estimated locally by using a model-based method, then a distributed SPP estimator is developed and applied in every subnetwork. A distributed consensus method is used to obtain the distributed clustering and the distributed SPP estimation. Simulation results show that the proposed distributed clustering method can assign nodes into subnetworks based on their noisy observations. Moreover, the proposed distributed SPP estimator achieves robust speech detection performance under different noise conditions.

Original languageEnglish
JournalThe Journal of the Acoustical Society of America
Volume147
Issue number6
Pages (from-to)4189-4201
Number of pages13
ISSN0001-4966
DOIs
Publication statusPublished - 2020

Fingerprint

Dive into the research topics of 'Model-Based Distributed Node Clustering and Multi-Speaker Speech Presence Probability Estimation in Wireless Acoustic Sensor Networks'. Together they form a unique fingerprint.

Cite this