TY - JOUR
T1 - Model-Based Distributed Node Clustering and Multi-Speaker Speech Presence Probability Estimation in Wireless Acoustic Sensor Networks
AU - Zhao, Yingke
AU - Nielsen, Jesper Kjær
AU - Chen, Jingdong
AU - Christensen, Mads Græsbøll
PY - 2020
Y1 - 2020
N2 - The knowledge of speech presence probability (SPP) plays an essential role in noise estimation and speech enhancement. Single channel SPP estimation and centralized multi-channel SPP estimation have been well studied. However, how to estimate SPP in wireless acoustic sensor networks (WASNs) remains a great challenge and few efforts can be found in this topic, particularly for WASN applications with multiple speakers. Accordingly, this paper is devoted to the problem of SPP estimation in WASNs and it presents a distributed model-based SPP estimation method for multi-speaker detection, which does not need any fusion center. A distributed k-means clustering method is first used to cluster the nodes into subnetworks, which detect different speakers. For each node in the subnetwork, the speech and noise power spectral densities are estimated locally by using a model-based method, then a distributed SPP estimator is developed and applied in every subnetwork. A distributed consensus method is used to obtain the distributed clustering and the distributed SPP estimation. Simulation results show that the proposed distributed clustering method can assign nodes into subnetworks based on their noisy observations. Moreover, the proposed distributed SPP estimator achieves robust speech detection performance under different noise conditions.
AB - The knowledge of speech presence probability (SPP) plays an essential role in noise estimation and speech enhancement. Single channel SPP estimation and centralized multi-channel SPP estimation have been well studied. However, how to estimate SPP in wireless acoustic sensor networks (WASNs) remains a great challenge and few efforts can be found in this topic, particularly for WASN applications with multiple speakers. Accordingly, this paper is devoted to the problem of SPP estimation in WASNs and it presents a distributed model-based SPP estimation method for multi-speaker detection, which does not need any fusion center. A distributed k-means clustering method is first used to cluster the nodes into subnetworks, which detect different speakers. For each node in the subnetwork, the speech and noise power spectral densities are estimated locally by using a model-based method, then a distributed SPP estimator is developed and applied in every subnetwork. A distributed consensus method is used to obtain the distributed clustering and the distributed SPP estimation. Simulation results show that the proposed distributed clustering method can assign nodes into subnetworks based on their noisy observations. Moreover, the proposed distributed SPP estimator achieves robust speech detection performance under different noise conditions.
UR - http://www.scopus.com/inward/record.url?scp=85087657734&partnerID=8YFLogxK
U2 - 10.1121/10.0001449
DO - 10.1121/10.0001449
M3 - Journal article
SN - 0001-4966
VL - 147
SP - 4189
EP - 4201
JO - The Journal of the Acoustical Society of America
JF - The Journal of the Acoustical Society of America
IS - 6
ER -