Model-Based Distributed Node Clustering and Multi-Speaker Speech Presence Probability Estimation in Wireless Acoustic Sensor Networks

Yingke Zhao; Jesper Kjær Nielsen; Jingdong Chen; Mads Græsbøll Christensen

doi:10.1121/10.0001449

Model-Based Distributed Node Clustering and Multi-Speaker Speech Presence Probability Estimation in Wireless Acoustic Sensor Networks

Yingke Zhao, Jesper Kjær Nielsen, Jingdong Chen, Mads Græsbøll Christensen

Research output: Contribution to journal › Journal article › Research › peer-review

13 Citations (Scopus)

122 Downloads (Pure)

Abstract

The knowledge of speech presence probability (SPP) plays an essential role in noise estimation and speech enhancement. Single channel SPP estimation and centralized multi-channel SPP estimation have been well studied. However, how to estimate SPP in wireless acoustic sensor networks (WASNs) remains a great challenge and few efforts can be found in this topic, particularly for WASN applications with multiple speakers. Accordingly, this paper is devoted to the problem of SPP estimation in WASNs and it presents a distributed model-based SPP estimation method for multi-speaker detection, which does not need any fusion center. A distributed k-means clustering method is first used to cluster the nodes into subnetworks, which detect different speakers. For each node in the subnetwork, the speech and noise power spectral densities are estimated locally by using a model-based method, then a distributed SPP estimator is developed and applied in every subnetwork. A distributed consensus method is used to obtain the distributed clustering and the distributed SPP estimation. Simulation results show that the proposed distributed clustering method can assign nodes into subnetworks based on their noisy observations. Moreover, the proposed distributed SPP estimator achieves robust speech detection performance under different noise conditions.

Original language	English
Journal	The Journal of the Acoustical Society of America
Volume	147
Issue number	6
Pages (from-to)	4189-4201
Number of pages	13
ISSN	0001-4966
DOIs	https://doi.org/10.1121/10.0001449
Publication status	Published - 2020

Access to Document

10.1121/10.0001449

ManuscriptSubmitted manuscript, 881 KB

AUB Link

Search for the material in Aalborg University Library's search engine

Cite this

@article{16825df3d94241b2945b11c0d40bdfad,

title = "Model-Based Distributed Node Clustering and Multi-Speaker Speech Presence Probability Estimation in Wireless Acoustic Sensor Networks",

abstract = "The knowledge of speech presence probability (SPP) plays an essential role in noise estimation and speech enhancement. Single channel SPP estimation and centralized multi-channel SPP estimation have been well studied. However, how to estimate SPP in wireless acoustic sensor networks (WASNs) remains a great challenge and few efforts can be found in this topic, particularly for WASN applications with multiple speakers. Accordingly, this paper is devoted to the problem of SPP estimation in WASNs and it presents a distributed model-based SPP estimation method for multi-speaker detection, which does not need any fusion center. A distributed k-means clustering method is first used to cluster the nodes into subnetworks, which detect different speakers. For each node in the subnetwork, the speech and noise power spectral densities are estimated locally by using a model-based method, then a distributed SPP estimator is developed and applied in every subnetwork. A distributed consensus method is used to obtain the distributed clustering and the distributed SPP estimation. Simulation results show that the proposed distributed clustering method can assign nodes into subnetworks based on their noisy observations. Moreover, the proposed distributed SPP estimator achieves robust speech detection performance under different noise conditions.",

author = "Yingke Zhao and Nielsen, {Jesper Kj{\ae}r} and Jingdong Chen and Christensen, {Mads Gr{\ae}sb{\o}ll}",

year = "2020",

doi = "10.1121/10.0001449",

language = "English",

volume = "147",

pages = "4189--4201",

journal = "The Journal of the Acoustical Society of America",

issn = "0001-4966",

publisher = "A I P Publishing LLC",

number = "6",

}

Model-Based Distributed Node Clustering and Multi-Speaker Speech Presence Probability Estimation in Wireless Acoustic Sensor Networks. / Zhao, Yingke; Nielsen, Jesper Kjær; Chen, Jingdong et al.
In: The Journal of the Acoustical Society of America, Vol. 147, No. 6, 2020, p. 4189-4201.

Research output: Contribution to journal › Journal article › Research › peer-review

TY - JOUR

T1 - Model-Based Distributed Node Clustering and Multi-Speaker Speech Presence Probability Estimation in Wireless Acoustic Sensor Networks

AU - Zhao, Yingke

AU - Nielsen, Jesper Kjær

AU - Chen, Jingdong

AU - Christensen, Mads Græsbøll

PY - 2020

Y1 - 2020

N2 - The knowledge of speech presence probability (SPP) plays an essential role in noise estimation and speech enhancement. Single channel SPP estimation and centralized multi-channel SPP estimation have been well studied. However, how to estimate SPP in wireless acoustic sensor networks (WASNs) remains a great challenge and few efforts can be found in this topic, particularly for WASN applications with multiple speakers. Accordingly, this paper is devoted to the problem of SPP estimation in WASNs and it presents a distributed model-based SPP estimation method for multi-speaker detection, which does not need any fusion center. A distributed k-means clustering method is first used to cluster the nodes into subnetworks, which detect different speakers. For each node in the subnetwork, the speech and noise power spectral densities are estimated locally by using a model-based method, then a distributed SPP estimator is developed and applied in every subnetwork. A distributed consensus method is used to obtain the distributed clustering and the distributed SPP estimation. Simulation results show that the proposed distributed clustering method can assign nodes into subnetworks based on their noisy observations. Moreover, the proposed distributed SPP estimator achieves robust speech detection performance under different noise conditions.

AB - The knowledge of speech presence probability (SPP) plays an essential role in noise estimation and speech enhancement. Single channel SPP estimation and centralized multi-channel SPP estimation have been well studied. However, how to estimate SPP in wireless acoustic sensor networks (WASNs) remains a great challenge and few efforts can be found in this topic, particularly for WASN applications with multiple speakers. Accordingly, this paper is devoted to the problem of SPP estimation in WASNs and it presents a distributed model-based SPP estimation method for multi-speaker detection, which does not need any fusion center. A distributed k-means clustering method is first used to cluster the nodes into subnetworks, which detect different speakers. For each node in the subnetwork, the speech and noise power spectral densities are estimated locally by using a model-based method, then a distributed SPP estimator is developed and applied in every subnetwork. A distributed consensus method is used to obtain the distributed clustering and the distributed SPP estimation. Simulation results show that the proposed distributed clustering method can assign nodes into subnetworks based on their noisy observations. Moreover, the proposed distributed SPP estimator achieves robust speech detection performance under different noise conditions.

UR - http://www.scopus.com/inward/record.url?scp=85087657734&partnerID=8YFLogxK

U2 - 10.1121/10.0001449

DO - 10.1121/10.0001449

M3 - Journal article

SN - 0001-4966

VL - 147

SP - 4189

EP - 4201

JO - The Journal of the Acoustical Society of America

JF - The Journal of the Acoustical Society of America

IS - 6

ER -

Model-Based Distributed Node Clustering and Multi-Speaker Speech Presence Probability Estimation in Wireless Acoustic Sensor Networks

Abstract

Access to Document

AUB Link

Other files and links

Fingerprint

Cite this