Exploring super-gaussianity towards robust information-theoretical time delay estimation

Theodoros Petsatodis, Fotios Talantzis, Christos Boukis, Zheng-Hua Tan, Ramjee Prasad

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

3 Citationer (Scopus)

Resumé

Time delay estimation (TDE) is a fundamental component of speaker localization and tracking algorithms. Most of the existing systems are based on the generalized cross-correlation method assuming gaussianity of the source. It has been shown that the distribution of speech, captured with far-field microphones, is highly varying, depending on the noise and reverberation conditions. Thus the performance of TDE is expected to fluctuate depending on the underlying assumption for the speech distribution, being also subject to multi-path reflections and competitive background noise. This paper investigates the effect upon TDE when modeling the source signal with different speech-based distributions. An information theoretical TDE method indirectly encapsulating higher order statistics (HOS) formed the basis of this work. The underlying assumption of Gaussian distributed source has been replaced by that of generalized Gaussian distribution that allows evaluating the problem under a larger set of speech-shaped distributions, ranging from Gaussian to Laplacian and Gamma. Closed forms of the univariate and multivariate entropy expressions of the generalized Gaussian distribution are derived to evaluate the TDE. The results indicate that TDE based on the specific criterion is independent of the underlying assumption for the distribution of the source, for the same covariance matrix.
OriginalsprogEngelsk
TidsskriftThe Journal of the Acoustical Society of America
Vol/bind133
Udgave nummer3
Sider (fra-til)1515-1524
Antal sider10
ISSN0001-4966
DOI
StatusUdgivet - 2013

Fingerprint

time lag
normal density functions
encapsulating
reverberation
background noise
microphones
cross correlation
far fields
statistics
entropy

Citer dette

Petsatodis, Theodoros ; Talantzis, Fotios ; Boukis, Christos ; Tan, Zheng-Hua ; Prasad, Ramjee. / Exploring super-gaussianity towards robust information-theoretical time delay estimation. I: The Journal of the Acoustical Society of America. 2013 ; Bind 133, Nr. 3. s. 1515-1524.
@article{a1f57bb51a3740959b3532d194f6fc3c,
title = "Exploring super-gaussianity towards robust information-theoretical time delay estimation",
abstract = "Time delay estimation (TDE) is a fundamental component of speaker localization and tracking algorithms. Most of the existing systems are based on the generalized cross-correlation method assuming gaussianity of the source. It has been shown that the distribution of speech, captured with far-field microphones, is highly varying, depending on the noise and reverberation conditions. Thus the performance of TDE is expected to fluctuate depending on the underlying assumption for the speech distribution, being also subject to multi-path reflections and competitive background noise. This paper investigates the effect upon TDE when modeling the source signal with different speech-based distributions. An information theoretical TDE method indirectly encapsulating higher order statistics (HOS) formed the basis of this work. The underlying assumption of Gaussian distributed source has been replaced by that of generalized Gaussian distribution that allows evaluating the problem under a larger set of speech-shaped distributions, ranging from Gaussian to Laplacian and Gamma. Closed forms of the univariate and multivariate entropy expressions of the generalized Gaussian distribution are derived to evaluate the TDE. The results indicate that TDE based on the specific criterion is independent of the underlying assumption for the distribution of the source, for the same covariance matrix.",
author = "Theodoros Petsatodis and Fotios Talantzis and Christos Boukis and Zheng-Hua Tan and Ramjee Prasad",
year = "2013",
doi = "10.1121/1.4776193",
language = "English",
volume = "133",
pages = "1515--1524",
journal = "The Journal of the Acoustical Society of America",
issn = "0001-4966",
publisher = "A I P Publishing LLC",
number = "3",

}

Exploring super-gaussianity towards robust information-theoretical time delay estimation. / Petsatodis, Theodoros; Talantzis, Fotios ; Boukis, Christos ; Tan, Zheng-Hua; Prasad, Ramjee.

I: The Journal of the Acoustical Society of America, Bind 133, Nr. 3, 2013, s. 1515-1524.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

TY - JOUR

T1 - Exploring super-gaussianity towards robust information-theoretical time delay estimation

AU - Petsatodis, Theodoros

AU - Talantzis, Fotios

AU - Boukis, Christos

AU - Tan, Zheng-Hua

AU - Prasad, Ramjee

PY - 2013

Y1 - 2013

N2 - Time delay estimation (TDE) is a fundamental component of speaker localization and tracking algorithms. Most of the existing systems are based on the generalized cross-correlation method assuming gaussianity of the source. It has been shown that the distribution of speech, captured with far-field microphones, is highly varying, depending on the noise and reverberation conditions. Thus the performance of TDE is expected to fluctuate depending on the underlying assumption for the speech distribution, being also subject to multi-path reflections and competitive background noise. This paper investigates the effect upon TDE when modeling the source signal with different speech-based distributions. An information theoretical TDE method indirectly encapsulating higher order statistics (HOS) formed the basis of this work. The underlying assumption of Gaussian distributed source has been replaced by that of generalized Gaussian distribution that allows evaluating the problem under a larger set of speech-shaped distributions, ranging from Gaussian to Laplacian and Gamma. Closed forms of the univariate and multivariate entropy expressions of the generalized Gaussian distribution are derived to evaluate the TDE. The results indicate that TDE based on the specific criterion is independent of the underlying assumption for the distribution of the source, for the same covariance matrix.

AB - Time delay estimation (TDE) is a fundamental component of speaker localization and tracking algorithms. Most of the existing systems are based on the generalized cross-correlation method assuming gaussianity of the source. It has been shown that the distribution of speech, captured with far-field microphones, is highly varying, depending on the noise and reverberation conditions. Thus the performance of TDE is expected to fluctuate depending on the underlying assumption for the speech distribution, being also subject to multi-path reflections and competitive background noise. This paper investigates the effect upon TDE when modeling the source signal with different speech-based distributions. An information theoretical TDE method indirectly encapsulating higher order statistics (HOS) formed the basis of this work. The underlying assumption of Gaussian distributed source has been replaced by that of generalized Gaussian distribution that allows evaluating the problem under a larger set of speech-shaped distributions, ranging from Gaussian to Laplacian and Gamma. Closed forms of the univariate and multivariate entropy expressions of the generalized Gaussian distribution are derived to evaluate the TDE. The results indicate that TDE based on the specific criterion is independent of the underlying assumption for the distribution of the source, for the same covariance matrix.

U2 - 10.1121/1.4776193

DO - 10.1121/1.4776193

M3 - Journal article

VL - 133

SP - 1515

EP - 1524

JO - The Journal of the Acoustical Society of America

JF - The Journal of the Acoustical Society of America

SN - 0001-4966

IS - 3

ER -