Exploring super-gaussianity towards robust information-theoretical time delay estimation

Theodoros Petsatodis, Fotios Talantzis, Christos Boukis, Zheng-Hua Tan, Ramjee Prasad

Research output: Contribution to journalJournal articleResearchpeer-review

3 Citations (Scopus)

Abstract

Time delay estimation (TDE) is a fundamental component of speaker localization and tracking algorithms. Most of the existing systems are based on the generalized cross-correlation method assuming gaussianity of the source. It has been shown that the distribution of speech, captured with far-field microphones, is highly varying, depending on the noise and reverberation conditions. Thus the performance of TDE is expected to fluctuate depending on the underlying assumption for the speech distribution, being also subject to multi-path reflections and competitive background noise. This paper investigates the effect upon TDE when modeling the source signal with different speech-based distributions. An information theoretical TDE method indirectly encapsulating higher order statistics (HOS) formed the basis of this work. The underlying assumption of Gaussian distributed source has been replaced by that of generalized Gaussian distribution that allows evaluating the problem under a larger set of speech-shaped distributions, ranging from Gaussian to Laplacian and Gamma. Closed forms of the univariate and multivariate entropy expressions of the generalized Gaussian distribution are derived to evaluate the TDE. The results indicate that TDE based on the specific criterion is independent of the underlying assumption for the distribution of the source, for the same covariance matrix.
Original languageEnglish
JournalThe Journal of the Acoustical Society of America
Volume133
Issue number3
Pages (from-to)1515-1524
Number of pages10
ISSN0001-4966
DOIs
Publication statusPublished - 2013

Fingerprint

time lag
normal density functions
encapsulating
reverberation
background noise
microphones
cross correlation
far fields
statistics
entropy

Cite this

Petsatodis, Theodoros ; Talantzis, Fotios ; Boukis, Christos ; Tan, Zheng-Hua ; Prasad, Ramjee. / Exploring super-gaussianity towards robust information-theoretical time delay estimation. In: The Journal of the Acoustical Society of America. 2013 ; Vol. 133, No. 3. pp. 1515-1524.
@article{a1f57bb51a3740959b3532d194f6fc3c,
title = "Exploring super-gaussianity towards robust information-theoretical time delay estimation",
abstract = "Time delay estimation (TDE) is a fundamental component of speaker localization and tracking algorithms. Most of the existing systems are based on the generalized cross-correlation method assuming gaussianity of the source. It has been shown that the distribution of speech, captured with far-field microphones, is highly varying, depending on the noise and reverberation conditions. Thus the performance of TDE is expected to fluctuate depending on the underlying assumption for the speech distribution, being also subject to multi-path reflections and competitive background noise. This paper investigates the effect upon TDE when modeling the source signal with different speech-based distributions. An information theoretical TDE method indirectly encapsulating higher order statistics (HOS) formed the basis of this work. The underlying assumption of Gaussian distributed source has been replaced by that of generalized Gaussian distribution that allows evaluating the problem under a larger set of speech-shaped distributions, ranging from Gaussian to Laplacian and Gamma. Closed forms of the univariate and multivariate entropy expressions of the generalized Gaussian distribution are derived to evaluate the TDE. The results indicate that TDE based on the specific criterion is independent of the underlying assumption for the distribution of the source, for the same covariance matrix.",
author = "Theodoros Petsatodis and Fotios Talantzis and Christos Boukis and Zheng-Hua Tan and Ramjee Prasad",
year = "2013",
doi = "10.1121/1.4776193",
language = "English",
volume = "133",
pages = "1515--1524",
journal = "The Journal of the Acoustical Society of America",
issn = "0001-4966",
publisher = "A I P Publishing LLC",
number = "3",

}

Exploring super-gaussianity towards robust information-theoretical time delay estimation. / Petsatodis, Theodoros; Talantzis, Fotios ; Boukis, Christos ; Tan, Zheng-Hua; Prasad, Ramjee.

In: The Journal of the Acoustical Society of America, Vol. 133, No. 3, 2013, p. 1515-1524.

Research output: Contribution to journalJournal articleResearchpeer-review

TY - JOUR

T1 - Exploring super-gaussianity towards robust information-theoretical time delay estimation

AU - Petsatodis, Theodoros

AU - Talantzis, Fotios

AU - Boukis, Christos

AU - Tan, Zheng-Hua

AU - Prasad, Ramjee

PY - 2013

Y1 - 2013

N2 - Time delay estimation (TDE) is a fundamental component of speaker localization and tracking algorithms. Most of the existing systems are based on the generalized cross-correlation method assuming gaussianity of the source. It has been shown that the distribution of speech, captured with far-field microphones, is highly varying, depending on the noise and reverberation conditions. Thus the performance of TDE is expected to fluctuate depending on the underlying assumption for the speech distribution, being also subject to multi-path reflections and competitive background noise. This paper investigates the effect upon TDE when modeling the source signal with different speech-based distributions. An information theoretical TDE method indirectly encapsulating higher order statistics (HOS) formed the basis of this work. The underlying assumption of Gaussian distributed source has been replaced by that of generalized Gaussian distribution that allows evaluating the problem under a larger set of speech-shaped distributions, ranging from Gaussian to Laplacian and Gamma. Closed forms of the univariate and multivariate entropy expressions of the generalized Gaussian distribution are derived to evaluate the TDE. The results indicate that TDE based on the specific criterion is independent of the underlying assumption for the distribution of the source, for the same covariance matrix.

AB - Time delay estimation (TDE) is a fundamental component of speaker localization and tracking algorithms. Most of the existing systems are based on the generalized cross-correlation method assuming gaussianity of the source. It has been shown that the distribution of speech, captured with far-field microphones, is highly varying, depending on the noise and reverberation conditions. Thus the performance of TDE is expected to fluctuate depending on the underlying assumption for the speech distribution, being also subject to multi-path reflections and competitive background noise. This paper investigates the effect upon TDE when modeling the source signal with different speech-based distributions. An information theoretical TDE method indirectly encapsulating higher order statistics (HOS) formed the basis of this work. The underlying assumption of Gaussian distributed source has been replaced by that of generalized Gaussian distribution that allows evaluating the problem under a larger set of speech-shaped distributions, ranging from Gaussian to Laplacian and Gamma. Closed forms of the univariate and multivariate entropy expressions of the generalized Gaussian distribution are derived to evaluate the TDE. The results indicate that TDE based on the specific criterion is independent of the underlying assumption for the distribution of the source, for the same covariance matrix.

U2 - 10.1121/1.4776193

DO - 10.1121/1.4776193

M3 - Journal article

VL - 133

SP - 1515

EP - 1524

JO - The Journal of the Acoustical Society of America

JF - The Journal of the Acoustical Society of America

SN - 0001-4966

IS - 3

ER -