Estimation of Source Panning Parameters and Segmentation of Stereophonic Mixtures

Jacob Møller Hjerrild, Mads Græsbøll Christensen

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

2 Citations (Scopus)
363 Downloads (Pure)

Abstract

In this paper, we propose a method for finding the number of sources and their parameters from stereophonic mixtures. The method is based on clustering of narrowband interaural level and time differences for an unknown number of sources and uses an optimal segmentation on which the clustering is based. The parameter distribution, for both individual seg- ments and across segments that comprise the entire signal, is modelled as a Gaussian mixture. For each segment parame- ters are estimated using a minimum description length algo- rithm for mixtures based on the expectation-maximization al- gorithm. The generalized variance and degree of membership of the Gaussian components across segments is used as a ba- sis for the proposed selection of clusters amongst candidates. Simulations on synthetic and real audio shows promising re- sults for source parameter estimation and number of sources estimated across segments. The optimal segmentation shows an improvement for parameter estimation success rate, com- pared to the uniform segmentation.
Original languageEnglish
Title of host publicationIEEE International Conference on Acoustics, Speech and Signal Processing
Number of pages5
PublisherIEEE
Publication date10 Sept 2018
Pages426-430
Article number8462522
ISBN (Print)978-1-5386-4659-5
ISBN (Electronic)978-1-5386-4658-8
DOIs
Publication statusPublished - 10 Sept 2018
Event2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Calgary, Canada
Duration: 15 Apr 201820 Apr 2018
https://2018.ieeeicassp.org/

Conference

Conference2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Country/TerritoryCanada
CityCalgary
Period15/04/201820/04/2018
Internet address
SeriesI E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings
ISSN1520-6149

Keywords

  • Audio analysis
  • Audio clustering
  • Multi-channel processing
  • Signal segmentation
  • Source localisation

Fingerprint

Dive into the research topics of 'Estimation of Source Panning Parameters and Segmentation of Stereophonic Mixtures'. Together they form a unique fingerprint.

Cite this