Abstract
In this paper, we propose a method for finding the number of sources and their parameters from stereophonic mixtures. The method is based on clustering of narrowband interaural level and time differences for an unknown number of sources and uses an optimal segmentation on which the clustering is based. The parameter distribution, for both individual seg- ments and across segments that comprise the entire signal, is modelled as a Gaussian mixture. For each segment parame- ters are estimated using a minimum description length algo- rithm for mixtures based on the expectation-maximization al- gorithm. The generalized variance and degree of membership of the Gaussian components across segments is used as a ba- sis for the proposed selection of clusters amongst candidates. Simulations on synthetic and real audio shows promising re- sults for source parameter estimation and number of sources estimated across segments. The optimal segmentation shows an improvement for parameter estimation success rate, com- pared to the uniform segmentation.
Original language | English |
---|---|
Title of host publication | IEEE International Conference on Acoustics, Speech and Signal Processing |
Number of pages | 5 |
Publisher | IEEE |
Publication date | 10 Sept 2018 |
Pages | 426-430 |
Article number | 8462522 |
ISBN (Print) | 978-1-5386-4659-5 |
ISBN (Electronic) | 978-1-5386-4658-8 |
DOIs | |
Publication status | Published - 10 Sept 2018 |
Event | 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) - Calgary, Canada Duration: 15 Apr 2018 → 20 Apr 2018 https://2018.ieeeicassp.org/ |
Conference
Conference | 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
---|---|
Country/Territory | Canada |
City | Calgary |
Period | 15/04/2018 → 20/04/2018 |
Internet address |
Series | I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings |
---|---|
ISSN | 1520-6149 |
Keywords
- Audio analysis
- Audio clustering
- Multi-channel processing
- Signal segmentation
- Source localisation