Parametric Multi-Channel Separation and Re-Panning of Harmonics Sources

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

97 Downloads (Pure)

Abstract

In this paper, a method for separating stereophonic mixtures into their harmonic constituents is proposed. The method is based on a harmonic signal model. An observed mixture is decomposed by first estimating the panning parameters of the sources, and then estimating the fundamental frequencies and the amplitudes of the harmonic components. The number of sources and their panning parameters are estimated using an approach based on clustering of narrowband interaural level and time differences. The panning parameter distribution is modelled as a Gaussian mixture and the generalized variance is used for selecting the number of sources. The fundamental frequencies of the sources are estimated using an iterative approach. To enforce spectral smoothness when estimat- ing the fundamental frequencies, a codebook of magnitude ampli- tudes is used to limit the amount of energy assigned to each har- monic. The source models are used to form Wiener filters which are used to reconstruct the sources. The proposed method can be used for source re-panning (demonstration given), remixing, and multi-channel upmixing, e.g. for hi-fi systems with multiple loud- speakers.
Original languageEnglish
Title of host publicationProceedings of the 21st International Conference on Digital Audio Effects (DAFx-18), Aveiro, Portugal, September 4–8, 2018
Number of pages8
PublisherUniversidade de Aveiro, Comissão Editorial
Publication date4 Sep 2018
Publication statusPublished - 4 Sep 2018
Event21st International Conference on Digital Audio Effects (Dafx 2018) - Aveiro, Portugal
Duration: 4 Sep 20188 Sep 2018

Conference

Conference21st International Conference on Digital Audio Effects (Dafx 2018)
CountryPortugal
CityAveiro
Period04/09/201808/09/2018
SeriesInternational Conference on Digital Audio Effects
ISSN2413-6700

Fingerprint

Loudspeakers
Demonstrations

Cite this

Hansen, M. W., Hjerrild, J. M., Christensen, M. G., & Kjeldskov, J. (2018). Parametric Multi-Channel Separation and Re-Panning of Harmonics Sources. In Proceedings of the 21st International Conference on Digital Audio Effects (DAFx-18), Aveiro, Portugal, September 4–8, 2018 Universidade de Aveiro, Comissão Editorial. International Conference on Digital Audio Effects
Hansen, Martin Weiss ; Hjerrild, Jacob Møller ; Christensen, Mads Græsbøll ; Kjeldskov, Jesper. / Parametric Multi-Channel Separation and Re-Panning of Harmonics Sources. Proceedings of the 21st International Conference on Digital Audio Effects (DAFx-18), Aveiro, Portugal, September 4–8, 2018. Universidade de Aveiro, Comissão Editorial, 2018. (International Conference on Digital Audio Effects).
@inproceedings{9ec4eb4bd9b14d78bc628bf5caad1c81,
title = "Parametric Multi-Channel Separation and Re-Panning of Harmonics Sources",
abstract = "In this paper, a method for separating stereophonic mixtures into their harmonic constituents is proposed. The method is based on a harmonic signal model. An observed mixture is decomposed by first estimating the panning parameters of the sources, and then estimating the fundamental frequencies and the amplitudes of the harmonic components. The number of sources and their panning parameters are estimated using an approach based on clustering of narrowband interaural level and time differences. The panning parameter distribution is modelled as a Gaussian mixture and the generalized variance is used for selecting the number of sources. The fundamental frequencies of the sources are estimated using an iterative approach. To enforce spectral smoothness when estimat- ing the fundamental frequencies, a codebook of magnitude ampli- tudes is used to limit the amount of energy assigned to each har- monic. The source models are used to form Wiener filters which are used to reconstruct the sources. The proposed method can be used for source re-panning (demonstration given), remixing, and multi-channel upmixing, e.g. for hi-fi systems with multiple loud- speakers.",
keywords = "Audio signal processing, Source Localization, Source Separation, Multi-pitch estimation",
author = "Hansen, {Martin Weiss} and Hjerrild, {Jacob M{\o}ller} and Christensen, {Mads Gr{\ae}sb{\o}ll} and Jesper Kjeldskov",
year = "2018",
month = "9",
day = "4",
language = "English",
series = "International Conference on Digital Audio Effects",
booktitle = "Proceedings of the 21st International Conference on Digital Audio Effects (DAFx-18), Aveiro, Portugal, September 4–8, 2018",
publisher = "Universidade de Aveiro, Comiss{\~a}o Editorial",

}

Hansen, MW, Hjerrild, JM, Christensen, MG & Kjeldskov, J 2018, Parametric Multi-Channel Separation and Re-Panning of Harmonics Sources. in Proceedings of the 21st International Conference on Digital Audio Effects (DAFx-18), Aveiro, Portugal, September 4–8, 2018. Universidade de Aveiro, Comissão Editorial, International Conference on Digital Audio Effects, 21st International Conference on Digital Audio Effects (Dafx 2018), Aveiro, Portugal, 04/09/2018.

Parametric Multi-Channel Separation and Re-Panning of Harmonics Sources. / Hansen, Martin Weiss; Hjerrild, Jacob Møller; Christensen, Mads Græsbøll; Kjeldskov, Jesper.

Proceedings of the 21st International Conference on Digital Audio Effects (DAFx-18), Aveiro, Portugal, September 4–8, 2018. Universidade de Aveiro, Comissão Editorial, 2018. (International Conference on Digital Audio Effects).

Research output: Contribution to book/anthology/report/conference proceedingArticle in proceedingResearchpeer-review

TY - GEN

T1 - Parametric Multi-Channel Separation and Re-Panning of Harmonics Sources

AU - Hansen, Martin Weiss

AU - Hjerrild, Jacob Møller

AU - Christensen, Mads Græsbøll

AU - Kjeldskov, Jesper

PY - 2018/9/4

Y1 - 2018/9/4

N2 - In this paper, a method for separating stereophonic mixtures into their harmonic constituents is proposed. The method is based on a harmonic signal model. An observed mixture is decomposed by first estimating the panning parameters of the sources, and then estimating the fundamental frequencies and the amplitudes of the harmonic components. The number of sources and their panning parameters are estimated using an approach based on clustering of narrowband interaural level and time differences. The panning parameter distribution is modelled as a Gaussian mixture and the generalized variance is used for selecting the number of sources. The fundamental frequencies of the sources are estimated using an iterative approach. To enforce spectral smoothness when estimat- ing the fundamental frequencies, a codebook of magnitude ampli- tudes is used to limit the amount of energy assigned to each har- monic. The source models are used to form Wiener filters which are used to reconstruct the sources. The proposed method can be used for source re-panning (demonstration given), remixing, and multi-channel upmixing, e.g. for hi-fi systems with multiple loud- speakers.

AB - In this paper, a method for separating stereophonic mixtures into their harmonic constituents is proposed. The method is based on a harmonic signal model. An observed mixture is decomposed by first estimating the panning parameters of the sources, and then estimating the fundamental frequencies and the amplitudes of the harmonic components. The number of sources and their panning parameters are estimated using an approach based on clustering of narrowband interaural level and time differences. The panning parameter distribution is modelled as a Gaussian mixture and the generalized variance is used for selecting the number of sources. The fundamental frequencies of the sources are estimated using an iterative approach. To enforce spectral smoothness when estimat- ing the fundamental frequencies, a codebook of magnitude ampli- tudes is used to limit the amount of energy assigned to each har- monic. The source models are used to form Wiener filters which are used to reconstruct the sources. The proposed method can be used for source re-panning (demonstration given), remixing, and multi-channel upmixing, e.g. for hi-fi systems with multiple loud- speakers.

KW - Audio signal processing

KW - Source Localization

KW - Source Separation

KW - Multi-pitch estimation

M3 - Article in proceeding

T3 - International Conference on Digital Audio Effects

BT - Proceedings of the 21st International Conference on Digital Audio Effects (DAFx-18), Aveiro, Portugal, September 4–8, 2018

PB - Universidade de Aveiro, Comissão Editorial

ER -

Hansen MW, Hjerrild JM, Christensen MG, Kjeldskov J. Parametric Multi-Channel Separation and Re-Panning of Harmonics Sources. In Proceedings of the 21st International Conference on Digital Audio Effects (DAFx-18), Aveiro, Portugal, September 4–8, 2018. Universidade de Aveiro, Comissão Editorial. 2018. (International Conference on Digital Audio Effects).