Estimation of Fundamental Frequencies in Stereophonic Music Mixtures

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

Resumé

In this paper, a method for multi-pitch estimation of stereophonic mixtures of harmonic signals, e.g., instrument recordings, is presented. The proposed method is based on a signal model that includes the panning parameters of the sources in a stereophonic mixture, such as those applied artificially in a recording studio. If the sources in a mixture have different panning parameters, this diversity can be used to simplify the pitch estimation problem. The mixing parameters of the sources might be shared, resulting in a multi-pitch estimation problem, which is solved using an approach based on an expectation-maximization algorithm for Gaussian sources, where the fundamental frequencies and model orders are estimated jointly. The fundamental frequencies may be related, resulting in overlapping harmonics, complicating the estimation of the parameters. A codebook of harmonic amplitude vectors is trained on recordings of instruments playing single notes, and used when estimating the amplitudes of the mixture components. The proposed method is evaluated using stereophonic mixtures of instrument recordings and is compared to state-of-the-art transcription and multi-pitch estimation methods. Experiments show an increase in performance when knowledge about the panning parameters is taken into account. The proposed method provides a full parameterization of the components of the observed signal. Possible applications include instrument tuning, audio editing tools, modification of harmonic mixture components, and audio effects.

OriginalsprogEngelsk
Artikelnummer8510905
TidsskriftIEEE/ACM Transactions on Audio, Speech, and Language Processing
Vol/bind27
Udgave nummer2
Sider (fra-til)296-310
Antal sider15
ISSN2329-9290
DOI
StatusUdgivet - 1 feb. 2019

Fingerprint

stereophonics
music
Recording instruments
recording instruments
harmonics
recording
editing
Studios
Transcription
Parameterization
parameterization
estimating
Tuning
tuning

Citer dette

@article{721629ebc47145518becff843302f7cc,
title = "Estimation of Fundamental Frequencies in Stereophonic Music Mixtures",
abstract = "In this paper, a method for multi-pitch estimation of stereophonic mixtures of harmonic signals, e.g., instrument recordings, is presented. The proposed method is based on a signal model that includes the panning parameters of the sources in a stereophonic mixture, such as those applied artificially in a recording studio. If the sources in a mixture have different panning parameters, this diversity can be used to simplify the pitch estimation problem. The mixing parameters of the sources might be shared, resulting in a multi-pitch estimation problem, which is solved using an approach based on an expectation-maximization algorithm for Gaussian sources, where the fundamental frequencies and model orders are estimated jointly. The fundamental frequencies may be related, resulting in overlapping harmonics, complicating the estimation of the parameters. A codebook of harmonic amplitude vectors is trained on recordings of instruments playing single notes, and used when estimating the amplitudes of the mixture components. The proposed method is evaluated using stereophonic mixtures of instrument recordings and is compared to state-of-the-art transcription and multi-pitch estimation methods. Experiments show an increase in performance when knowledge about the panning parameters is taken into account. The proposed method provides a full parameterization of the components of the observed signal. Possible applications include instrument tuning, audio editing tools, modification of harmonic mixture components, and audio effects.",
keywords = "Multi-pitch estimation, model selection, multi-channel pitch estimation, music information retrieval, sterephonic signal analysis, vector quantization",
author = "Hansen, {Martin Weiss} and Jensen, {Jesper Rindom} and Christensen, {Mads Gr{\ae}sb{\o}ll}",
year = "2019",
month = "2",
day = "1",
doi = "10.1109/TASLP.2018.2878384",
language = "English",
volume = "27",
pages = "296--310",
journal = "IEEE/ACM Transactions on Audio, Speech, and Language Processing",
issn = "2329-9290",
publisher = "IEEE Signal Processing Society",
number = "2",

}

Estimation of Fundamental Frequencies in Stereophonic Music Mixtures. / Hansen, Martin Weiss; Jensen, Jesper Rindom; Christensen, Mads Græsbøll.

I: IEEE/ACM Transactions on Audio, Speech, and Language Processing, Bind 27, Nr. 2, 8510905, 01.02.2019, s. 296-310.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningpeer review

TY - JOUR

T1 - Estimation of Fundamental Frequencies in Stereophonic Music Mixtures

AU - Hansen, Martin Weiss

AU - Jensen, Jesper Rindom

AU - Christensen, Mads Græsbøll

PY - 2019/2/1

Y1 - 2019/2/1

N2 - In this paper, a method for multi-pitch estimation of stereophonic mixtures of harmonic signals, e.g., instrument recordings, is presented. The proposed method is based on a signal model that includes the panning parameters of the sources in a stereophonic mixture, such as those applied artificially in a recording studio. If the sources in a mixture have different panning parameters, this diversity can be used to simplify the pitch estimation problem. The mixing parameters of the sources might be shared, resulting in a multi-pitch estimation problem, which is solved using an approach based on an expectation-maximization algorithm for Gaussian sources, where the fundamental frequencies and model orders are estimated jointly. The fundamental frequencies may be related, resulting in overlapping harmonics, complicating the estimation of the parameters. A codebook of harmonic amplitude vectors is trained on recordings of instruments playing single notes, and used when estimating the amplitudes of the mixture components. The proposed method is evaluated using stereophonic mixtures of instrument recordings and is compared to state-of-the-art transcription and multi-pitch estimation methods. Experiments show an increase in performance when knowledge about the panning parameters is taken into account. The proposed method provides a full parameterization of the components of the observed signal. Possible applications include instrument tuning, audio editing tools, modification of harmonic mixture components, and audio effects.

AB - In this paper, a method for multi-pitch estimation of stereophonic mixtures of harmonic signals, e.g., instrument recordings, is presented. The proposed method is based on a signal model that includes the panning parameters of the sources in a stereophonic mixture, such as those applied artificially in a recording studio. If the sources in a mixture have different panning parameters, this diversity can be used to simplify the pitch estimation problem. The mixing parameters of the sources might be shared, resulting in a multi-pitch estimation problem, which is solved using an approach based on an expectation-maximization algorithm for Gaussian sources, where the fundamental frequencies and model orders are estimated jointly. The fundamental frequencies may be related, resulting in overlapping harmonics, complicating the estimation of the parameters. A codebook of harmonic amplitude vectors is trained on recordings of instruments playing single notes, and used when estimating the amplitudes of the mixture components. The proposed method is evaluated using stereophonic mixtures of instrument recordings and is compared to state-of-the-art transcription and multi-pitch estimation methods. Experiments show an increase in performance when knowledge about the panning parameters is taken into account. The proposed method provides a full parameterization of the components of the observed signal. Possible applications include instrument tuning, audio editing tools, modification of harmonic mixture components, and audio effects.

KW - Multi-pitch estimation

KW - model selection

KW - multi-channel pitch estimation

KW - music information retrieval

KW - sterephonic signal analysis

KW - vector quantization

UR - http://www.scopus.com/inward/record.url?scp=85055685859&partnerID=8YFLogxK

U2 - 10.1109/TASLP.2018.2878384

DO - 10.1109/TASLP.2018.2878384

M3 - Journal article

VL - 27

SP - 296

EP - 310

JO - IEEE/ACM Transactions on Audio, Speech, and Language Processing

JF - IEEE/ACM Transactions on Audio, Speech, and Language Processing

SN - 2329-9290

IS - 2

M1 - 8510905

ER -