High-Order Sparse Linear Predictors for Audio Processing

Daniele Giacobello, Toon van Waterschoot, Mads Græsbøll Christensen, Søren Holdt Jensen, Marc Moonen

Publikation: Bidrag til tidsskriftKonferenceartikel i tidsskriftForskningpeer review

7 Citationer (Scopus)
291 Downloads (Pure)

Resumé

Linear prediction has generally failed to make a breakthrough in audio processing, as it has done in speech processing. This is mostly due to its poor modeling performance, since an audio signal is usually an ensemble of different sources. Nevertheless, linear prediction comes with a whole set of interesting features that make the idea of using it in audio processing not far fetched, e.g., the strong ability of modeling the spectral peaks that play a dominant role in perception. In this paper, we provide some preliminary conjectures and experiments on the use of high-order sparse linear predictors in audio processing. These predictors, successfully implemented in modeling the short-term and long-term redundancies present in speech signals, will be used to model tonal audio signals, both monophonic and polyphonic. We will show how the sparse predictors are able to model efficiently the different components of the spectrum of an audio signal, i.e., its tonal behavior and the spectral
envelope characteristic.
OriginalsprogEngelsk
TidsskriftProceedings of the European Signal Processing Conference
Vol/bind2010
Sider (fra-til)234-238
ISSN2076-1465
StatusUdgivet - 2010
BegivenhedEuropean Signal Processing Conference 2010 - Aalborg, Danmark
Varighed: 23 aug. 201027 aug. 2010

Konference

KonferenceEuropean Signal Processing Conference 2010
LandDanmark
ByAalborg
Periode23/08/201027/08/2010

Fingerprint

Processing
Speech processing
Redundancy
Experiments

Citer dette

@inproceedings{2fc126a1f51a4e88ae3bf093722cd41d,
title = "High-Order Sparse Linear Predictors for Audio Processing",
abstract = "Linear prediction has generally failed to make a breakthrough in audio processing, as it has done in speech processing. This is mostly due to its poor modeling performance, since an audio signal is usually an ensemble of different sources. Nevertheless, linear prediction comes with a whole set of interesting features that make the idea of using it in audio processing not far fetched, e.g., the strong ability of modeling the spectral peaks that play a dominant role in perception. In this paper, we provide some preliminary conjectures and experiments on the use of high-order sparse linear predictors in audio processing. These predictors, successfully implemented in modeling the short-term and long-term redundancies present in speech signals, will be used to model tonal audio signals, both monophonic and polyphonic. We will show how the sparse predictors are able to model efficiently the different components of the spectrum of an audio signal, i.e., its tonal behavior and the spectralenvelope characteristic.",
author = "Daniele Giacobello and {van Waterschoot}, Toon and Christensen, {Mads Gr{\ae}sb{\o}ll} and Jensen, {S{\o}ren Holdt} and Marc Moonen",
year = "2010",
language = "English",
volume = "2010",
pages = "234--238",
journal = "Proceedings of the European Signal Processing Conference",
issn = "2076-1465",
publisher = "European Association for Signal Processing (EURASIP)",

}

High-Order Sparse Linear Predictors for Audio Processing. / Giacobello, Daniele; van Waterschoot, Toon; Christensen, Mads Græsbøll; Jensen, Søren Holdt; Moonen, Marc.

I: Proceedings of the European Signal Processing Conference, Bind 2010, 2010, s. 234-238.

Publikation: Bidrag til tidsskriftKonferenceartikel i tidsskriftForskningpeer review

TY - GEN

T1 - High-Order Sparse Linear Predictors for Audio Processing

AU - Giacobello, Daniele

AU - van Waterschoot, Toon

AU - Christensen, Mads Græsbøll

AU - Jensen, Søren Holdt

AU - Moonen, Marc

PY - 2010

Y1 - 2010

N2 - Linear prediction has generally failed to make a breakthrough in audio processing, as it has done in speech processing. This is mostly due to its poor modeling performance, since an audio signal is usually an ensemble of different sources. Nevertheless, linear prediction comes with a whole set of interesting features that make the idea of using it in audio processing not far fetched, e.g., the strong ability of modeling the spectral peaks that play a dominant role in perception. In this paper, we provide some preliminary conjectures and experiments on the use of high-order sparse linear predictors in audio processing. These predictors, successfully implemented in modeling the short-term and long-term redundancies present in speech signals, will be used to model tonal audio signals, both monophonic and polyphonic. We will show how the sparse predictors are able to model efficiently the different components of the spectrum of an audio signal, i.e., its tonal behavior and the spectralenvelope characteristic.

AB - Linear prediction has generally failed to make a breakthrough in audio processing, as it has done in speech processing. This is mostly due to its poor modeling performance, since an audio signal is usually an ensemble of different sources. Nevertheless, linear prediction comes with a whole set of interesting features that make the idea of using it in audio processing not far fetched, e.g., the strong ability of modeling the spectral peaks that play a dominant role in perception. In this paper, we provide some preliminary conjectures and experiments on the use of high-order sparse linear predictors in audio processing. These predictors, successfully implemented in modeling the short-term and long-term redundancies present in speech signals, will be used to model tonal audio signals, both monophonic and polyphonic. We will show how the sparse predictors are able to model efficiently the different components of the spectrum of an audio signal, i.e., its tonal behavior and the spectralenvelope characteristic.

M3 - Conference article in Journal

VL - 2010

SP - 234

EP - 238

JO - Proceedings of the European Signal Processing Conference

JF - Proceedings of the European Signal Processing Conference

SN - 2076-1465

ER -