A Phase Vocoder Based on Nonstationary Gabor Frames

Emil Solsbæk Ottosen, Monika Dörfler

Research output: Contribution to journalJournal articleResearchpeer-review

10 Citations (Scopus)

Abstract

We propose a new algorithm for time stretching
music signals based on the theory of nonstationary Gabor
frames (NSGFs). The algorithm extends the techniques of the
classical phase vocoder (PV) by incorporating adaptive timefrequency
(TF) representations and adaptive phase locking. The
adaptive TF representations imply good time resolution for the
onsets of attack transients and good frequency resolution for
the sinusoidal components. We estimate the phase values only
at peak channels and the remaining phases are then locked to
the values of the peaks in an adaptive manner. During attack
transients we keep the stretch factor equal to one and we propose
a new strategy for determining which channels are relevant
for reinitializing the corresponding phase values. In contrast to
previously published algorithms we use a non-uniform NSGF to
obtain a low redundancy of the corresponding TF representation.
We show that with just three times as many TF coefficients
as signal samples, artifacts such as phasiness and transient
smearing can be greatly reduced compared to the classical PV.
The proposed algorithm is tested on both synthetic and real
world signals and compared with state of the art algorithms in
a reproducible manner.
Original languageEnglish
JournalI E E E Transactions on Audio, Speech and Language Processing
Volume25
Issue number11
Pages (from-to)2199-2208
Number of pages10
ISSN1558-7916
DOIs
Publication statusPublished - Sept 2017

Keywords

  • Phase vocoder
  • nonstationary Gabor frames
  • Time-frequency analysis
  • Gabor theory
  • Time stretching

Fingerprint

Dive into the research topics of 'A Phase Vocoder Based on Nonstationary Gabor Frames'. Together they form a unique fingerprint.

Cite this