Variational Inference over Nonstationary Data Streams for Exponential Family Models

Andres R. Masegosa, Darío Ramos-López, Antonio Salmerón Cerdán, Helge Langseth, Thomas Dyhre Nielsen

Research output: Contribution to journalJournal articleResearchpeer-review

9 Citations (Scopus)
21 Downloads (Pure)

Abstract

In many modern data analysis problems, the available data is not static but, instead, comes in a streaming fashion. Performing Bayesian inference on a data stream is challenging for several reasons. First, it requires continuous model updating and the ability to handle a posterior distribution conditioned on an unbounded data set. Secondly, the underlying data distribution may drift from one time step to another, and the classic i.i.d. (independent and identically distributed), or data exchangeability assumption does not hold anymore. In this paper, we present an approximate Bayesian inference approach using variational methods that addresses these issues for conjugate exponential family models with latent variables. Our proposal makes use of a novel scheme based on hierarchical priors to explicitly model temporal changes of the model parameters. We show how this approach induces an exponential forgetting mechanism with adaptive forgetting rates. The method is able to capture the smoothness of the concept drift, ranging from no drift to abrupt drift. The proposed variational inference scheme maintains the computational efficiency of variational methods over conjugate models, which is critical in streaming settings. The approach is validated on four different domains (energy, finance, geolocation, and text) using four real-world data sets.
Original languageEnglish
Article number1942
JournalMathematics
Volume8
Issue number11
Pages (from-to)1-27
Number of pages27
DOIs
Publication statusPublished - Nov 2020

Keywords

  • Concept drift
  • Exponential forgetting
  • Latent variable models
  • Nonstationary data streams
  • Power priors
  • Variational inference

Fingerprint

Dive into the research topics of 'Variational Inference over Nonstationary Data Streams for Exponential Family Models'. Together they form a unique fingerprint.

Cite this