Analyzing concept drift: A case study in the financial sector

Andrés Masegosa, Ana Martinez, Dario Ramos-Lopez, Helge Langseth, Thomas Dyhre Nielsen, Antonio Salmerón

Publication: Contribution to journal, Journal article, Research, peer review

Abstract

In this paper, we present a method for exploratory data analysis of streaming data based on probabilistic graphical models (latent variable models). This method is illustrated by concept drift tracking, using financial client data from a European regional bank. For this particular setting, the analyzed data spans the period from April 2007 to March 2014 and therefore starts before the beginning of the financial crisis of 2008. The implied changes in the economic climate during this period manifest themselves as concept drift in the underlying data-generating distribution. We explore and analyze this financial client data using a probabilistic graphical modeling framework that provides an explicit representation of concept drift as an integral part of the model. We show how learning these types of models from data provides additional insight into the hidden mechanisms governing the drift in the domain. We present an iterative approach for identifying disparate factors that jointly account for the drift in the domain. This includes a semantic characterization of one of the main influencing drift factors. Based on the experiences and results obtained from analyzing the financial data, we discuss the applicability of the framework within a more general context.
Original language: English
Journal: Intelligent Data Analysis
Volume: 24
Issue number: 3
Pages (from-to): 665-688
Number of pages: 24
ISSN: 1088-467X
DOI
Status: Published - 2020