Analysis of massive data streams using R and AMIDST

Anders Læsø Madsen, Antonio Salmerón

Research output: Contribution to conference without publisher/journalPosterResearch

Abstract

Today, omnipresent sensors are continuously providing streaming data on the environments in which they operate. Sources of streaming data with even a modest updating frequency can produce extremely large volumes of data, thereby making efficient and accurate data analysis and prediction difficult.
Probabilistic graphical models (PGMs) provide a well-founded and principled approach for performing inference and belief updating in complex domains endowed with uncertainty. The on-going EU-FP7 research project AMIDST (Analysis of MassIve Data STreams, http://www.amidst.eu) is aimed at producing scalable methods able to handle massive data streams based on Bayesian networks technology. All of the developed methods are available through the AMIDST toolbox, implemented in Java 8. We show how the functionality of the AMIDST toolbox can be accessed from R. Available AMIDST objects include variables, distributions and Bayesian networks, as well as those devoted to inference and learning. The interaction between both platforms relies on the rJava package.
Original languageEnglish
Publication date2015
Number of pages1
Publication statusPublished - 2015
EventuseR! 2015 - Aalborg, Denmark
Duration: 27 Jun 201530 Jun 2015

Conference

ConferenceuseR! 2015
CountryDenmark
CityAalborg
Period27/06/201530/06/2015

Fingerprint

Bayesian networks
Electric power distribution
Sensors
Uncertainty

Cite this

Madsen, A. L., & Salmerón, A. (2015). Analysis of massive data streams using R and AMIDST. Poster presented at useR! 2015, Aalborg, Denmark.
Madsen, Anders Læsø ; Salmerón, Antonio. / Analysis of massive data streams using R and AMIDST. Poster presented at useR! 2015, Aalborg, Denmark.1 p.
@conference{e87f31bc75da435bb24ac8bfb134d885,
title = "Analysis of massive data streams using R and AMIDST",
abstract = "Today, omnipresent sensors are continuously providing streaming data on the environments in which they operate. Sources of streaming data with even a modest updating frequency can produce extremely large volumes of data, thereby making efficient and accurate data analysis and prediction difficult.Probabilistic graphical models (PGMs) provide a well-founded and principled approach for performing inference and belief updating in complex domains endowed with uncertainty. The on-going EU-FP7 research project AMIDST (Analysis of MassIve Data STreams, http://www.amidst.eu) is aimed at producing scalable methods able to handle massive data streams based on Bayesian networks technology. All of the developed methods are available through the AMIDST toolbox, implemented in Java 8. We show how the functionality of the AMIDST toolbox can be accessed from R. Available AMIDST objects include variables, distributions and Bayesian networks, as well as those devoted to inference and learning. The interaction between both platforms relies on the rJava package.",
author = "Madsen, {Anders L{\ae}s{\o}} and Antonio Salmer{\'o}n",
year = "2015",
language = "English",
note = "null ; Conference date: 27-06-2015 Through 30-06-2015",

}

Madsen, AL & Salmerón, A 2015, 'Analysis of massive data streams using R and AMIDST', useR! 2015, Aalborg, Denmark, 27/06/2015 - 30/06/2015.

Analysis of massive data streams using R and AMIDST. / Madsen, Anders Læsø; Salmerón, Antonio.

2015. Poster presented at useR! 2015, Aalborg, Denmark.

Research output: Contribution to conference without publisher/journalPosterResearch

TY - CONF

T1 - Analysis of massive data streams using R and AMIDST

AU - Madsen, Anders Læsø

AU - Salmerón, Antonio

PY - 2015

Y1 - 2015

N2 - Today, omnipresent sensors are continuously providing streaming data on the environments in which they operate. Sources of streaming data with even a modest updating frequency can produce extremely large volumes of data, thereby making efficient and accurate data analysis and prediction difficult.Probabilistic graphical models (PGMs) provide a well-founded and principled approach for performing inference and belief updating in complex domains endowed with uncertainty. The on-going EU-FP7 research project AMIDST (Analysis of MassIve Data STreams, http://www.amidst.eu) is aimed at producing scalable methods able to handle massive data streams based on Bayesian networks technology. All of the developed methods are available through the AMIDST toolbox, implemented in Java 8. We show how the functionality of the AMIDST toolbox can be accessed from R. Available AMIDST objects include variables, distributions and Bayesian networks, as well as those devoted to inference and learning. The interaction between both platforms relies on the rJava package.

AB - Today, omnipresent sensors are continuously providing streaming data on the environments in which they operate. Sources of streaming data with even a modest updating frequency can produce extremely large volumes of data, thereby making efficient and accurate data analysis and prediction difficult.Probabilistic graphical models (PGMs) provide a well-founded and principled approach for performing inference and belief updating in complex domains endowed with uncertainty. The on-going EU-FP7 research project AMIDST (Analysis of MassIve Data STreams, http://www.amidst.eu) is aimed at producing scalable methods able to handle massive data streams based on Bayesian networks technology. All of the developed methods are available through the AMIDST toolbox, implemented in Java 8. We show how the functionality of the AMIDST toolbox can be accessed from R. Available AMIDST objects include variables, distributions and Bayesian networks, as well as those devoted to inference and learning. The interaction between both platforms relies on the rJava package.

M3 - Poster

ER -

Madsen AL, Salmerón A. Analysis of massive data streams using R and AMIDST. 2015. Poster presented at useR! 2015, Aalborg, Denmark.