Pre-aggregation for Probability Distributions

Igor Timko, Curtis E. Dyreson, Torben Bach Pedersen

Publikation: Bog/antologi/afhandling/rapportRapportForskning

Resumé

Motivated by the increasing need to analyze complex uncertain multidimensional data (e.g., in order to optimize and personalize location-based services), this paper proposes novel types of {\em probabilistic} OLAP queries that operate on aggregate values that are probability distributions and the techniques to process these queries. The paper also presents the methods for computing the probability distributions, which enables pre-aggregation, and for using the pre-aggregated distributions for further aggregation. In order to achieve good time and space efficiency, the methods perform approximate computations of aggregate values. The paper also reports on the experiments with the methods. The work is motivated with a real-world case study, based on our collaboration with a leading Danish vendor of location-based services. No previous work considers the combination of the aspects of uncertain multidimensional data analysis that is considered in this paper (i.e., approximate processing of probabilistic OLAP queries over probability distributions).
OriginalsprogEngelsk
Udgivelses stedAalborg
ForlagDepartment of Computer Science, Aalborg University
Antal sider56
StatusUdgivet - 2005
NavnDB Tech Report
Nummer14

Fingerprint

Probability distributions
Location based services
Agglomeration
Processing
Experiments

Citer dette

Timko, I., Dyreson, C. E., & Pedersen, T. B. (2005). Pre-aggregation for Probability Distributions. Aalborg: Department of Computer Science, Aalborg University. DB Tech Report, Nr. 14
Timko, Igor ; Dyreson, Curtis E. ; Pedersen, Torben Bach. / Pre-aggregation for Probability Distributions. Aalborg : Department of Computer Science, Aalborg University, 2005. 56 s. (DB Tech Report; Nr. 14).
@book{cdb38540c49011dab67b000ea68e967b,
title = "Pre-aggregation for Probability Distributions",
abstract = "Motivated by the increasing need to analyze complex uncertain multidimensional data (e.g., in order to optimize and personalize location-based services), this paper proposes novel types of {\em probabilistic} OLAP queries that operate on aggregate values that are probability distributions and the techniques to process these queries. The paper also presents the methods for computing the probability distributions, which enables pre-aggregation, and for using the pre-aggregated distributions for further aggregation. In order to achieve good time and space efficiency, the methods perform approximate computations of aggregate values. The paper also reports on the experiments with the methods. The work is motivated with a real-world case study, based on our collaboration with a leading Danish vendor of location-based services. No previous work considers the combination of the aspects of uncertain multidimensional data analysis that is considered in this paper (i.e., approximate processing of probabilistic OLAP queries over probability distributions).",
author = "Igor Timko and Dyreson, {Curtis E.} and Pedersen, {Torben Bach}",
year = "2005",
language = "English",
publisher = "Department of Computer Science, Aalborg University",

}

Timko, I, Dyreson, CE & Pedersen, TB 2005, Pre-aggregation for Probability Distributions. DB Tech Report, nr. 14, Department of Computer Science, Aalborg University, Aalborg.

Pre-aggregation for Probability Distributions. / Timko, Igor; Dyreson, Curtis E.; Pedersen, Torben Bach.

Aalborg : Department of Computer Science, Aalborg University, 2005. 56 s. (DB Tech Report; Nr. 14).

Publikation: Bog/antologi/afhandling/rapportRapportForskning

TY - RPRT

T1 - Pre-aggregation for Probability Distributions

AU - Timko, Igor

AU - Dyreson, Curtis E.

AU - Pedersen, Torben Bach

PY - 2005

Y1 - 2005

N2 - Motivated by the increasing need to analyze complex uncertain multidimensional data (e.g., in order to optimize and personalize location-based services), this paper proposes novel types of {\em probabilistic} OLAP queries that operate on aggregate values that are probability distributions and the techniques to process these queries. The paper also presents the methods for computing the probability distributions, which enables pre-aggregation, and for using the pre-aggregated distributions for further aggregation. In order to achieve good time and space efficiency, the methods perform approximate computations of aggregate values. The paper also reports on the experiments with the methods. The work is motivated with a real-world case study, based on our collaboration with a leading Danish vendor of location-based services. No previous work considers the combination of the aspects of uncertain multidimensional data analysis that is considered in this paper (i.e., approximate processing of probabilistic OLAP queries over probability distributions).

AB - Motivated by the increasing need to analyze complex uncertain multidimensional data (e.g., in order to optimize and personalize location-based services), this paper proposes novel types of {\em probabilistic} OLAP queries that operate on aggregate values that are probability distributions and the techniques to process these queries. The paper also presents the methods for computing the probability distributions, which enables pre-aggregation, and for using the pre-aggregated distributions for further aggregation. In order to achieve good time and space efficiency, the methods perform approximate computations of aggregate values. The paper also reports on the experiments with the methods. The work is motivated with a real-world case study, based on our collaboration with a leading Danish vendor of location-based services. No previous work considers the combination of the aspects of uncertain multidimensional data analysis that is considered in this paper (i.e., approximate processing of probabilistic OLAP queries over probability distributions).

M3 - Report

BT - Pre-aggregation for Probability Distributions

PB - Department of Computer Science, Aalborg University

CY - Aalborg

ER -

Timko I, Dyreson CE, Pedersen TB. Pre-aggregation for Probability Distributions. Aalborg: Department of Computer Science, Aalborg University, 2005. 56 s. (DB Tech Report; Nr. 14).