Abstract
Motivated by the increasing need to analyze complex, uncertain multidimensional data this paper proposes probabilistic OLAP queries that are computed using probability distributions rather than atomic values. The paper describes how to create probability distributions from base data, and how the distributions can be subsequently used in pre-aggregation.
Since the probability distributions can become large, we show how to achieve good time and space efficiency by approximating the distributions. We present the results of
several experiments that demonstrate the effectiveness of our methods. The work is motivated with a real-world case study, based on our collaboration with a leading Danish vendor of location-based services. This paper is the first to consider the approximate processing of probabilistic OLAP queries over probability distributions.
Since the probability distributions can become large, we show how to achieve good time and space efficiency by approximating the distributions. We present the results of
several experiments that demonstrate the effectiveness of our methods. The work is motivated with a real-world case study, based on our collaboration with a leading Danish vendor of location-based services. This paper is the first to consider the approximate processing of probabilistic OLAP queries over probability distributions.
Original language | English |
---|---|
Title of host publication | Data Warehousing and OLAP : Proceedings of the 9th ACM international workshop on Data warehousing and OLAP 2006, Arlington, Virginia, USA |
Number of pages | 8 |
Publisher | Association for Computing Machinery |
Publication date | 2006 |
Pages | 35-42 |
ISBN (Print) | 1595935304 |
Publication status | Published - 2006 |
Event | ACM Ninth International Workshop on Data Warehousing and OLAP - Arlington, Va, United States Duration: 10 Nov 2006 → 10 Nov 2006 Conference number: 9 |
Conference
Conference | ACM Ninth International Workshop on Data Warehousing and OLAP |
---|---|
Number | 9 |
Country/Territory | United States |
City | Arlington, Va |
Period | 10/11/2006 → 10/11/2006 |
Keywords
- OLAP
- Pre-aggregation
- probability