1 flexible data cube for range-sum queries in dynamic olap data cubes authors: c.-i lee and y.-c. li...
DESCRIPTION
3 Introduction Data cubes are frequently adopted to implement OLAP and provides aggregate information Data cube: also known as Multi-dimensional Database(MDDB) Measure attributes: be chosen as metrics of interest Functional attributes(dimensions): other attributes of records. Cells: store measure attribute values Range-Sum Query: add all cells in query regionTRANSCRIPT
![Page 1: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/1.jpg)
1
Flexible Data Cube for Range-Sum Queries in
Dynamic OLAP Data Cubes
Authors: C.-I Lee and Y.-C. LiSpeaker: Y.-C. LiDate :Dec. 19, 2002
![Page 2: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/2.jpg)
2
Outline Introduction Related works Analysis of the average query and
update costs Flexible data cube Performance analysis Conclusions
![Page 3: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/3.jpg)
3
Introduction
Data cubes are frequently adopted to implement OLAP and provides aggregate information
Data cube: also known as Multi-dimensional Database(MDDB)
Measure attributes: be chosen as metrics of interest Functional attributes(dimensions):
other attributes of records. Cells: store measure attribute values Range-Sum Query:
add all cells in query region
![Page 4: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/4.jpg)
4
Measure attribute → Sale_Volume Dimensions → Year and Age of customers
Car-sales example
![Page 5: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/5.jpg)
5
+
+
4
20
255
1430
![Page 6: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/6.jpg)
6
Several previous approaches are used to accelerate the response time
But they slow down the update speed and require further space overhead
This study considers both query and update costs to construct data cubes
No extra space overhead Choice the best cube in any query or update ratio
We also present a FDC method No extra space overhead (for dense data cube) Select or integrate some pre-aggregation
techniques for each dimension
![Page 7: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/7.jpg)
7
Related works
The history of pre-aggregate range-sum queries
Prefix Sum(PS)[Ho et al., 1997]
Dynamic Data Cube(DDC)[Geffer et al., 1999b]
Relative Prefix Sum(RPS) [Geffer et al., 1999a]
Hierarchical Cube (HC)[Chan & Ioannidis, 1999]
Double RPS[Liang et al., 2000]
Space-Efficient Data Cube(SEDC)[Riedewal et al., 2000]
Iterative Data Cube(IDC)[Riedewal et al., 2001]
1997 1998 1999 2000 2001
![Page 8: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/8.jpg)
8
Prefix Sum(PS) ( Ho et al., 1997 )
3+5+1+2+7+3+2+6+2+4+2+3=40 A: 2+3+3+3+1+5+3+5+1+3+3+4=36 P: 103-50-35+18=36
![Page 9: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/9.jpg)
9
Prefix Sum(PS)
![Page 10: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/10.jpg)
10
Other methods RPS ( Geffer et al., 1999a)
Two levels(Local PS and overlay boxes) but extra space overhead HC ( Chan & Ioannidis, 1999 )
Hierarchical method DDC ( Geffer et al., 1999b )
Hierarchical method but need extra space overhead SEDC ( Riedewald et al., 2000 )
No exrtra space overhead of RPS and DDC (SRPS and SDDC) Double RPS ( Liang et al., 2000 )
Three levels but need extra space overhead IDC ( Riedewald et al., 2001 )
No extra space overhead (different method in different dimension)
![Page 11: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/11.jpg)
11
Our work focuses mainly on methods that do not require any extra space overhead for dense data cubes.
![Page 12: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/12.jpg)
12
Analysis of the average query and update costs Assume query ratio + update ratio
=100% Average query cost:
Average update cost: Cu(n) / n
![Page 13: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/13.jpg)
13
![Page 14: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/14.jpg)
14
Flexible Data Cube(FDC) Exponential time is required to find the optimal
pre-aggregated data cube Proposed the FDC method that is a heuristic
method to select or integrate any two pre-aggregation techniques for each dimension.
![Page 15: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/15.jpg)
15
In certain situation Size Query ratio
FDCopt = min average cost{FDC candidates} FDCopt = min{q×CaqFDC + u×CauFDC} Time complexity O(9n)=O(n)
The FDC Method
k’=0 A, LPS or PSk’=1 A, LPS or PSA, LPS or PSk’=2 A, LPS or PSA, LPS
or PSk’=3 A, LPS or PSA, LPS or PSk’=4 A, LPS or PS
A, LPS or PSk’=5
A, LPS or PS
A, LPS or PSk’=7
A, LPS or PS
A, LPS or PSk’=6 A, LPS
or PS
A, LPS or PS
k’=4 A PS
![Page 16: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/16.jpg)
16
Performance analysis Average cost at different query ratios d =
2, n = 16, 64
010203040506070
1 0.8 0.6 0.4 0.2 0query ratio (q)
Aver
age
cost
(acc
ess
cells
)
ALPSPSFDC
1
10
100
1000
10000
1 0.8 0.6 0.4 0.2 0query ratio (q)
Aver
age
cost
(acc
ess
cells
)
ALPSPSFDC
![Page 17: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/17.jpg)
17
Average cost for different dimension sizes: d = 4, q = 1, 0.9
1.E+001.E+011.E+021.E+031.E+041.E+051.E+061.E+071.E+08
2 4 8 16 32 64 128 256size (n)
Aver
age
cost
(acc
ess c
ells
)
ALPSPSFDC
1.E+001.E+011.E+021.E+031.E+041.E+051.E+061.E+071.E+08
2 4 8 16 32 64 128 256size (n)
Aver
age
cost
(acc
ess c
ells)
ALPSPSFDC
![Page 18: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/18.jpg)
18
Average cost for different dimension sizes: d = 4, q = 0.1, 0
1.E+001.E+011.E+021.E+031.E+041.E+051.E+061.E+071.E+081.E+09
2 4 8 16 32 64 128 256size (n)
Aver
age
cost
(acc
ess c
ells) A
LPSPSFDC
1.E+001.E+011.E+021.E+031.E+041.E+051.E+061.E+071.E+081.E+09
2 4 8 16 32 64 128 256size (n)
Aver
age co
st (a
cces
s ce
lls) A
LPSPSFDC
![Page 19: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/19.jpg)
19
Conclusions Take both the query and update costs into
consideration to select the suitable data cube. Propose the FDC method
select or integrate pre-aggregating techniques for each dimension.
Outperform other methods for any query (or update) ratio situation
linear time: determine the best FDC structure. In the future, develop new techniques to
support sparse data sets
![Page 20: 1 Flexible Data Cube for Range-Sum Queries in Dynamic OLAP Data Cubes Authors: C.-I Lee and Y.-C. Li Speaker: Y.-C. Li Date :Dec. 19, 2002](https://reader034.vdocuments.us/reader034/viewer/2022052606/5a4d1aed7f8b9ab05997c2bd/html5/thumbnails/20.jpg)
20
Thank You