microsoft powerpoint - dw_bl_sept2013_07(1) [compatibility mode]

Upload: okta-jilid-ii

Post on 02-Jun-2018

217 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    1/41

    Course: 0604M- Testing & ImplementationCourse: Datawarehouse

    An Overview of Testing and ImplementatiOLAP

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    2/41

    OL A P (OLAP)

    2

    OLAP O

    2D

    Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    3/41

    MD D M

    10 15

    Prod Code Time Code Store Code Sales Qty

    Store InfoNumerical MeasuresKey columns joining fact tableto dimension tables

    Product Info

    Time Info

    . . .

    Fact table formeasures

    Dimension tables

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    4/41

    OLAP & D

    Data

    Warehouse

    Time

    Product

    Category e.g Electrical Appliance

    Sub Category e.g Kitchen

    Product e.g Toaster

    Category e.g Electrical Appliance

    Sub Category e.g Kitchen

    Product e.g Toaster

    Drill downRoll up

    4Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    5/41

    OLAP &

    DataWarehouse

    Time

    Product

    Product=ToasterProduct

    PivotSlicing

    Time Region

    5Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    6/41

    A OLAP

    OLAP

    (1) OLAP (OLAP) DBM

    OLAP

    (2) M OLAP (MOLAP)

    A

    D

    (3) H OLAP (HOLAP) DBM

    MDBM

    MOLAP

    6Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    7/41

    P OLAP

    D ,

    A .

    C .

    OLAP , ,

    .B!!!

    QL .

    7Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    8/41

    OLAP: D M

    DBM

    : ,

    : ,

    P ( , DBM),

    MDDB

    P

    IBM DB2, O, IQ, B, I

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    9/41

    OLAP

    sale prodId storeId date amt

    p1 s1 1 12

    p2 s1 1 11

    p1 s3 1 50

    p2 s2 1 8

    p1 s1 2 44

    p1 s2 2 4

    Fact table view:

    81

    9Datawarehouse_09/2013

    Select sum(amt)From saleWhere date=1

    Select date, sum(amt)

    From saleGroup by date

    date amt1 81

    2 48

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    10/41

    MOLAP: D M

    M D M

    MDDB:

    F

    D DB

    P

    P, A E, G

    10Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    11/41

    MOLAP C : 2

    sale prodId storeId amt

    p1 s1 12p2 s1 11

    1 s3 50

    s1 s2 s3p1 12 50p2 11 8

    Fact table view:

    Multi-dimensional cube:

    p2 s2 8

    11Datawarehouse_09/2013

    Select prodId, storeId,sum(amt)

    From saleGroup by prodId,storeId

    prodId storeId amt

    p1 s1 12

    p1 s3 50p2 s1 11

    p2 s2 8

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    12/41

    3D C : 3

    Multi-dimensional cube:Fact table view:

    sale prodId storeId date amtp1 s1 1 12p2 s1 1 11p1 s3 1 50p2 s2 1 8

    1 s1 2 44

    day 2 s1 s2 s3p1 44 4p2 s1 s2 s3day 1

    p1 s2 2 4 p2 11 8

    12Datawarehouse_09/2013

    Select date, prodId, storeId,sum(amt)From saleGroup by date, prodId,storeId

    date prodId storeId amt

    1 p1 s1 12

    1 p1 s3 50

    1 p2 s1 11

    1 p2 s2 8

    2 p1 s1 44

    2 p1 s2 4

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    13/41

    13Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    14/41

    Data cubeData cube

    A two-dimensional,

    three-dimensional, orhigher-dimensionalob ect in which each

    dimension of the datarepresents a measureof interest

    - Grain

    - Drill-down- Slicing

    14Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    15/41

    15Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    16/41

    (

    Stores : New York

    Hats Coats Jackets Total

    Product : Hats

    Jan Feb Mar Total

    Jan 200 550 350 1100

    Feb 210 480 390 1080

    Mar 190 480 380 1050

    Total 600 1510 1120 3230

    New York 200 210 190 600Boston 20 175 125 320

    San Jose 110 210 125 445

    Total 330 595 440 1365

    Months January

    New York Boston San Jose Total

    Hats 200 20 110 330

    Coats 550 435 275 1260

    Jackets 350 220 125 695

    Total 1100 675 510 2285 16Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    17/41

    E

    ct

    Juice

    Milk

    NY

    SF

    LA

    10

    34

    Dimensions:

    Time, Product, StoreAttributes:

    Product (upc, price, )

    Store

    roll-up to brand

    roll-up to region

    Pro

    d

    Time

    M T W Th F S S

    CokeCream

    Soap

    Bread

    5632

    12

    56

    56 units of bread sold in LA on M

    Hierarchies:

    Product Brand

    DayWeek Quarter

    Store Region Countryroll-up to week

    17Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    18/41

    C A:

    day 2 s1 s2 s3p1 44 4p2 s1 s2 s3

    p1 12 50p2 11 8

    day 1

    . . .

    Example: computing sums

    s1 s2 s3

    p1 56 4 50p2 11 8

    s1 s2 s3sum 67 12 50

    sum

    p1 110

    p2 19

    129

    drill-down

    rollup

    18Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    19/41

    C O

    day 2 s1 s2 s3p1 44 4p2 s1 s2 s3

    p1 12 50p2 11 8

    day 1

    . . .

    sale(s1,*,*)

    s1 s2 s3

    p1 56 4 50p2 11 8

    s1 s2 s3sum 67 12 50

    sum

    p1 110

    p2 19

    129

    sale(*,*,*)sale(s2,p2,*)

    19Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    20/41

    s1 s2 s3 *p1 56 4 50 110p2 11 8 19* 67 12 50 129

    E C

    day 2 s1 s2 s3 *

    *

    p2* 44 4 48s1 s2 s3 *

    p1 12 50 62p2 11 8 19* 23 8 50 81

    day 1 sale(*,p2,*)

    20Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    21/41

    A H

    storeday 2 s1 s2 s3p1 44 4p2 s1 s2 s3

    p1 12 50day 1

    region A region B

    p1 56 54p2 11 8

    country

    (store s1 in Region A;stores s2, s3 in Region B)

    p

    21Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    22/41

    P MOLAP P

    .

    BF , MDD . A 200MB 5GB

    MDD 50GB .

    D .

    MDD, , DBM .

    22Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    23/41

    H OLAP (HOLAP)

    HOLAP = H OLAP:

    B

    DBM

    MDBM

    MOLAP

    23Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    24/41

    Multi-dimensionalaccess

    Multidimensional

    ClientMDBMS Server

    SQL-Read

    RDBMS Server

    D F HOLAP

    Viewer

    Relational

    Viewer

    Multi-

    dimensionaldata

    data Meta data

    Deriveddata

    SQL-ReachThrough

    SQL-Read

    24Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    25/41

    , :1) P:

    H ?

    MDD .

    2) D :

    MDD 50GB , DBM

    .

    25Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    26/41

    An experiment with Relational and theMultidimensional models on a data set

    The analysis of the authors example illustrates the following differences betweenthe best Relational alternative and the Multidimensional approach.

    relationalrelational MultiMulti--

    dimensionaldimensional

    ImprovementImprovement

    Disk space requirementDisk space requirement

    (Gigabytes)(Gigabytes)

    1717 1010 1.71.7

    * This may include the calculation of many other derived data without anyadditional I/O.

    Reference: ://../599/F2002//I2P064.

    Retrieve the corporate measuresRetrieve the corporate measuresActual Vs Budget, by month (I/Os)Actual Vs Budget, by month (I/Os)

    240240 11 240240

    Calculation of VarianceCalculation of VarianceBudget/Actual for the wholeBudget/Actual for the whole

    database (I/O time in hours)database (I/O time in hours)

    237237 2*2* 110*110*

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    27/41

    IF A.

    B. 50 GBC. 6090

    D. L

    E. D

    F.

    HEN C

    IF A. 100 GB

    " ".

    C. H D. D ,

    E. D

    HEN C .

    IF A. OLAP B. D

    C. E

    HEN C

    27Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    28/41

    E

    OLAP : (CD)

    EC

    C C C

    A

    HOLAP

    B F P

    28Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    29/41

    OLAP: OACLE 8

    OACLE ; OACLE D

    OACLE B

    A E

    MOLAP: OACLE E

    M D P P IB

    HOLAP:

    OACLE 8 OACLE E

    OACLE A M

    OACLE E C (C/ )

    29Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    30/41

    C

    OLAP: DBM > /

    MOLAP: MDD > C

    OLAP MOLAP: D

    MOLAP: (1050GB)

    OLAP:

    B

    , DBM/OLAP.

    30Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    31/41

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    32/41

    D M D

    D

    E

    32

    z Data Mining providesthe Enterprise withintelligence

    Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    33/41

    ...

    G 100,000 , ?

    ?

    I I . 2, OI?

    33

    I I 2,500 5,000, ?

    I I , ?

    ?

    Data Mining helps extract such informationDatawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    34/41

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    35/41

    D M

    G D M

    A B

    35

    C

    H G C

    B C

    Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    36/41

    ?

    A

    :

    (.., , , )

    36

    . G G

    Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    37/41

    D ?

    O & & .

    C OLAP . .

    , & & .

    z Function

    37

    y Missing data: Decision support requires historical data, whichop dbs do not typically maintain.

    y Data consolidation: Decision support requires consolidation(aggregation, summarization) of data from manyheterogeneous sources: op dbs, external sources.

    y Data quality: Different sources typically use inconsistent datarepresentations, codes, and formats which have to bereconciled.

    Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    38/41

    *M C

    F

    O M C

    *M: 2000+

    38

    AM' C 100+

    F C (NC )

    D

    Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    39/41

    O P

    *M

    I M M A

    P

    A O P P

    P

    39

    P P:

    N, , L

    I

    M

    I

    B C

    E

    D

    Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    40/41

    N (JI) P

    N

    P (PO A)

    O P

    ACAL

    *M M

    40

    D I

    ( D) *M

    P

    L I

    D

    D C

    *M

    Datawarehouse_09/2013

  • 8/11/2019 Microsoft PowerPoint - DW_BL_Sept2013_07(1) [Compatibility Mode]

    41/41

    *M

    NC 5100M 96 N;

    N :

    H D:

    24 00 1000

    5

    41

    N D :

    N :

    N Q:

    5

    100 +

    0,000

    Datawarehouse_09/2013