ddb presentation2data warehousing

Upload: calebkpabiteytetteh

Post on 13-Apr-2018

222 views

Category:

Documents


0 download

TRANSCRIPT

  • 7/26/2019 DDB Presentation2Data Warehousing

    1/27

    Data Warehousing

  • 7/26/2019 DDB Presentation2Data Warehousing

    2/27

    The Data Warehouse DefinitionB. Imnon:

    A data warehouse is a subject oriented, integrated, non-

    voatie, and time-variant coection of data in su!!ort of

    management"s decisions.

    #. $haudhiri % &. Da'a:

    Data warehousing is a coection of decision su!!ort

    technoogies, aimed at enabing the (nowedge wor(er)e*ecutive, manager, ana'st+ to ma(e better and

    faster decisions.

  • 7/26/2019 DDB Presentation2Data Warehousing

    3/27

    Data Warehouse Subject-Oriented

    Organized around major subjects, such ascustomer, product, sales.

    Focusing on the modeling and analysis of datafor decision makers, not on daily operations ortransaction processing.

    Provide a simple and concise view aroundparticular subject issues by ecluding datathat are not useful in the decision supportprocess.

  • 7/26/2019 DDB Presentation2Data Warehousing

    4/27

    Data Warehouse Integrated !onstructed by integrating multiple, heterogeneous

    data

    "ources

    # relational or other databases, flat files, eternal data

    $ata cleaning and data integration techni%ues areapplied.

    # &nsure consistency in naming conventions, encoding

    structures, attribute measures, etc. among differentdata

    sources# 'hen data is moved to the warehouse, it is converted.

  • 7/26/2019 DDB Presentation2Data Warehousing

    5/27

    Data Warehouse Time Variant

    (he time horizon for the data warehouse issignificantly longer than that of operational systems.

    # Operational database) current value data.# $ata warehouse data) provide information from a

    historical perspective *e.g., past +- years/ &very key structure in the data warehouse

    # !ontains an element of time# 0ut the key of operational data may or may not contain

    1time element2.

  • 7/26/2019 DDB Presentation2Data Warehousing

    6/27

    Data Warehouse Non-Volatile

    3 physically separate store of data

    transformed from the operationalenvironment.

    Operational update of data does not occur inthe data warehouse environment.

    # $oes not re%uire transaction processing,recovery, and concurrency control mechanisms

    # 4e%uires only ) loading and access of data.

  • 7/26/2019 DDB Presentation2Data Warehousing

    7/27

    Decision Support and OLA !Na"athe# 5nformation technology to help the knowledgeworker *eecutive, manager, analyst/ make faster

    and better decisions.

    # 'ill a -6 discount increase sales volume sufficiently7# 'hich of two new medications will result in the best

    best outcome) higher recovery rate 8 shorter hospitalityrate7

    # 9ow did the share price of computer manufacturers

    correlate with %uarterly profits over the past - years7

    On:ine 3nalytical Processing *O:3P/ is an elementof decision support system *$""/.

  • 7/26/2019 DDB Presentation2Data Warehousing

    8/27

    Data Warehouse !Na"athe#

    3 decision support database that is maintainedseparately from the organisation;s operationaldatabases. 3 data warehouse is a

    # subject oriented,# integrated,# timevarying,

    # nonvolatile

    collection of data that is used primarily in theorganisational decision making.

  • 7/26/2019 DDB Presentation2Data Warehousing

    9/27

    Wh$ separate data %arehouse&

    Performance

    # (he operational $0s are tuned to support known O:(Pworkloads# "upporting O:3P re%uires special data organisations,

    access methods and implementation methods

    Function# (he decision support re%uires data that may be missing

    from the operational $0s# $ecision support usually re%uires consolidating data frommany heterogeneous sources

  • 7/26/2019 DDB Presentation2Data Warehousing

    10/27

    OLT "s' OLA

  • 7/26/2019 DDB Presentation2Data Warehousing

    11/27

    oas of a Data Warehouse

    The data warehouse must ma(e an organisation"s

    information easi' accessibe

    The data warehouse must !resent the organisation"sinformation consistent'

    The data warehouse must be ada!tive and resiient tochange

    The data warehouse must be a secure bastion that!rotects our information assets

    The data must serve as the foundation for im!roveddecision ma(ing

    The business communit' must acce!t the data

    warehouse if it is to be deemed successfu.

  • 7/26/2019 DDB Presentation2Data Warehousing

    12/27

    Data Warehouse Architecture

  • 7/26/2019 DDB Presentation2Data Warehousing

    13/27

    Another iew of the DW Architecture

    !erationa

    #ource

    #'stems

    #ervices:

    $ean, combine, and

    standardi/e

    $onform dimensions

    0

    3&124#12I$1#

    Data #tore:

    5at fies and

    reationa

    tabes

    6rocessing:

    #orting and

    se7uentia

    !rocessing

    Data 8art 9

    DI810#I0A;

    Atomic and

    summar' data

    Based on a singe

    business !rocess

    Data 8art 9< =

    )#imiar' designed+

    Ad >oc 3uer' Toos

    2e!ort Writers

    Ana'tic

    A!!ications

    8odeing:

    5orecasting

    #coring

    Data mining

    1*tract

    1*tract

    1*tract

    Data

    #taging

    Area

    ;oad

    ;oad

    Data

    6resentation

    Area

    DW Bus:

    $onformed

    facts %

    dimensions

    Access

    Access

    Data

    Access

    Toos

  • 7/26/2019 DDB Presentation2Data Warehousing

    14/27

    Data Warehouse "s' Data (art

    &nterprise warehouse) collects all information

    about subject *customer, products, sales, assets,personnel/ that span the entire organisation

    # 4e%uires etensive business modelling

    #

  • 7/26/2019 DDB Presentation2Data Warehousing

    15/27

    To (eet the )e*uirements %ithin DW

    (he data is organised differently, i.e.1multidimensional2# starjoins schemas# snowflake schemas

    (he data is viewed differently (he data is stored differently# vector *array/ storage

    (he data is indeed differently# bitmap indees# join indees

  • 7/26/2019 DDB Presentation2Data Warehousing

    16/27

    Dimensional (odelling + ,asic oncepts

    Fact

    # 1something not known in advance2,# an observation# many facts *but not all/ have numerical,

    continuously values

    e.g., the price of a product, %uantity

    3ttribute# 1describe a characteristic of a tangible thing2# 1we do not measure them, we usually know them2# usually tet fields, with discrete valuese.g., the flavour of a product, the size of a product

  • 7/26/2019 DDB Presentation2Data Warehousing

    17/27

    $< # 0asic !oncepts = $imension

    # a business perspective from which data islooked upon

    # 1a collection of tet like attributes that arehighly correlated2

    e.g. Product, "tore, (ime

    >ranularity

    # the level of detail of data contained in the datawarehousee.g. $aily item totals by product, by store

  • 7/26/2019 DDB Presentation2Data Warehousing

    18/27

    ./ample o0 a Dimensional (odel

    Th St d d T l t 1

  • 7/26/2019 DDB Presentation2Data Warehousing

    19/27

    The Standard Template 1uer$

  • 7/26/2019 DDB Presentation2Data Warehousing

    20/27

    The Time Dimension

  • 7/26/2019 DDB Presentation2Data Warehousing

    21/27

    8utidimensiona Data

  • 7/26/2019 DDB Presentation2Data Warehousing

    22/27

    A #am!e Data $ube

  • 7/26/2019 DDB Presentation2Data Warehousing

    23/27

    5acts

  • 7/26/2019 DDB Presentation2Data Warehousing

    24/27

    5acts and Additive 6ro!ert'

  • 7/26/2019 DDB Presentation2Data Warehousing

    25/27

    #emiadditive fact 1*am!e

  • 7/26/2019 DDB Presentation2Data Warehousing

    26/27

    0umeric 8easures of Intensit'

  • 7/26/2019 DDB Presentation2Data Warehousing

    27/27

    1nd