why data warehouse? scenario 1 abc pvt. ltd is a company with branches at mumbai, delhi, chennai and...

43

Upload: nickolas-clarke

Post on 22-Dec-2015

213 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales
Page 2: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Why Data Warehouse?

Page 3: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore.

The Sales Manager wants quarterly sales report.

Each branch has a separate operational system.

Page 4: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Mumbai

Delhi

Chennai

Banglore

SalesManager

Sales per item type per branchfor first quarter.

Page 5: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Solution 1:ABC Pvt Ltd.

Extract sales information from each database.

Store the information in a common repository at a single site.

Page 6: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Mumbai

Delhi

Chennai

Banglore

DataWarehouse

SalesManager

Query &Analysis tools

Report

Page 7: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

One Stop Shopping Super Market has huge operational database.Whenever Executives wants some report the OLTP system becomes slow and data entry operators have to wait for some time.

Page 8: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

OperationalDatabase

Data Entry Operator

Data Entry Operator

ManagementWait

Report

Page 9: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Solution 2

Extract data needed for analysis from operational database.

Store it in another system, the data warehouse.

Refresh warehouse at regular intervals so that it contains up to date information for analysis.

Warehouse will contain data with historical perspective.

Page 10: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Operationaldatabase

DataWarehouse

Extractdata

Data EntryOperator

Data EntryOperator

ManagerTransaction

Report

Page 11: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Cakes & Cookies is a small, new company. The chairman of this company wants his company to grow. He needs information so that he can make correct decisions.

Page 12: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Solution 3Improve the quality of data before loading it

into the warehouse.Perform data cleaning and transformation

before loading the data.Use query analysis tools to support adhoc

queries.

Page 13: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Query & Analysis

tool

Chairman

Expansion

Improvement

sales

time

DataWarehouse

Page 14: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Summing up?Why do you need a warehouse?

Operational systems could not provide strategic information

Executive and managers need such strategic information for Making proper decision Formulating business strategies Establishing goals Setting objectives Monitoring results

Page 15: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Why operational data is not capable of producing valuable information?Data is spread across incompatible structures

and systemsNot only that, improvements in technology

had made computing faster, cheaper and available

Page 16: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

FAILURES OF PAST DECISION-SUPPORT SYSTEMS

OLTP systems

Page 17: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Decision support systems

Page 18: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Operational and informational

Page 19: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales
Page 20: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales
Page 21: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Functional definition of a DWThe data warehouse is an informational

environment that Provides an integrated and total view of the

enterprise Makes the enterprise’s current and historical

information easily available for decision making Makes decision-support transactions possible

without hindering operational systems Renders the organization’s information consistent Presents a flexible and interactive source of

strategic information

Page 22: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Questions????Describe five differences between operational

systems and informational systemsA data warehouse in an environment, not a

product. Discuss.

Page 23: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales
Page 24: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

A data warehouse is- subject-oriented,- integrated,- time-variant,- nonvolatile

collection of data in support of management’sdecision making process.

Page 25: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Subject-oriented

Data warehouse is organized around subjects such as sales, product, customer.

It focuses on modeling and analysis of data for decision makers.

Excludes data not useful in decision support process.

Page 26: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales
Page 27: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Integration

Data Warehouse is constructed by integrating multiple heterogeneous sources.

Data Preprocessing are applied to ensure consistency.RDBMS

LegacySystem

DataWarehouse

Flat File Data ProcessingData Transformation

Page 28: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales
Page 29: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

IntegrationIn terms of data.

encoding structures.

Measurement ofattributes.

physical attribute. of data

naming conventions.

Data type format

remarks

Page 30: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Time-variant

Provides information from historical perspective, e.g. past 5-10 years

Every key structure contains either implicitly or explicitly an element of time, i.e., every record has a timestamp.

The time-variant nature in a DWAllows for analysis of the pastRelates information to the presentEnables forecasts for the future

Page 31: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Non-volatile

Data once recorded cannot be updated.Data warehouse requires two

operations in data accessingInitial loading of dataIncremental loading of data

load

access

Page 32: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales
Page 33: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Data GranularityIn an operational system, data is usually kept at the

lowest level of detail.In a DW, data is summarized at different levels.

Three data levels in a banking data warehouse

Daily Detail Monthly Summary Quaterly Summary

Account Account Account

Activity Date Month Month

Amount No. of transactions No. of transactions

Deposit/ Withdraw Withdrawals Withdrawals

Deposits Deposits

Beginning Balance Beginning Balance

Ending Balance Ending Balance

Page 34: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Operational v/s Information SystemFeatures Operational Information

Characteristics Operational processing Informational processing

Orientation Transaction Analysis

User Clerk,DBA,database professional

Knowledge workers

Function Day to day operation Decision support

Data Content Current Historical, archived, derived

View Detailed, flat relational Summarized, multidimensional

DB design Application oriented Subject oriented

Unit of work Short ,simple transaction Complex query

Access Read/write Read only

Page 35: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Operational v/s Information System

Features Operational Information

Focus Data in Information out

No. of records accessed

tens/ hundreds millions

Number of users thousands hundreds

DB size 100MB to GB 100 GB to TB

Usage Predictable, repetitive Ad hoc, random, heuristic

Response Time Sub-seconds Several seconds to minutes

Priority High performance,high availability

High flexibility,end-user autonomy

Metric Transaction throughput Query throughput

Page 36: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Two approaches in designing a DWTop-down approach Bottom-up approach

Enterprise view of data Narrow view of data

Inherently architected Inherently incremental

Single, central storage of data Faster implementation of manageable parts

Centralized rules and control Each datamart is developed independently

Takes longer time to build Comparatively less time than a DW

Higher risk to failure Less risk of failure

Needs higher level of cross-functional skills

Unmanageable interfaces

Page 37: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Bottom Up Approach

Page 38: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Top Down Approach

Page 39: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

A Practical Approach-Kimball1. Plan and Define requirements2. Create a surrounding architecture3. Conform and Standardize the data Content4. Implement Data Warehouse as series of

super-mart one at a time.

Page 40: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Individual Architected Data Marts

Common Logical Subject Area ERD

Common Business Dimensions

Common Business Rules

Common Business Metrics

Glossary

SalesDistribution

Product

Marketing Customer Accounts

Finance Inventory Vendors

An Incremental Approach

Page 41: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

ArchitectedArchitectedEnterpriseEnterpriseFoundationFoundation

SalesDistribution

Product

Marketing Customer Accounts

Finance Inventory Vendors

Enterprise Data Warehouse

The Eventual Result

Page 42: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Data Warehouse:

Holds multiple subject areasHolds very detailed informationWorks to integrate all data sourcesDoes not necessarily use a dimensional model but feeds dimensional models.Data Mart

Often holds only one subject area- for example, Finance, or SalesMay hold more summarised data (although many hold full detail)Concentrates on integrating information from a given subject area or set of source systemsIs built focused on a dimensional model using a star schema.

Page 43: Why Data Warehouse? Scenario 1 ABC Pvt. Ltd is a company with branches at Mumbai, Delhi, Chennai and Bangalore. The Sales Manager wants quarterly sales

Data Warehouse verses data marts