acct 6910 building business intelligence systems an introduction to data warehouse

20
Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

Upload: shanon-tate

Post on 18-Jan-2018

215 views

Category:

Documents


0 download

DESCRIPTION

3 Why Data Warehouse Why Database??

TRANSCRIPT

Page 1: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

Acct 6910

Building Business Intelligence Systems

An Introduction to Data Warehouse

Page 2: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

2

Agenda

Why Data Warehouse

What is Data Warehouse

Current practice of data warehouse

Page 3: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

3

Why Data Warehouse

Why Database??

Page 4: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

4

Why Data WarehouseProblems with current database practices: Problem 1: Isolated databases

distributed in an enterprise

Sales

CRM

Inventory

Sub-problems:• Data Inconsistency• No comprehensive view of enterprise’s data sources – information island

Page 5: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

5

Why Data Warehouse Problem 1: Isolated databases

distributed in an enterprise

Sales

CRM

Inventory

Sub-problems:• Data Inconsistency• Performance

Page 6: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

6

Why Data Warehouse Problem 2: Historical data is archived in

offline storage systems

Sales

Sub-problems:• Historical data is always needed to support business decisionsArchiv

e

Historical Sales Data

Page 7: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

7

Why Data WarehouseCustomer

Places

1

Order

M

Contain

1

OrderLine

M

Order

M

Product

1

Belong to

M

Product Category

1

Page 8: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

8

Why Data Warehouse A marketing manager wants to know

sales amount distribution by product category and customer state in July?

Query???

Page 9: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

9

Why Data Warehouse Problem 3: Database is designed to

process transactions but not to answer decision support queries

Complex queries

Bad query performance

Page 10: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

10

What is Data WarehouseData Warehouse is designed to solve

problems associated with current database practices:

Problem 1: Isolated databases distributed in an enterprise

Sales

CRM

Inventory

Extract, Integrate and Replicate

Data Warehouse

Page 11: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

11

Why Data Warehouse Problem 2: Historical data is archived in

offline storage systems

Sales Archiv

e

Historical Sales Data

Data Warehouse

Integrate Historical Data with Current Data

Page 12: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

12

What is Data Warehouse Problem 3: Database is designed to process transactions but not to answer

decision support queries

Solution: In data warehouse, organize data in subject –oriented way rather than process-oriented way – dimensional modeling.

Page 13: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

13

What is Data Warehouse

Customer

Places

1

Order

M

Contain

1

OrderLine

M

Order

M

Product

1

Belong to

M

Product Category

1

Sales

Product

Time

Customer

ER Modeling

Dimensional Modeling

Page 14: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

14

What is Data Warehouse

Data Warehouse is a subject-oriented, integrated, time-variant, non-volatile collection of data in support of management’s decision making process.

1. Subject-oriented means the data warehouse focuses on the high-level entities of business such as sales, products, and customers. This is in contrast to database systems, which deals with processes such as placing an order.

Page 15: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

15

What is Data Warehouse

2. Integrated means the data is integrated from distributed data sources and historical data sources and stored in a consistent format.

3. Time-variant means the data associates with a point in time (i.e., semester, fiscal year and pay period)

4. Non-volatile means the data doesn’t change once it gets into the warehouse.

Page 16: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

16

What is Data Warehouse Data Warehouse Data Base

Purpose Decision Support Transaction Processing

Data Organization

Subject Oriented Process Oriented

Data Model Dimensional Modeling ER Modeling

Time Span Historical and Current Data

Current Data

Query processing

Scan a substantial subset of data

Scan a small set of data

Operation Read-only (add new data periodically)

Read & Update

Page 17: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

17

Current Practice of DW*

Expected DW market value is 2002 will grow to $113.5 billion.

Average DW development cost is $1.5 million and average maintenance cost is $0.5 million.

* Source: H.J. Watson, “ Current Practicing in Data Warehousing”, I.S. Management, 2001

Page 18: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

18

Current Practice of DW*

Sponsorship for the DW projectSponsor PercentageVP of a business unit 39.8CIO 26.9Business unit manager 16.7CEO 11.1Other 25.0

* Source: H.J. Watson, “ Current Practicing in Data Warehousing”, I.S. Management, 2001

Page 19: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

19

Current Practice of DW*

DW Benefits Less effort to produce better information Better decisions Improvement of business processes Supporting for accomplishments of

strategic business objectives

* Source: H.J. Watson, “ Current Practicing in Data Warehousing”, I.S. Management, 2001

Page 20: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse

20

Reading:“ The Data Warehouse Toolkit” – Chapter 1