acct 6910 building business intelligence systems an introduction to data warehouse
DESCRIPTION
3 Why Data Warehouse Why Database??TRANSCRIPT
![Page 1: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/1.jpg)
Acct 6910
Building Business Intelligence Systems
An Introduction to Data Warehouse
![Page 2: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/2.jpg)
2
Agenda
Why Data Warehouse
What is Data Warehouse
Current practice of data warehouse
![Page 3: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/3.jpg)
3
Why Data Warehouse
Why Database??
![Page 4: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/4.jpg)
4
Why Data WarehouseProblems with current database practices: Problem 1: Isolated databases
distributed in an enterprise
Sales
CRM
Inventory
Sub-problems:• Data Inconsistency• No comprehensive view of enterprise’s data sources – information island
![Page 5: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/5.jpg)
5
Why Data Warehouse Problem 1: Isolated databases
distributed in an enterprise
Sales
CRM
Inventory
Sub-problems:• Data Inconsistency• Performance
![Page 6: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/6.jpg)
6
Why Data Warehouse Problem 2: Historical data is archived in
offline storage systems
Sales
Sub-problems:• Historical data is always needed to support business decisionsArchiv
e
Historical Sales Data
![Page 7: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/7.jpg)
7
Why Data WarehouseCustomer
Places
1
Order
M
Contain
1
OrderLine
M
Order
M
Product
1
Belong to
M
Product Category
1
![Page 8: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/8.jpg)
8
Why Data Warehouse A marketing manager wants to know
sales amount distribution by product category and customer state in July?
Query???
![Page 9: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/9.jpg)
9
Why Data Warehouse Problem 3: Database is designed to
process transactions but not to answer decision support queries
Complex queries
Bad query performance
![Page 10: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/10.jpg)
10
What is Data WarehouseData Warehouse is designed to solve
problems associated with current database practices:
Problem 1: Isolated databases distributed in an enterprise
Sales
CRM
Inventory
Extract, Integrate and Replicate
Data Warehouse
![Page 11: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/11.jpg)
11
Why Data Warehouse Problem 2: Historical data is archived in
offline storage systems
Sales Archiv
e
Historical Sales Data
Data Warehouse
Integrate Historical Data with Current Data
![Page 12: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/12.jpg)
12
What is Data Warehouse Problem 3: Database is designed to process transactions but not to answer
decision support queries
Solution: In data warehouse, organize data in subject –oriented way rather than process-oriented way – dimensional modeling.
![Page 13: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/13.jpg)
13
What is Data Warehouse
Customer
Places
1
Order
M
Contain
1
OrderLine
M
Order
M
Product
1
Belong to
M
Product Category
1
Sales
Product
Time
Customer
ER Modeling
Dimensional Modeling
![Page 14: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/14.jpg)
14
What is Data Warehouse
Data Warehouse is a subject-oriented, integrated, time-variant, non-volatile collection of data in support of management’s decision making process.
1. Subject-oriented means the data warehouse focuses on the high-level entities of business such as sales, products, and customers. This is in contrast to database systems, which deals with processes such as placing an order.
![Page 15: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/15.jpg)
15
What is Data Warehouse
2. Integrated means the data is integrated from distributed data sources and historical data sources and stored in a consistent format.
3. Time-variant means the data associates with a point in time (i.e., semester, fiscal year and pay period)
4. Non-volatile means the data doesn’t change once it gets into the warehouse.
![Page 16: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/16.jpg)
16
What is Data Warehouse Data Warehouse Data Base
Purpose Decision Support Transaction Processing
Data Organization
Subject Oriented Process Oriented
Data Model Dimensional Modeling ER Modeling
Time Span Historical and Current Data
Current Data
Query processing
Scan a substantial subset of data
Scan a small set of data
Operation Read-only (add new data periodically)
Read & Update
![Page 17: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/17.jpg)
17
Current Practice of DW*
Expected DW market value is 2002 will grow to $113.5 billion.
Average DW development cost is $1.5 million and average maintenance cost is $0.5 million.
* Source: H.J. Watson, “ Current Practicing in Data Warehousing”, I.S. Management, 2001
![Page 18: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/18.jpg)
18
Current Practice of DW*
Sponsorship for the DW projectSponsor PercentageVP of a business unit 39.8CIO 26.9Business unit manager 16.7CEO 11.1Other 25.0
* Source: H.J. Watson, “ Current Practicing in Data Warehousing”, I.S. Management, 2001
![Page 19: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/19.jpg)
19
Current Practice of DW*
DW Benefits Less effort to produce better information Better decisions Improvement of business processes Supporting for accomplishments of
strategic business objectives
* Source: H.J. Watson, “ Current Practicing in Data Warehousing”, I.S. Management, 2001
![Page 20: Acct 6910 Building Business Intelligence Systems An Introduction to Data Warehouse](https://reader036.vdocuments.us/reader036/viewer/2022082621/5a4d1b487f8b9ab0599a41ff/html5/thumbnails/20.jpg)
20
Reading:“ The Data Warehouse Toolkit” – Chapter 1