modernizing your data warehouse for...

24
Modernizing Your Data Warehouse for Hadoop Christian Coté Big data. Small data. All data.

Upload: others

Post on 25-Apr-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

Modernizing Your Data Warehouse for Hadoop

Christian Coté

Big data. Small data. All data.

Page 2: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

The traditional data warehouse

“…data warehousing has reached the most significant

tipping point since its inception.

The biggest, possibly most elaborate data

management system in IT is changing.”

– Gartner, “The State of Data Warehousing in 2012”

Page 3: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

The traditional data warehouse

Real time data2

Increasing datavolumes

1

Cloud-borndata

4

Increasing datavolumes

1 New data sourcesand types

3

Page 4: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

The modern data warehouse

Page 5: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

Microsoft’s modern data warehouse

Data Platform

PDW

SQL Server 2014

Microsoft Azure HDInsight

Page 6: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data
Page 7: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data
Page 8: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

Scale out technologies

in Parallel Data Warehouse

0TB 6PB

APS /

HDInsight

APS

APS /

HDInsight

APS /

HDInsight

APS /

HDInsight

APS /

HDInsight

APS /

HDInsight

From terabytes to multi-petabytesScale out relational data to petabytes

Page 9: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data
Page 10: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data
Page 11: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

In-memory performanceIn-memory Columnstore for next-generation performance

Columnstore

index representation

Page 12: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

Concurrency and mixed workloadsGreat performance for mixed workloads

Query

Results

Page 13: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data
Page 14: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data
Page 15: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

Data complexity: variety and velocity

Petabytes

What is big data?

Page 16: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

Hadoop Cluster

What is Hadoop?

Hive

Distributed, scalable system on commodity HW

Core Services

Operational services Data services

HDFS

SQOOP

FLUME

NFS

LOAD & EXTRACT

WebHDFS

OOZIE

AMBARI

YARN

MAP REDUCE

HIVE &HCATALOG

PIG

HBASEFALCON

compute

&

storage

. . .

. . .

. . compute

&

storage

.

.

Hadoop clusters provide scale-out storage and distributed data processing on commodity hardware

Page 17: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

Web app

optimization

Smart meter

monitoring

Equipment

monitoring

Advertising

analysis

Life sciences

research

Fraud

detection

Healthcare

outcomesWeather forecasting

Social network

analysis

Churn

analysis

Traffic flow

optimization

IT infrastructure

optimization

Legal

discovery

Natural resource

exploration

Page 18: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

Hadoop offerings on-premise and cloudReal-time with complex event processing

Microsoft Azure

Page 19: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

Architecture

Page 20: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data
Page 21: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

Analyze unstructured data

in Excel

Combine different types of data with Power

Query

Analyze your data with Power Pivot and

Power View and perform analysis

Features and benefits

Page 22: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

Build a cluster in minutes and

tear it down when you’re done

Optimize cluster-size for time to

insight or cost-savings

Features and benefits

Page 23: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data

Try HDInsight at www.windowsazure.com/bigdata

Try SQL Server for data warehousing in Microsoft Azure VMs atwww.windowsazure.com

Try Hortonworks Data Platform for Windows at www. hortonworks.com/products/hdp-windows/

Try SQL Server 2014 CTP1 at http://www.microsoft.com/en-us/sqlserver/sql-server-2014.aspx

Page 24: Modernizing Your Data Warehouse for Hadoopquantlabs.net/academy/download/free_quant_instituitional_books... · Modernizing Your Data Warehouse for Hadoop Christian Coté Big data