azure sql data warehouse - wordpress.com · tack för eran uppmärkasmhet. ... azure sql data...

40
Azure SQL Data Warehouse Andrija Marcic Microsoft

Upload: hoangtuong

Post on 30-Jun-2018

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Azure SQL Data Warehouse

Andrija Marcic

Microsoft

Page 2: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

App Service

Intelligent App

Hadoop

Azure Machine Learning

Power BI

End to end platform built for the cloud

Azure SQL Database

SQL

Azure SQL DataWarehouse

Page 3: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Options:

SQL Server on physical machines

SQL Server in on-premises VM (private

cloud)

SQL Server in Azure VM (public cloud)

Azure SQL Database (public cloud)

SQL Server, Azure VMs, Azure SQL DB

Shared Lower Cost

Dedicated Higher Cost

Higher Administration Lower Administration

Off Premises

Hybrid Cloud

Physical

SQL ServerPhysical Machines(raw Iron)SQL

Virtual

SQL Server Private CloudVirtualized Machines + Appliances

Infrastructure as a service

SQL Server in Azure VMVirtualized machinesSQL

Platform as a service

Software as a services

Azure SQL DatabaseVirtualized Databases

SQL

On Premises

Page 4: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Demo

Page 5: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Azure SQL DB vs Azure SQL DataWarehouse

Page 6: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Azure SQL DB

Page 7: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

SQL Azure Data Warehouse (MPP)

Page 8: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

SQL Server

Page 9: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load
Page 10: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Real-time elasticity

Resize in <1 minute On-demand compute

Expand or reduce

as needed

Page 11: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Data Warehouse Units (DWUs) are a measure of reserved compute performance or ‘power.’ A customer’s DWU needs can vary depending on the needs of their workload.

100 200 300 400 500 600 1000 1200 1500 2000 3000 6000

DATA WAREHOUSE UNITS (DWUS)

Page 12: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

ALTER DATABASE ContosoDW

MODIFY

(service_objective ='DW1000'

)

;

DWU

DW100

DW200

DW300

DW400

DW500

DW600

DW1000

DW1200

DW1500

DW2000

DW3000

DW6000

Page 13: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Data Warehouse Unit (DWU)

Page 14: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Demo

Page 15: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load
Page 16: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load
Page 17: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load
Page 18: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

CREATE DATABASE MyDB COLLATESQL_Latin1_General_CP1_CI_AS

(

EDITION = 'DataWarehouse'

, SERVICE_OBJECTIVE = 'DW400'

, MAXSIZE = 10240 GB

);

ALTER DATABASE MyDBMODIFY (MAXSIZE = 245760 GB);

Page 19: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load
Page 20: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load
Page 21: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

SQL Server Bulk Copy tool, SQL Server Integration Services (SSIS), Informatica, Attunity, Bryte Systems, snapLogic, others

Azure Blob Storage, HDInsight (future), Azure Data Lake Store (future)

Azure Data Factory, Redgate Data Platform Studio

Page 22: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Query unstructured data via PolyBase/T-SQL

PolyBase

Scale out compute

SQL DW instanceHadoop VMs /

Azure Storage/

Azure Datalake

Any data, any size, anywhere

Page 23: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

T-SQL query

Azure SQL DW Hadoop

Quote:

************************

**********************

*********************

**********************

***********************

$658.39

Jim Gray

Name

11/13/58

DOB

WA

State

Ann Smith 04/29/76 ME

Page 24: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load
Page 25: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

32active queries

1024

op

en se

ssions

Page 26: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

1

2

4

8

16

32

64

128

Slo

ts

mediumrc (m) largerc (l) xlargerc (xl) Max Concurrency Slots

Page 27: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

0,0

10,0

20,0

30,0

40,0

50,0

60,0

70,0

80,0

90,0

100,0

mediumrc (m) largerc (l) xlargerc (xl)

% C

on

sum

ed

DW100 DW200 DW300 DW400 DW500 DW600 DW1000 DW1200 DW1500 DW2000

Page 28: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

0

4

8

12

16

20

24

28

32

DW100 DW200 DW300 DW400 DW500 DW600 DW1000 DW1200 DW1500 DW2000

Slo

ts

Concurrency impact

Page 29: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

DWUMax External

ReadersMax Writers

DW100 8 60

DW200 16 60

DW300 24 60

DW400 32 60

DW500 40 60

DW600 48 60

DW1000 80 80

DW1200 96 96

DW1500 120 120

DW2000 160 160

DW3000 240 240

DW6000 480 480

Page 30: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

0

1000

2000

3000

4000

5000

6000

7000

8000

1 2 3 4 5 6 10 12 15 20 30 60

GB/Hour

GB/Hour

Page 31: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

0

5000

10000

15000

20000

25000

DW100 DW200 DW300 DW400 DW500 DW600 DW1000 DW1200 DW1500 DW2000 DW3000 DW6000

Capacity (GB)

Page 32: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

0-4 TBs

4-8 TBs

8-12 TBs

12-16 TBs

16-20 TBs

20-36 TBs

36-48 TBs

48-60 TBs

60-80 TBs

80-160 TBs

>160 TBs

100 200 300 400 500 600 1000 1200 1500 2000 3000 6000

Recommended starting point

Flexibility to select any range of DWUs

Page 33: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

?

Page 34: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Tack för eran uppmärkasmhet

Page 35: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Tack för eran uppmärksamhet&

Glöm inte att fylla i utvärderingen

Page 36: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Azure SQL Data Warehouse Architecture

Control

Node

Compute

Node

Compute

Node

Compute

Node

Compute

Node

SQL DB

SQL DB

SQL DB

SQL DB

Blob storage [WASB(S)]

ComputeScale compute up or down

when required(SLA <= 60 seconds).

Pause, Restart, Stop, Start.

StorageAdd\Load data to WASB(S) without incurring compute

costs

Massively Parallel Processing (MPP) Engine

Azure Infrastructure and Storage

100 DWU < > 2000 DWU

Storage and Compute are de-coupled, enabling a true elastic service and

separate charging for both compute and storage

Application or User connection

HDInsight

Data Loading(SSIS, REST, OLE, ADO, ODBC,

WebHDFS, AZCopy, PS) DMS

DMS DMS DMS DMS

DMS (Data Movement Service) executes across all

database nodes

Page 37: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Azure SQL Data Warehouse – Control Node

Control

Node

Compute

Node

Compute

Node

Compute

Node

Compute

Node

SQL DB

SQL DB

SQL DB

SQL DB

Blob storage [WASB(S)]

Massively Parallel Processing (MPP) Engine

HDInsight

Control

Node

SQL DB

• Endpoint for connections

• Regular SQL endpoint (TCP 1433)

• Persists no user data (metadata only)

• Coordinates compute activity using MPP

Page 38: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Azure SQL Data Warehouse - Compute Nodes

Control

Node

Compute

Node

Compute

Node

Compute

Node

Compute

Node

SQL DB

SQL DB

SQL DB

SQL DB

Blob storage [WASB(S)]

Massively Parallel Processing (MPP) Engine

HDInsight

Compute

Node(s)

Azure SQL Database

SQL DB

An increase of DWU will increase the number of

compute nodes

Page 39: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load

Azure SQL Data Warehouse – Blob storage

Control

Node

Compute

Node

Compute

Node

Compute

Node

Compute

Node

SQL DB

SQL DB

SQL DB

SQL DB

Blob storage [WASB(S)]

Massively Parallel Processing (MPP) Engine

HDInsight

• RA-GRS storage

• +PB’s of storage

• Load data without incurring compute costs

Page 40: Azure SQL Data Warehouse - WordPress.com · Tack för eran uppmärkasmhet. ... Azure SQL Data Warehouse Architecture Control Node Compute Node Compute Node Compute Node ... Add\Load