Building an IoT Kafka Pipeline in Under 5 Minutes


Upload: memsql

Post on 12-Apr-2017


Category:

Data & Analytics



TRANSCRIPT

Building an IoT Kafka Pipeline in Under 5 Minutes

@garyorenstein @memsql

MemSQL 1

Step 1

Launch Cluster


Today’s Talk

IoT is big

Kafka is popular

Real-time pipelines need operational data warehouses

How can we do this simply (in under 5 minutes)?


About MemSQL

The Real-Time Data Warehouse for the front lines of your business


Real Time and Operational Go Hand In Hand

• Live applications and embedded analytics

• Continuous data processing (not batch)

• Real-time AND historical data together


Gartner: An Operational Data Warehouse

• Manages structured data

• Loads continuously for embedded analytics in applications

• Supports real-time data warehousing

• Functions as an operational data store

• Query optimization plays a role (new)

• Many queries are repetitive, multiplying effects of optimization


Magic Quadrant for Data Management Solutions for Analytics

February 2017 Highlights

• Disruption accelerating

• Demand to address multiple data types

• Demand for distributed processing and storage

• Cloud gaining traction

Data may include interaction and observational data from Internet of Things (IoT) sensors


MemSQL

#1 Operational Data Warehouse

Critical Capabilities for Data Warehouse and Data Management Solutions for Analytics


A Real-Time IoT Pipeline


All Types of “Things”

Multiple data types

Massive scale with distributed systems

Always-on

Machine learning and predictive analytics


4 Building Blocks of Enterprise IoT


Ingest

• From batch to continuous

• High throughput

• Massively parallel

• Exactly-once semantics
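
One common way to get exactly-once behavior on ingest is to commit the rows and the last consumed offset in the same transaction, so a replayed batch is recognized and skipped. The sketch below illustrates that idea only; it is not MemSQL's implementation. It uses Python's built-in `sqlite3` as a stand-in store, and the table names, the `ingest_batch` helper, and the message shape `(offset, sensor_id, value)` are all assumptions made for illustration.

```python
import sqlite3


def open_store():
    """Create an in-memory stand-in store (hypothetical schema)."""
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE readings (sensor_id TEXT, value REAL)")
    db.execute(
        "CREATE TABLE offsets (part INTEGER PRIMARY KEY, last_offset INTEGER)"
    )
    return db


def ingest_batch(db, part, batch):
    """Load a batch of (offset, sensor_id, value) messages for one partition.

    Messages at or below the stored offset are treated as replays and
    skipped. Rows and the new offset are committed together, so a crash
    cannot leave one without the other.
    """
    row = db.execute(
        "SELECT last_offset FROM offsets WHERE part = ?", (part,)
    ).fetchone()
    last = row[0] if row else -1
    fresh = [m for m in batch if m[0] > last]
    if not fresh:
        return 0
    with db:  # one transaction: commit on success, roll back on error
        db.executemany(
            "INSERT INTO readings VALUES (?, ?)",
            [(sensor_id, value) for _, sensor_id, value in fresh],
        )
        db.execute(
            "INSERT OR REPLACE INTO offsets VALUES (?, ?)",
            (part, max(offset for offset, _, _ in fresh)),
        )
    return len(fresh)
```

Calling `ingest_batch` twice with the same batch loads the rows once; the second call returns 0 because every offset has already been recorded.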


Transform

• In-line, in real-time

• Alter, enrich, score machine learning

• Use existing models with PMML

• Add new models
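
As a rough illustration of the alter, enrich, and score steps above, an in-line transform can be a small function applied to each record before it is persisted. This is a sketch under stated assumptions: the record fields (`sensor_id`, `temp_f`), the `SENSOR_SITES` lookup table, and the hand-written logistic `WEIGHTS` are all hypothetical, and the scoring function merely stands in for a real model such as one exported as PMML.

```python
import json
import math

# Hypothetical lookup table used to enrich each record with a site name.
SENSOR_SITES = {"s1": "plant-a", "s2": "plant-b"}

# Hypothetical logistic-regression weights standing in for a trained model.
WEIGHTS = {"bias": -8.0, "temp_c": 0.1}


def transform(line):
    """Alter, enrich, and score one JSON-encoded sensor reading."""
    rec = json.loads(line)

    # Alter: convert Fahrenheit to Celsius.
    temp_c = (rec["temp_f"] - 32.0) * 5.0 / 9.0

    # Enrich: attach the site from the lookup table.
    site = SENSOR_SITES.get(rec["sensor_id"], "unknown")

    # Score: logistic function over the weighted feature.
    z = WEIGHTS["bias"] + WEIGHTS["temp_c"] * temp_c
    risk = 1.0 / (1.0 + math.exp(-z))

    return {
        "sensor_id": rec["sensor_id"],
        "site": site,
        "temp_c": round(temp_c, 2),
        "risk": round(risk, 4),
    }
```

For example, `transform('{"sensor_id": "s1", "temp_f": 212.0}')` yields a record with `temp_c` of 100.0, the site `plant-a`, and a risk score between 0 and 1.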


Persist

• Full durability and availability

• Fast via memory optimization

• Both real-time and historical data

• Scale with distributed systems

• Robust security


Analyze

• ANSI SQL

• Sophisticated query execution

• Code compilation for fast queries

• Easy business intelligence integrations


Demo

Building an IoT Kafka Pipeline in Under 5 Minutes


Launch Cluster


Database Speed Test


Create Schema and Pipelines


Show Sample Queries


⌚ Time's Up!


Please visit our booth!

Grab a shirt!

MemSQL | Transactions | Analytics | Cloud


Thank You!

@garyorenstein @memsql
