from spark to ignition: fueling your business on real-time analytics

33
Fueling Your Business on Real-Time Analytics Eric Frenkiel, MemSQL CEO June 29, 2015 • San Francisco, CA From Spark to Ignition:

Upload: memsql

Post on 05-Aug-2015

255 views

Category:

Data & Analytics


2 download

TRANSCRIPT

Page 1: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Fueling Your Business on Real-Time AnalyticsEric Frenkiel, MemSQL CEO

June 29, 2015 • San Francisco, CA

From Spark to Ignition:

Page 2: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

What’s in Store For This Presentation?

1. MemSQL: A real-time database for transactions and analytics

2. Spark Use Cases

3. Example: Geospatial Enhancements

Page 3: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

The real-time database for transactions and analytics

MemSQL Story

Page 4: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

MemSQL at a Glance

Experienced leadership from Facebook, SQL Server, Oracle, Fusion-io

In-Memory, distributed, relational database

Solving the Enterprise Architecture Gap

Horizontal scale-out with modern database innovation

$50 million in funding

Page 5: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Four Ways Your DBMS is Holding You Back

ETL (Extract, Transform, Load)

Analytic Latency

Synchronization

Copies of data

Source: Gartner Hybrid/Transactional/Analytical Processing Will Foster Opportunities for Dramatic Business Innovation

Page 6: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

6

The Real-Time Database for Transactions and Analytics 

In-Memory Distributed Relational

Data CenterSoftware Cloud

Page 7: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

The Real-Time Database for Transactions and Analytics

Highest ValueHot Data

High ValueCold Data

Analytics

Tra

nsac

tions

Data Loading and Queries

Aggregator Nodes

AvailabilityGroup 1

AvailabilityGroup 2

Cluster

Page 8: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Gartner Identifies Emerging Category: HTAP (Hybrid Transactional/Analytical Processing)

“HTAP will enable business leaders to perform…much more advanced and sophisticated real-time analysis of their business data than with traditional architectures.”

Download at: memsql.com/gartner

Page 9: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Simple

Standard SQL Transactions and analytics in one database Behind the firewall or on the cloud Flexible integrations (Hadoop, Spark, SQL)

Page 10: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Fast

Extremely low-latency queries Massive parallel transaction capacity Lock-free, shared-nothing architecture

Page 11: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Scalable

Scales out on cloud and commodity hardware Deploys to thousands of machines True linear scaling

Page 12: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

MemSQL Product EcosystemIn-Memory

ApplicationsTransactions and Analytics

Dashboards

HadoopAmazon S3

ODBC, JDBC, .NETConnectors MemSQL

Loader

Advanced Analytics

Wire-protocol compatibility

Databases and Data Warehouses

Streaming

Page 13: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Spark Use Cases

Page 14: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Spark Data Processing Framework

Intuitive, concise, and expressive operations needed for analytics

SparkSQL

SparkStreaming

Mllib(machine learning)

GraphX(graph)

Apache Spark

Page 15: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Cluster-wide Parallelization | Bi-Directional

Understanding MemSQL and Spark

Page 16: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Spark with MemSQL

MemSQL Spark Connector enables the real-time trinity

Message Queue Transformation Data Serving

Programming libraries PersistenceApplication platform

End-to-End Data Pipeline Under One Second

Page 17: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

MemSQL and Spark Use Cases

Operationalize models built in Spark

Stream and event processing

Live dashboards and automated reports

Extend MemSQL analytics

Page 18: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Operationalize Models Built in Spark

Process in Spark, persist to MemSQL Go to production and iterate faster

Enterprise Consumption

Data into Spark

Model Creation Model Persistence

Page 19: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Stream and Event Processing

Structure event data on the fly Pass to MemSQL for persistent, queryable format

Enterprise Consumption

Real-time Streaming Data

Data TransformationPersistent,

Queryable Format

Page 20: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Real-Time Analytics at Pinterest

Higher performance event logging Reliable log transport and storage Faster query execution on real-time data

Enterprise Consumption

Message Queue

KafkaApp

Singer Secor

Spark, MemSQL, Application

TransformRT Analytics, Data Serve

50,000 pins/sec

Page 21: From Spark to Ignition: Fueling Your Business on Real-Time Analytics
Page 22: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Live Dashboards and Automated Reports

Serve live dashboards from MemSQL Run custom reports on live data with Spark

Live Dashboards

Custom Reporting

Access to Live Production Data

SQL Transactions and Analytics

Page 23: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Extend MemSQL Analytics

The freshest data for analysis in Spark Load from MemSQL to Spark and write results on return

Access to Live Production Data

Real-time Replica

Applications,Data Streams

Interactive Analytics,

Machine Learning

Page 24: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

MemCity Capturing energy consumption data from 1.4 million households 8 devices per household 186,000 events per minute AWS hardware costs at $2.35 per hour

Page 25: From Spark to Ignition: Fueling Your Business on Real-Time Analytics
Page 26: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Geospatial Enhancements

Page 27: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Geospatial Challenge

Commercial applications now geo-enabled Location is everywhere Lots of insight possible Traditionally geo is processed separately Real need for integrated geospatial at scale

Page 28: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

MemSQL Geospatial

Points, Lines, and Polygons

Topological filters

Measurement functions

Page 29: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

MemSQL Geospatial

BILLIONS of objects

Sub-second latency

Geo data is first-class citizen

Geo + Simplicity + Speed + Scale

Page 30: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Real-Time Geospatial Location Intelligence

Sample from 170 million taxi trips Real-time ingest Concurrent queries in fractions of a second Unlimited number of geographic views Simple queries while simultaneously ingesting data

Page 31: From Spark to Ignition: Fueling Your Business on Real-Time Analytics
Page 32: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

A database so scalable that everyone can use it.

UNLIMITED scale and capacity Free FOREVER

MemSQL 4 Community Edition

Page 33: From Spark to Ignition: Fueling Your Business on Real-Time Analytics

Thank You!

Visit the MemSQL Booth #4

MemCity Showcase GiveawaysGames

*