building data-intensive applications with mapr-db

27
Building Data-Intensive Applications with MapR-DB

Upload: mapr-data-technologies

Post on 21-Jan-2018

162 views

Category:

Data & Analytics


3 download

TRANSCRIPT

Page 1: Building Data-Intensive Applications with MapR-DB

Building Data-Intensive Applications with MapR-DB

Page 2: Building Data-Intensive Applications with MapR-DB

Today’s Presenters

Mike LeoneSenior Validation and

Data Management Analyst

Ankur DesaiSenior Product

Marketing Manger

Page 3: Building Data-Intensive Applications with MapR-DB

Biggest Database Challenges

48% of organizations struggle with managing data

growth and database size

34% of organizations are unable to support databases

in cloud environments

35% of organizations have challenges meeting database performance requirements

Page 4: Building Data-Intensive Applications with MapR-DB

Top 5 Attributes and Capabilities When Evaluating and Selecting NoSQL Databases

19%

21%

32%

35%

43%

Scales out efficiently across distributed,parallel nodes

Robust management utilities

Full SQL Compliance

Security

Cloud-based

Page 5: Building Data-Intensive Applications with MapR-DB

Advantages of Cloud-based NoSQL Databases

Faster time to deploy

Reduced storage costs

Increased scalability

Improved recoverability

Better availability

More elasticity

More advancedanalytic capabilities

Reduced processing costs

Page 6: Building Data-Intensive Applications with MapR-DB

© 2017 by The Enterprise Strategy Group, Inc.

Use of NoSQL Databases

18%plan to use or are interested

in using

78%of organizations

currently use NoSQL

Page 7: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 7

MapR-DB: The Database for Global Data-Intensive Applications

Page 8: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 8

Database Requirements For Data-Intensive Applications

DATABASE

INNOVATION

CUSTOMERS

EXTREME DATA & APPS SCALE

DIVERSE DATA TYPES

IN-PLACE INTELLIGENCE

ALWAYS-ON RELIABILITY

REAL-TIME DATA

Page 9: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 9

MapR-DB: Database for Global Data-Intensive Applications

Page 10: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 10

CONVERGED DATA PLATFORMHigh Availability Real Time Unified Security Multi-tenancy Disaster Recovery Global Namespace

EXISTING ENTERPRISE

APPLICATIONSBATCH & INTERACTIVE

ANALYTICS

INTELLIGENT

APPLICATIONS

ANALYTICS &

ML ENGINES

ON-PREMISE, MULTI-CLOUD, EDGE

IOT & EDGE

CLOUD-SCALE

DATA STORE

OPERATIONAL

DATABASE

GLOBAL EVENT

STREAMS

MapR-DB: Powerful Distributed NoSQL Database in MapR

Platform

Page 11: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 11

Top 10 Reasons Developers Choose MapR-DB

Native JSON Simplicity

with Expressive Queries

Multi-Model Flexibility Strong Consistency

No Data Loss

Extreme High

AvailabilityExtreme Performance

and Effortless Horizontal

Scale

Global Multi-Master

Replication

Optimized Multi-Tenancy

for 1000s of Apps

Integrated Streaming for

Real-Time Data Ingest,

Processing, and

Integration

In-Place SQL and

Advanced Analytics/ML

Robust Security and

Fine Grained Access

Control

Page 12: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 12

Business Benefits of MapR-DB

Personalized User

Experience

Fast Time-to-Market for

New Applications

Self-Service Operational

BI Without ETL/Data

Prep

Global Business

Continuity

Run Many Apps with

Commodity Scale

Across On-Prem and

Cloud

Real-Time Analytics and

ML Driven

NEW REVENUE ENGAGED CUSTOMERS INTELLIGENT PROCESSES

ALWAYS-ON LOWER TCO INSTANT INSIGHTS

Page 13: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 13

Performance + Scale: MapR-DB Maintains SLAs At Scale

• Built into the MapR Converged platform

• Always available on every node

• No separate process to start/stop/monitor

• No extra software to install or understand

• Automatic optimizations – it just works!

• Online admin

Platform integrated/minimal administration

• PBs of data, 1000s of nodes, Trillions of rows,

Millions of tables

• Automatic sharding of data

• Automatic balance and re-balancing of data

• No compaction delays

Scale Without Limits or Compromise

• Immediately consistent, no quorum complexity

• No single point of failure

• Auto re-balancing of data after disk/node failures

• Full data protection and point in time recovery

with snapshots

Strong Consistency by Default and Always

• Mixed workloads support with extreme high

read/write throughput

• Efficient CPU utilization

• Integrated with storage

• Architecture avoids contention.

• In-memory caching

• Flash/SSD optimizations

Best in Class Performance

Page 14: Building Data-Intensive Applications with MapR-DB

Enterprise Strategy Group | Getting to the bigger truth.™

ESG Validation:Analyzing the Performance of MapR-DB

Page 15: Building Data-Intensive Applications with MapR-DB

About ESG• ESG is an IT analyst, research, and strategy company.

• Our firm was founded in 1999 with headquarters in Milford, MA / an analyst and client relations presence in Silicon Valley, CA.

• ESG conducts research with and for IT vendors, IT professionals, business professionals, and channel partners.

• We maintain ongoing analyst coverage in cloud computing, networking, storage, data protection, cybersecurity, data management and analytics, application development and deployment, enterprise mobility, and channels.

• Capabilities include: Analyst services, market research, technical performance testing, economic validation, consulting, and custom content.

Page 16: Building Data-Intensive Applications with MapR-DB

© 2017 by The Enterprise Strategy Group, Inc.

Test Bed

• 2,000,000,000 records(250,000,000/instance)

• 10 fields of 100 bytes each(1K record size)

• 2TB of consumed capacity(256GB/instance)

• Strong Read/Write Consistency(Level 2, Replication 3)

Database

• 8 node EC2 cluster(VPC and single placement group)

• i3.8xlarge instance types(with HVM)

• Enhance Networking Adapters

• Linux 4.11 Linux kerneland CentOS 7.2

Infrastructure

Page 17: Building Data-Intensive Applications with MapR-DB

Performance Analysis – Operations/sec

What the numbers mean?

Significant operations/sec average performance benefit • 2.5x more than Cassandra• 5.5x more than HBase

MapR-DB > Cassandra > HBase• Cassandra always beat HBase• MapR-DB always beat Cassandra

by as much as 3.5x

Benefits across all workloads• 10x advantage over Hbase for

50% read/50% insert

Page 18: Building Data-Intensive Applications with MapR-DB

© 2017 by The Enterprise Strategy Group, Inc.

Performance Analysis – Latency

What the numbers mean?

MapR-DB delivered predictable and consistently low latency• Minimum of 2.7ms• Never exceeded 3ms

Cassandra delivered predictable, but high latency• 75% higher than MapR-DB

HBase delivered unpredictable and the highest latency• 6.4x higher than MapR-DB

Page 19: Building Data-Intensive Applications with MapR-DB

Why Does This Matter?

Speed is not the only thing that matters.

Consistency is essential.

Speed AND consistency enable…

…a predictable end-user experience.

…a reliable, real-time response.

…better, faster insights.

Page 20: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 20

Use cases

Page 21: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 21

MapR-DB Enables Wide Variety of Use Cases

Single View

Real-Time Analytics

IoT

Core Business Applications

List Management

Contextual User

Experience

Page 22: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 22

Why MapR-DB

• Scale - multiple groups, multiple products, several iterations of the tests over

large durations of time

• Schema-flexibility with native JSON (different users need different attributes

to measure designs on)

• Rich high performance queries for analysis on many dimensions from custom

apps

• Integrated streaming to ingest data from across the globe

• Reduced complexity with converged platform

CORE BUSINESS APPOPTIMIZATION OF CHIP DESIGN WORKFLOWS

Semi-conductor chip

evaluation/test runs capture &

iterative analysis throughout chip

development workflow to

continuously improve designs

and reduce costs. Replacement

of an existing NoSQL platform

Large

SEMI &

TELCO

Equipment

Company

Page 23: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 23

Why MapR-DB

Extreme Scale (Several PB in Database) Brings together

• Member Benefits, Demographics, Lab Results, Claims &

History

• Relationship details with providers/payers

• Calls

• Complaints

• Campaigns incentives out of box

High performance & Throughput

Multi-tenant

• 60+ use cases

• Consumption from variety of apps

Schema-flexibility with Native JSON support

Integrated Pub/Sub Streaming for real-time ingest & updates

Integrated analytics/ML

Reduced TCO

MULTI-TENANT DATA PLATFORM AS-A-SERVICE

WITH A SINGLE VIEW

Fortune

10 Health

Care

360 degree view of member population

for payers and providers. Provided as

data services to all core business

applications (IVR, NBA, RX Portals) for

improved customer service, reduced

costs and revenue growth.

Page 24: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 24

IOTPREDICTIVE MAINTENANCE FOR OIL WELLS

Radically improves safety, increases output, and

reduces costs in asset-intensive businesses using

big data and predictive analytics

Why MapR-DB

• Extreme scale: Huge volumes of IoT

sensor data. 100TB/ rig. PBs of total

data

• Extreme ingest rate: over 100M data

points/sec

• Extremely low latency: Faster

responsiveness on analytics and

applications

• Ability to process time-series data

• Advanced machine learning for

predictive maintenance

Fortune

500 Oil &

Gas

Company

Page 25: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 25

REAL-TIME ANALYTICSANALYTICS AS A SERVICE FOR SALES COMPENSATION

Xactly Insights is the first-ever big data platform to provide live incentive

compensation insights. Xactly provides prescriptive analytics and sales

performance insights to drive greater sales success.

Why MapR-DB

• 99.999% uptime: Always-on SaaS application

• Multi-tenancy: Data belonging to thousands of

customers shares same infrastructure

• Enterprise grade security: sensitive data

• In-place analytics: Real-time machine learning

• Extreme scale: Billions of transaction per day

Page 26: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 26

Q&A

ENGAGE WITH US

Contact us at:

855-NOW-MAPR

Or

[email protected]

https://twitter.com/mapr

https://www.linkedin.com/company/mapr-

technologies

Follow us at:

Page 27: Building Data-Intensive Applications with MapR-DB

© 2017 MapR TechnologiesMapR Confidential 27

Additional Resources

• Read the MapR-DB Performance Benchmark Report by ESG

• Learn more at our Product Page

• Watch: MapR Customer 360 Promo

• Try MapR at https://mapr.com/download/