building data-intensive applications with mapr-db
TRANSCRIPT
Building Data-Intensive Applications with MapR-DB
Today’s Presenters
Mike LeoneSenior Validation and
Data Management Analyst
Ankur DesaiSenior Product
Marketing Manger
Biggest Database Challenges
48% of organizations struggle with managing data
growth and database size
34% of organizations are unable to support databases
in cloud environments
35% of organizations have challenges meeting database performance requirements
Top 5 Attributes and Capabilities When Evaluating and Selecting NoSQL Databases
19%
21%
32%
35%
43%
Scales out efficiently across distributed,parallel nodes
Robust management utilities
Full SQL Compliance
Security
Cloud-based
Advantages of Cloud-based NoSQL Databases
Faster time to deploy
Reduced storage costs
Increased scalability
Improved recoverability
Better availability
More elasticity
More advancedanalytic capabilities
Reduced processing costs
© 2017 by The Enterprise Strategy Group, Inc.
Use of NoSQL Databases
18%plan to use or are interested
in using
78%of organizations
currently use NoSQL
© 2017 MapR TechnologiesMapR Confidential 7
MapR-DB: The Database for Global Data-Intensive Applications
© 2017 MapR TechnologiesMapR Confidential 8
Database Requirements For Data-Intensive Applications
DATABASE
INNOVATION
CUSTOMERS
EXTREME DATA & APPS SCALE
DIVERSE DATA TYPES
IN-PLACE INTELLIGENCE
ALWAYS-ON RELIABILITY
REAL-TIME DATA
© 2017 MapR TechnologiesMapR Confidential 9
MapR-DB: Database for Global Data-Intensive Applications
© 2017 MapR TechnologiesMapR Confidential 10
CONVERGED DATA PLATFORMHigh Availability Real Time Unified Security Multi-tenancy Disaster Recovery Global Namespace
EXISTING ENTERPRISE
APPLICATIONSBATCH & INTERACTIVE
ANALYTICS
INTELLIGENT
APPLICATIONS
ANALYTICS &
ML ENGINES
ON-PREMISE, MULTI-CLOUD, EDGE
IOT & EDGE
CLOUD-SCALE
DATA STORE
OPERATIONAL
DATABASE
GLOBAL EVENT
STREAMS
MapR-DB: Powerful Distributed NoSQL Database in MapR
Platform
© 2017 MapR TechnologiesMapR Confidential 11
Top 10 Reasons Developers Choose MapR-DB
Native JSON Simplicity
with Expressive Queries
Multi-Model Flexibility Strong Consistency
No Data Loss
Extreme High
AvailabilityExtreme Performance
and Effortless Horizontal
Scale
Global Multi-Master
Replication
Optimized Multi-Tenancy
for 1000s of Apps
Integrated Streaming for
Real-Time Data Ingest,
Processing, and
Integration
In-Place SQL and
Advanced Analytics/ML
Robust Security and
Fine Grained Access
Control
© 2017 MapR TechnologiesMapR Confidential 12
Business Benefits of MapR-DB
Personalized User
Experience
Fast Time-to-Market for
New Applications
Self-Service Operational
BI Without ETL/Data
Prep
Global Business
Continuity
Run Many Apps with
Commodity Scale
Across On-Prem and
Cloud
Real-Time Analytics and
ML Driven
NEW REVENUE ENGAGED CUSTOMERS INTELLIGENT PROCESSES
ALWAYS-ON LOWER TCO INSTANT INSIGHTS
© 2017 MapR TechnologiesMapR Confidential 13
Performance + Scale: MapR-DB Maintains SLAs At Scale
• Built into the MapR Converged platform
• Always available on every node
• No separate process to start/stop/monitor
• No extra software to install or understand
• Automatic optimizations – it just works!
• Online admin
Platform integrated/minimal administration
• PBs of data, 1000s of nodes, Trillions of rows,
Millions of tables
• Automatic sharding of data
• Automatic balance and re-balancing of data
• No compaction delays
Scale Without Limits or Compromise
• Immediately consistent, no quorum complexity
• No single point of failure
• Auto re-balancing of data after disk/node failures
• Full data protection and point in time recovery
with snapshots
Strong Consistency by Default and Always
• Mixed workloads support with extreme high
read/write throughput
• Efficient CPU utilization
• Integrated with storage
• Architecture avoids contention.
• In-memory caching
• Flash/SSD optimizations
Best in Class Performance
Enterprise Strategy Group | Getting to the bigger truth.™
ESG Validation:Analyzing the Performance of MapR-DB
About ESG• ESG is an IT analyst, research, and strategy company.
• Our firm was founded in 1999 with headquarters in Milford, MA / an analyst and client relations presence in Silicon Valley, CA.
• ESG conducts research with and for IT vendors, IT professionals, business professionals, and channel partners.
• We maintain ongoing analyst coverage in cloud computing, networking, storage, data protection, cybersecurity, data management and analytics, application development and deployment, enterprise mobility, and channels.
• Capabilities include: Analyst services, market research, technical performance testing, economic validation, consulting, and custom content.
© 2017 by The Enterprise Strategy Group, Inc.
Test Bed
• 2,000,000,000 records(250,000,000/instance)
• 10 fields of 100 bytes each(1K record size)
• 2TB of consumed capacity(256GB/instance)
• Strong Read/Write Consistency(Level 2, Replication 3)
Database
• 8 node EC2 cluster(VPC and single placement group)
• i3.8xlarge instance types(with HVM)
• Enhance Networking Adapters
• Linux 4.11 Linux kerneland CentOS 7.2
Infrastructure
Performance Analysis – Operations/sec
What the numbers mean?
Significant operations/sec average performance benefit • 2.5x more than Cassandra• 5.5x more than HBase
MapR-DB > Cassandra > HBase• Cassandra always beat HBase• MapR-DB always beat Cassandra
by as much as 3.5x
Benefits across all workloads• 10x advantage over Hbase for
50% read/50% insert
© 2017 by The Enterprise Strategy Group, Inc.
Performance Analysis – Latency
What the numbers mean?
MapR-DB delivered predictable and consistently low latency• Minimum of 2.7ms• Never exceeded 3ms
Cassandra delivered predictable, but high latency• 75% higher than MapR-DB
HBase delivered unpredictable and the highest latency• 6.4x higher than MapR-DB
Why Does This Matter?
Speed is not the only thing that matters.
Consistency is essential.
Speed AND consistency enable…
…a predictable end-user experience.
…a reliable, real-time response.
…better, faster insights.
© 2017 MapR TechnologiesMapR Confidential 20
Use cases
© 2017 MapR TechnologiesMapR Confidential 21
MapR-DB Enables Wide Variety of Use Cases
Single View
Real-Time Analytics
IoT
Core Business Applications
List Management
Contextual User
Experience
© 2017 MapR TechnologiesMapR Confidential 22
Why MapR-DB
• Scale - multiple groups, multiple products, several iterations of the tests over
large durations of time
• Schema-flexibility with native JSON (different users need different attributes
to measure designs on)
• Rich high performance queries for analysis on many dimensions from custom
apps
• Integrated streaming to ingest data from across the globe
• Reduced complexity with converged platform
CORE BUSINESS APPOPTIMIZATION OF CHIP DESIGN WORKFLOWS
Semi-conductor chip
evaluation/test runs capture &
iterative analysis throughout chip
development workflow to
continuously improve designs
and reduce costs. Replacement
of an existing NoSQL platform
Large
SEMI &
TELCO
Equipment
Company
© 2017 MapR TechnologiesMapR Confidential 23
Why MapR-DB
Extreme Scale (Several PB in Database) Brings together
• Member Benefits, Demographics, Lab Results, Claims &
History
• Relationship details with providers/payers
• Calls
• Complaints
• Campaigns incentives out of box
High performance & Throughput
Multi-tenant
• 60+ use cases
• Consumption from variety of apps
Schema-flexibility with Native JSON support
Integrated Pub/Sub Streaming for real-time ingest & updates
Integrated analytics/ML
Reduced TCO
MULTI-TENANT DATA PLATFORM AS-A-SERVICE
WITH A SINGLE VIEW
Fortune
10 Health
Care
360 degree view of member population
for payers and providers. Provided as
data services to all core business
applications (IVR, NBA, RX Portals) for
improved customer service, reduced
costs and revenue growth.
© 2017 MapR TechnologiesMapR Confidential 24
IOTPREDICTIVE MAINTENANCE FOR OIL WELLS
Radically improves safety, increases output, and
reduces costs in asset-intensive businesses using
big data and predictive analytics
Why MapR-DB
• Extreme scale: Huge volumes of IoT
sensor data. 100TB/ rig. PBs of total
data
• Extreme ingest rate: over 100M data
points/sec
• Extremely low latency: Faster
responsiveness on analytics and
applications
• Ability to process time-series data
• Advanced machine learning for
predictive maintenance
Fortune
500 Oil &
Gas
Company
© 2017 MapR TechnologiesMapR Confidential 25
REAL-TIME ANALYTICSANALYTICS AS A SERVICE FOR SALES COMPENSATION
Xactly Insights is the first-ever big data platform to provide live incentive
compensation insights. Xactly provides prescriptive analytics and sales
performance insights to drive greater sales success.
Why MapR-DB
• 99.999% uptime: Always-on SaaS application
• Multi-tenancy: Data belonging to thousands of
customers shares same infrastructure
• Enterprise grade security: sensitive data
• In-place analytics: Real-time machine learning
• Extreme scale: Billions of transaction per day
© 2017 MapR TechnologiesMapR Confidential 26
Q&A
ENGAGE WITH US
Contact us at:
855-NOW-MAPR
Or
https://twitter.com/mapr
https://www.linkedin.com/company/mapr-
technologies
Follow us at:
© 2017 MapR TechnologiesMapR Confidential 27
Additional Resources
• Read the MapR-DB Performance Benchmark Report by ESG
• Learn more at our Product Page
• Watch: MapR Customer 360 Promo
• Try MapR at https://mapr.com/download/