webinar - datastax enterprise 5.1: 3x the operational analytics speed, help for multi-tenant saas...
TRANSCRIPT
DataStax Enterprise 5.1: 3X the operational analytics speed, help for multi-tenant SaaS apps, & other shiny things
Gehrig Kunz, Product MarketingDavid Gilardi, Technical Evangelist
• What is a DataStax Enterprise? • Explore what’s new in:
• DataStax Enterprise 5.1• Core• Analytics• Search• Graph
• *Live demo*• OpsCenter 6.1 • Studio 2.0
Hey. Thanks for joining us.
David@sonicdmg
Gehrig@gehrigkunz(#BearDown)
Powering cloud applicationsPersonalization
Customer 360
Recommendation
Fraud Detection
Inventory Management
Identity Management
Security
Supply Chain
Cloud application characteristics
Real-Time DistributedAlways-OnContextual Scalable
Easy to build, effortless to scale
• DataStax Enterprise 5.1• Core• Analytics• Search• Graph
• DataStax OpsCenter 6.1• DataStax Studio 2.0• DSE Drivers
Easy to build, effortless to scale
• DataStax Enterprise 5.1• Core• Analytics• Search• Graph
• DataStax OpsCenter 6.1• DataStax Studio 2.0• DSE Drivers
Something for everyone.
Let’s dive in
DSE Core
Powered by the best distribution of Apache Cassandra™
• Multi-tenancy made easy• Improvements to Advanced Replication • Continuous paging (DSE Analytics)• Production certified Cassandra 3.11.1
Apache Cassandra, Spark and Solr are trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries.
Best distro of Apache Cassandra™
A look back at 5.0 introductions to DSE Core: • Introduced advanced replication• Tiered storage• Multi-instance
Row Level Access Control (RLAC)
• Part of DSE Advanced Security. • Secures data in tables at the row
level. • Handled via CQL. • Enables multi-tenancy capabilities
on Cassandra tables.
RLAC FTW
CREATE TABLE microservices.shoppingcart_by_tx (timestamp UUID,cart UUID,amount decimal,store text,tx_id UUIDPRIMARY_KEY((store, tx_id))
);
RESTRICT ROWS on microservices.shoppingcart_by_tx USING store;
CREATE ROLE bobcoapp;
GRANT SELECT ON ‘bobco’ IN microservices.shoppingcart_by_tx TO bobcoapp;
Advanced replication advanced-er
• Lower overhead and improved performance. • Hub to Spoke replication; multi-directional replication between
clusters now supported.• Support for multi-datacenter edge clusters.
Advanced replication for retail
Advanced replication for IoT
DSE Analytics
● 3x operational analytics performance improvements● DataStax Enterprise File System (DSEFS)
• HDFS minus the bad• Example use: PDFs and their metadata
● Separate Spark-only cluster• Adding/removing Spark nodes is cheap • Good for scaling up/down for specific needs • Example use: weekly analytics reporting
● Spark UI reachable from any node● Production certified Apache Spark™ 2.0
• Support for SparkR.
What is DSEFS?
• A new file system for DSE that is...• HDFS compatible • Distributed• Fault tolerant with no single point of failure• Scale-out
Benefits of DSEFS over HDFS Benefits of DSEFS over CassandraNo namenode No overhead from compactions or commit log writesNo secondary namenode No data density ramifications No zookeeper No overhead from delete operations; provides
immediate deletesNo single point of failure No JVM overhead for file system data
To the core
Analytics
Table
Scan 1
––– Prior to continuous paging –––
To the core
Analytics
Table
Scan 1
Scan 2
––– Without continuous paging –––
To the core
Analytics
Table
Scan 1
Scan 2
––– With continuous paging –––
The result – 3xR
espo
nse
Tim
e (S
econ
ds)
Test 1 Test 2 Test 3 Test 40
50
100
150
200
250
300
Read performance (Lower is better)
OSS C*/Spark DSE 5.1
DSE Search
• Increased performance for Indexing; new data now made available for search much faster.
• Enhanced CQL support for search index management; makes using search easier than ever.
• Production certified Apache Solr 6.0
CREATE SEARCH INDEX IF NOT EXISTS ON songs WITH COLUMNS id,val1;
DROP SEARCH INDEX ON songs WITH OPTIONS { deleteResources : true };
DSE Graph
The first scale-out, real-time graph database• First on the market to enable out-of-the-box graph search.• Improved graph analytics:
• With Spark GraphFrames, known as ‘DSEGraphFrames’• Introducing Fluent API with 2 flavors:
• Explicit = if you’re coming from DSE drivers• Implicit = if you’re coming from Tinkerpop world
Graph-based Search
Type ahead, fuzzy, and spell-check searches now available on graph data
Easy to build, effortless to scale
• DataStax Enterprise 5.1• Core• Analytics• Search• Graph
• DataStax OpsCenter 6.1• DataStax Studio 2.0• DSE Drivers
OpsCenter 6.1 improvementsLive demo! Oooooooooooo
OpsCenter 6.1 improvements• Full support for DSE 5.1• Updated repair section
• Improved resiliency• Granular control to ignore keyspaces and/or tables• Improved UI to track repair progress
• Enhanced metrics and alerts• Graph• Datacenter latency
• Datacenter level backup and improved support for Amazon S3• Backup and restore times decreased
• Backup/restore SASI indexes• SASI index definitions automatically backed up on tables backed up, index definitions are
rebuilt on restore• Backward compatibility of agents
• OpsCenter can be upgraded ahead of agents from 6.1 moving forward
Updated repairs section!
Improved UI
Enhanced metrics for DSE Graph and Datacenter Latency
Datacenter level backup and improved support for Amazon S3
Backward compatibility of agentsfrom 6.1 moving forward
Show you stuff
Studio 2.0More demos!
Some developer love - Studio● Studio 2.0
○ New graph visuals■Vertex size by property value■Color by label or property value■Shapes & Icons■Improved cluster segregation and reduced overlap
○ Gremlin scan steps not using indexes highlighted in profiler○ CQL Support
■Traditionally “Devcenter” functionality■CQL■Schema views■Tracing■Consistency level settings■Schema aware content assist “intellisense magic”
Vertex size by property value
Color by label or property value
Shapes & Icons
Improved cluster segregation and reduced overlap
Improved cluster segregation and reduced overlap
Show you more stuff
• OpsCenter 6.1• Timestamps in human readable format instead
of epoch• Repair
• Better logging• Added “pause” between repairs to avoid
aggressive cyclic repairs• Lifecycle Manager
• Concurrent inter-cluster jobs
• More hardening
• Schema viewer support for SASI indexes, MVs, UDTs, UDFs, UDAs
Honorable mentions
(1/3)
• Graph• API support for edit distance queries• Type ahead• Spell check• Support for multiple vertices & edges
and commit as a single operation/transaction
• Support for units in Geo.distance• Time and Date types added
Honorable mentions
(2/3)
• Studio 2.0• Syntax highlighting• Code completion• Schema-aware completion• Validations• Snippets• Dockable schema viewer• Detailed view• CQL execution configurations• Support for charts
• DSE Drivers• Unified authentication• Ability to run queries as another user• DateRangeField support in search
Honorable mentions
(3/3)
April 4th!
Release notes –
http://docs.datastax.com/en/dse/5.1/dse-dev/datastax_enterprise/RNdse.html
How to upgrade –
http://docs.datastax.com/en/dse/5.1/upgrade/upgrade/datastax_enterprise/upgdDSE51.html
Graph example data sets –
Things to check out
All downloads available April 4th
academy.datastax.com/slack
https://github.com/datastax/graph-examples
Thank you