multi-dimensional scaling: a new architecture for scaling big data applications: couchbase connect...
TRANSCRIPT
MULTI-DIMENSIONAL SCALING – A NEW ARCHITECTURE FOR SCALING BIG DATA APPLICATIONAnil Kumar, Senior Product ManagerCouchbase
©2015 Couchbase Inc. 2
About Me
Anil KumarSr. Product Manager,
Couchbase
@anilkumar1129
©2015 Couchbase Inc. 3
Next 40 minutes …
Brief history of scaling in database Scaling up Scaling out
NoSQL workloads and scalability model Core data operations, indexing and querying Homogenous HW scaling
Introducing multi-dimensional scaling Isolation of workloads through a services architecture Independent HW scaling
Demo Q & A
Brief History of Scaling in Database
©2015 Couchbase Inc. 5
Scaling up
Scale-up architecture Cluster processors – hyper-threading to cores Locally partition workload among processors Communicate over memory
©2015 Couchbase Inc. 6
Scaling Up – Pros & Cons
Pros Can result in major
performance improvement
Machines can now support having many cores and terabytes of RAM
Cons Expensive Requires downtime Performance bounded –
at some point the database engine itself becomes the bottleneck
Limited in scalability and elasticity
©2015 Couchbase Inc. 7
Scaling out
Scale-out architecture Cluster of commodity HW Horizontal partitioning of data on cluster nodes Communicate over network
©2015 Couchbase Inc. 8
Scaling Out – Pros & Cons
Pros Simple and easily
scalable Data evenly split across
cluster of nodes Scales linearly with
throughput Highly available No single point of failure
Cons Not great for all
workloads –data, index and query
©2015 Couchbase Inc. 9
So Which Model is the Right Model ?
Scale up or scale out?
©2015 Couchbase Inc. 10
Scaling Up vs. Scale OutLink to whitepaper - http://www.msr-waypoint.com/pubs/204499/a20-appuswamy.pdf
NoSQL Workloads & Scalability Model
©2015 Couchbase Inc. 12
NoSQL Workloads
One database, many workloads Core data processing: GETs & SETs for a given key Indexing: Index maintenance and lookups Querying: Combine index and data with complex just-in-
time datare-shaping, ordering, grouping, aggregations, and more
Varying resource requirements - CPU, RAM, I/O, Network
Varying methods to optimize latency & throughput for each
©2015 Couchbase Inc. 13
Scalability Model TodayHomogenous Scaling Each node get a slice of the workload Simple to do…But... Workloads compete and interfere with each other Can’t fine tune each workload
Core Data operation are partition-able so great with wider fan-out Indexing and queries aren’t always partitionable, so worse with wider fan-out
Index Workload
Couchbase Cluster
Query Workload
Data Workload
node1 node8
Introducing Multi-Dimensional Scaling
©2015 Couchbase Inc. 15
Modern Architecture
What is Multi-Dimensional Scalability? MDS is the architecture that enables independent scaling of data, query and indexing workloads.
Index Service
Couchbase Cluster
Query Service
Data Service
node1 node8
©2015 Couchbase Inc. 16Couchbase Cluster
node1 node8
Modern Architecture
Isolated Service for minimized interference Independent “zones” for Query, Index and Data Services
Minimize indexing and query overhead on core key-value operations.
Index ServiceQuery
ServiceData Service
©2015 Couchbase Inc. 17
Modern Architecture Independent Scalability for Best Computational Capacity per
Service
Heavier indexing (index more fields) : scale up index service nodesMore RAM for query processing: scale up query service nodes
Couchbase Cluster
node1 node8 node9
Data Service
Index ServiceQuery
Service
Services ArchitectureData, Index, & Query
©2015 Couchbase Inc. 19
Full Cluster Architecture
STORAGE
Couchbase Server 1
SHARD7
SHARD9
SHARD5
SHARDSHARDSHARD
Managed Cache
Cluster ManagerCluster
Manager
Managed Cache
Storage
Data Service
Index Service
Query Service
STORAGE
Couchbase Server 2
SHARD7
SHARD9
SHARD5
SHARDSHARDSHARD
Managed Cache
Cluster ManagerCluster
Manager
Managed Cache
Storage
Data Service
Index Service
Query Service
STORAGE
Couchbase Server 3
SHARD7
SHARD9
SHARD5
SHARDSHARDSHARD
Managed Cache
Cluster ManagerCluster
Manager
Managed Cache
Storage
Data Service
Index Service
Query Service
STORAGE
Couchbase Server 4
SHARD7
SHARD9
SHARD5
SHARDSHARDSHARD
Managed Cache
Cluster ManagerCluster
Manager
Managed Cache
Storage
Data Service
Index Service
Query Service
STORAGE
Couchbase Server 5
SHARD7
SHARD9
SHARD5
SHARDSHARDSHARD
Managed Cache
Cluster ManagerCluster
Manager
Managed Cache
Storage
Data Service
Index Service
Query Service
STORAGE
Couchbase Server 6
SHARD7
SHARD9
SHARD5
SHARDSHARDSHARD
Managed Cache
Cluster ManagerCluster
Manager
Managed Cache
Storage
Data Service
Index Service
Query Service
©2015 Couchbase Inc. 20
Full Cluster Architecture
STORAGE
Couchbase Server 1
SHARD7
SHARD9
SHARD5
SHARDSHARDSHARD
Managed Cache
Cluster ManagerCluster
Manager
Managed Cache
Storage
Data Service
Index Service
Query Service
STORAGE
Couchbase Server 2
SHARD7
SHARD9
SHARD5
SHARDSHARDSHARD
Managed Cache
Cluster ManagerCluster
Manager
Managed Cache
Storage
Data Service
Index Service
Query Service
STORAGE
Couchbase Server 3
SHARD7
SHARD9
SHARD5
SHARDSHARDSHARD
Managed Cache
Cluster ManagerCluster
Manager
Managed Cache
Storage
Data Service
Index Service
Query Service
STORAGE
Couchbase Server 4
SHARD7
SHARD9
SHARD5
SHARDSHARDSHARD
Managed Cache
Cluster ManagerCluster
Manager
Managed Cache
Storage
Data Service
Index Service
Query Service
STORAGE
Couchbase Server 5
SHARD7
SHARD9
SHARD5
SHARDSHARDSHARD
Managed Cache
Cluster ManagerCluster
Manager
Managed Cache
Storage
Data Service
Index Service
Query Service
STORAGE
Couchbase Server 6
SHARD7
SHARD9
SHARD5
SHARDSHARDSHARD
Managed Cache
Cluster ManagerCluster
Manager
Managed Cache
Storage
Data Service
Index Service
Query Service
©2015 Couchbase Inc. 21Storage Engine
Cluster Manager
Data Service
Projector & Router
New Services in Couchbase Server 4.0
Query ServiceIndex Service
SupervisorIndex maintenance &
Scan coordinator
Index#2
Index#1
Query Processorcbq-engine
Bucket#1
Bucket#2
DCP StreamIndex#4Index#3
...Bucket#2
Bucket#1
1809311211 18901
Managed Cache
Demo !!!
©2015 Couchbase Inc. 23
Recap
MDS enables unprecedented control of scalability with Couchbase Server Separate out competing workloads to independent services Independently scale each service “zone” within the cluster
Couchbase Server with MDS maximizes scalability and performance Improves scale and performance to degrees not possible
with other NoSQL or big-data engines on premise or in the cloud
Improved price/performance and squeezes more performance and throughput for mission-critical systems
Thank you.
Get Started with Couchbase Server 4.0: www.couchbase.com/beta
Get Trained on Couchbase: training.couchbase.com