multi-dimensional scaling: a new architecture for scaling big data applications: couchbase connect...

25
MULTI-DIMENSIONAL SCALING – A NEW ARCHITECTURE FOR SCALING BIG DATA APPLICATION Anil Kumar, Senior Product Manager Couchbase

Upload: couchbase

Post on 16-Aug-2015

248 views

Category:

Technology


0 download

TRANSCRIPT

Page 1: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

MULTI-DIMENSIONAL SCALING – A NEW ARCHITECTURE FOR SCALING BIG DATA APPLICATIONAnil Kumar, Senior Product ManagerCouchbase

Page 2: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 2

About Me

Anil KumarSr. Product Manager,

Couchbase

[email protected]

@anilkumar1129

Page 3: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 3

Next 40 minutes …

Brief history of scaling in database Scaling up Scaling out

NoSQL workloads and scalability model Core data operations, indexing and querying Homogenous HW scaling

Introducing multi-dimensional scaling Isolation of workloads through a services architecture Independent HW scaling

Demo Q & A

Page 4: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

Brief History of Scaling in Database

Page 5: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 5

Scaling up

Scale-up architecture Cluster processors – hyper-threading to cores Locally partition workload among processors Communicate over memory

Page 6: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 6

Scaling Up – Pros & Cons

Pros Can result in major

performance improvement

Machines can now support having many cores and terabytes of RAM

Cons Expensive Requires downtime Performance bounded –

at some point the database engine itself becomes the bottleneck

Limited in scalability and elasticity

Page 7: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 7

Scaling out

Scale-out architecture Cluster of commodity HW Horizontal partitioning of data on cluster nodes Communicate over network

Page 8: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 8

Scaling Out – Pros & Cons

Pros Simple and easily

scalable Data evenly split across

cluster of nodes Scales linearly with

throughput Highly available No single point of failure

Cons Not great for all

workloads –data, index and query

Page 9: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 9

So Which Model is the Right Model ?

Scale up or scale out?

Page 11: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

NoSQL Workloads & Scalability Model

Page 12: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 12

NoSQL Workloads

One database, many workloads Core data processing: GETs & SETs for a given key Indexing: Index maintenance and lookups Querying: Combine index and data with complex just-in-

time datare-shaping, ordering, grouping, aggregations, and more

Varying resource requirements - CPU, RAM, I/O, Network

Varying methods to optimize latency & throughput for each

Page 13: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 13

Scalability Model TodayHomogenous Scaling Each node get a slice of the workload Simple to do…But... Workloads compete and interfere with each other Can’t fine tune each workload

Core Data operation are partition-able so great with wider fan-out Indexing and queries aren’t always partitionable, so worse with wider fan-out

Index Workload

Couchbase Cluster

Query Workload

Data Workload

node1 node8

Page 14: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

Introducing Multi-Dimensional Scaling

Page 15: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 15

Modern Architecture

What is Multi-Dimensional Scalability? MDS is the architecture that enables independent scaling of data, query and indexing workloads.

Index Service

Couchbase Cluster

Query Service

Data Service

node1 node8

Page 16: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 16Couchbase Cluster

node1 node8

Modern Architecture

Isolated Service for minimized interference Independent “zones” for Query, Index and Data Services

Minimize indexing and query overhead on core key-value operations.

Index ServiceQuery

ServiceData Service

Page 17: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 17

Modern Architecture Independent Scalability for Best Computational Capacity per

Service

Heavier indexing (index more fields) : scale up index service nodesMore RAM for query processing: scale up query service nodes

Couchbase Cluster

node1 node8 node9

Data Service

Index ServiceQuery

Service

Page 18: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

Services ArchitectureData, Index, & Query

Page 19: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 19

Full Cluster Architecture

STORAGE

Couchbase Server 1

SHARD7

SHARD9

SHARD5

SHARDSHARDSHARD

Managed Cache

Cluster ManagerCluster

Manager

Managed Cache

Storage

Data Service

Index Service

Query Service

STORAGE

Couchbase Server 2

SHARD7

SHARD9

SHARD5

SHARDSHARDSHARD

Managed Cache

Cluster ManagerCluster

Manager

Managed Cache

Storage

Data Service

Index Service

Query Service

STORAGE

Couchbase Server 3

SHARD7

SHARD9

SHARD5

SHARDSHARDSHARD

Managed Cache

Cluster ManagerCluster

Manager

Managed Cache

Storage

Data Service

Index Service

Query Service

STORAGE

Couchbase Server 4

SHARD7

SHARD9

SHARD5

SHARDSHARDSHARD

Managed Cache

Cluster ManagerCluster

Manager

Managed Cache

Storage

Data Service

Index Service

Query Service

STORAGE

Couchbase Server 5

SHARD7

SHARD9

SHARD5

SHARDSHARDSHARD

Managed Cache

Cluster ManagerCluster

Manager

Managed Cache

Storage

Data Service

Index Service

Query Service

STORAGE

Couchbase Server 6

SHARD7

SHARD9

SHARD5

SHARDSHARDSHARD

Managed Cache

Cluster ManagerCluster

Manager

Managed Cache

Storage

Data Service

Index Service

Query Service

Page 20: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 20

Full Cluster Architecture

STORAGE

Couchbase Server 1

SHARD7

SHARD9

SHARD5

SHARDSHARDSHARD

Managed Cache

Cluster ManagerCluster

Manager

Managed Cache

Storage

Data Service

Index Service

Query Service

STORAGE

Couchbase Server 2

SHARD7

SHARD9

SHARD5

SHARDSHARDSHARD

Managed Cache

Cluster ManagerCluster

Manager

Managed Cache

Storage

Data Service

Index Service

Query Service

STORAGE

Couchbase Server 3

SHARD7

SHARD9

SHARD5

SHARDSHARDSHARD

Managed Cache

Cluster ManagerCluster

Manager

Managed Cache

Storage

Data Service

Index Service

Query Service

STORAGE

Couchbase Server 4

SHARD7

SHARD9

SHARD5

SHARDSHARDSHARD

Managed Cache

Cluster ManagerCluster

Manager

Managed Cache

Storage

Data Service

Index Service

Query Service

STORAGE

Couchbase Server 5

SHARD7

SHARD9

SHARD5

SHARDSHARDSHARD

Managed Cache

Cluster ManagerCluster

Manager

Managed Cache

Storage

Data Service

Index Service

Query Service

STORAGE

Couchbase Server 6

SHARD7

SHARD9

SHARD5

SHARDSHARDSHARD

Managed Cache

Cluster ManagerCluster

Manager

Managed Cache

Storage

Data Service

Index Service

Query Service

Page 21: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 21Storage Engine

Cluster Manager

Data Service

Projector & Router

New Services in Couchbase Server 4.0

Query ServiceIndex Service

SupervisorIndex maintenance &

Scan coordinator

Index#2

Index#1

Query Processorcbq-engine

Bucket#1

Bucket#2

DCP StreamIndex#4Index#3

...Bucket#2

Bucket#1

1809311211 18901

Managed Cache

Page 22: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

Demo !!!

Page 23: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

©2015 Couchbase Inc. 23

Recap

MDS enables unprecedented control of scalability with Couchbase Server Separate out competing workloads to independent services Independently scale each service “zone” within the cluster

Couchbase Server with MDS maximizes scalability and performance Improves scale and performance to degrees not possible

with other NoSQL or big-data engines on premise or in the cloud

Improved price/performance and squeezes more performance and throughput for mission-critical systems

Page 24: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

Thank you.

Page 25: Multi-Dimensional Scaling: A New Architecture for Scaling Big Data Applications: Couchbase Connect 2015

Get Started with Couchbase Server 4.0: www.couchbase.com/beta

Get Trained on Couchbase: training.couchbase.com