VMworld 2014: Site Recovery Manager and vSphere Replication
TRANSCRIPT
Site Recovery Manager and vSphere Replication: What’s New Technical Deep Dive
BCO2629
Jeff Hunter, VMware, Inc Ken Werneburg, VMware, Inc
CONFIDENTIAL 2
Disclaimer
• This presentation may contain product features that are currently under development.
• This overview of new technology represents no commitment from VMware to deliver these features in any generally available product.
• Features are subject to change, and must not be included in contracts, purchase orders, or sales agreements of any kind.
• Technical feasibility and market demand will affect final delivery.
• Pricing and packaging for any new technologies or features discussed or presented have not been determined.
Agenda for Today
1 Site Recovery Manager quick recap
2 What’s New with SRM 5.8
3 vSphere Replication (VR) Recap
4 VR Use Cases
5 What’s New with VR
6 Your turn to make noise on the microphone
The Software-Defined Data Center
• Transform storage by aligning it with app demands
• Management tools give way to automation
• Expand virtual compute to all applications
• Virtualize the network for speed and efficiency
Software-Defined Storage and Availability
Bringing the Efficient Operational Model of Virtualization to Storage
• Policy-driven Control Plane: common policy-based management (third-party control planes via API)
• Virtual Data Services: VM centric data services for Data Protection, Cloud DR, and Local Availability (third-party data services via API)
• Virtual Data Plane: abstraction and pooling of infrastructure
– SAN / NAS: SAN/NAS Pool
– x86 Servers: Hypervisor-converged Storage pool
– Cloud Object Storage: Object Storage Pool
Availability Timeline (2008 to 2014+)
• SRM 1.x + ABR
• SRM 4.x + ABR
• SRM 5.0 + ABR, VR 5.0 (SRM Only)
• SRM 5.1 + ABR, VR/SRM & SVR 5.1
• SRM 5.5 + ABR, VR/SRM & SVR 5.5
• SRM 5.8 + ABR, VR/SRM & SVR 5.8
Site Recovery Manager 5.8
Terminology
• Acronyms and terms we will be using
– SRM (Site Recovery Manager)
– ABR (Array Based Replication)
– VR (vSphere Replication)
– VRMS (vSphere Replication Management Server)
– VRS (vSphere Replication Server)
– “SVR” (Standalone vSphere Replication, i.e. NO SRM involved)
vCenter Site Recovery Manager
• What is vCenter Site Recovery Manager (SRM)?
– SRM is the industry-leading disaster recovery automation solution for vSphere environments
• What are the key features?
– Centralized recovery plans for thousands of VMs
– Non-disruptive recovery testing
– Automated DR workflows
– Integrated with the VMware product stack
• What are the key benefits?
– Lowers the cost of DR management by 50% or more
– Eliminates complexity and risk of manual processes
– Enables fast and highly predictable RTOs
– Provides policy-driven DR control for any virtualized app
[Diagram: Production Site and Recovery Site, each with servers running vSphere, vCenter Server, and Site Recovery Manager, linked by array-based replication or vSphere Replication]
* Disaster Avoidance
Typical Use Cases for SRM
Disaster Recovery
• Least frequent but most-critical use case
• Ensure fastest RTO
• Avoid $145,000 per hour of downtime (Forrester)
Planned Migration
• Frequent on-ramp use case for SRM
• Enables data center maintenance and global load balancing
Disaster Avoidance
• Proactive, controlled workflow
• Ensures app-consistency and zero data loss
Recovery Workflows
Failover Automation
• User defined recovery plan
• Minimize errors
Non-disruptive Failover Testing
• Isolated test environment
• Increase confidence in DR process
Planned Migration
• Zero data loss
• Operational migration
Failback Automation
• Re-protect VMs, migrate back
What’s New in SRM 5.8
What’s New in Site Recovery Manager 5.8
DR for the SDDC
• Self-service, policy-based protection – vCAC integration through new vCO plugin
• Software-defined storage for DR – Virtual SAN integration through vSphere Replication
Enhanced Scalability
• 5x scale of protection – up to 5,000 protected VMs per vCenter Server*
• 2x scale of recovery – concurrent recovery of up to 2,000 VMs per vCenter Server*
• Performance enhancement – storage stack improvements reduce RTO
Simplified Operations
• Converged UI with vSphere – vSphere Web Client plugin
• Simplified IP address management – rule-based customization at the subnet level
• Faster installation – embedded database option (vPostgres)
* Scale supported with array-based replication
Self-Service, Policy-Based DR Protection for Apps – And Other Workflows!
(1) Agents for SQL Server, Exchange and SharePoint
Capabilities
• Self-service DR provisioning using vCAC blueprints
• Automated protection mapping according to pre-defined tiers
Architecture
• SRM using array-based replication
• vCAC management across two sites
• Integration via vCO plugin for SRM
• New APIs exposed for PowerCLI integration
Benefits
• DR control delivered as a service to app tenants
• Quicker time to market for apps
• Reduced complexity for infrastructure admins
[Diagram: vCloud Automation Center above a Production Site and a Recovery Site, each running vSphere and Site Recovery Manager on external storage, linked by array-based replication]
vCAC and SRM Workflow
• Load the SRM plugin package into vCO
• Run the workflow to attach SRM post-provisioning to a vCAC Catalog Item (CI)
– Can do this against a ‘template’ CI
• Now, when provisioning that CI, the user is prompted for SRM priority and custom properties (callouts, etc.)
• After deployment, the workflow carries on:
– Finds protection groups for the target datastore
– Adds the VM to the protection group and associated recovery plans
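The post-provisioning steps on this slide can be modeled in a few lines of Python. This is only a sketch over an in-memory model: `ProtectionGroup` and `protect_new_vm` are hypothetical names made up for illustration, and nothing here calls the real vCO plugin or SRM API.

```python
from dataclasses import dataclass, field


@dataclass
class ProtectionGroup:
    """Hypothetical stand-in for an SRM protection group."""
    name: str
    datastores: set          # datastores covered by this group
    recovery_plans: list     # names of recovery plans that include this group
    vms: list = field(default_factory=list)


def protect_new_vm(vm_name, datastore, groups):
    """Add a freshly provisioned VM to every group covering its datastore.

    Returns the sorted names of the recovery plans the VM now belongs to.
    """
    # Step 1: find protection groups for the target datastore.
    matched = [g for g in groups if datastore in g.datastores]
    # Step 2: add the VM to each matching group (idempotent).
    for group in matched:
        if vm_name not in group.vms:
            group.vms.append(vm_name)
    # Step 3: the VM is now covered by the plans associated with those groups.
    return sorted({plan for g in matched for plan in g.recovery_plans})


groups = [
    ProtectionGroup("PG-Tier1", {"ds-gold"}, ["RP-All", "RP-Tier1"]),
    ProtectionGroup("PG-Tier2", {"ds-silver"}, ["RP-All"]),
]
print(protect_new_vm("web01", "ds-gold", groups))
```

A VM placed on a datastore that no protection group covers simply gets an empty list back, which is the case an operator would want to flag.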
vCAC Integration
SRM protection exposed through vCAC Portal
Runs a Standard vCO Workflow after Provisioning
Not Just for vCAC Though…
• The vCO plugin for SRM offers many other workflows as well:
– Create protection groups and add VMs
– Find protection groups by datastore
– Add protection to unprotected VMs in a replicated datastore
– … and almost anything else the SRM API exposes can now be accessed through the vCO plugin.
Enhanced Scalability to Enable Large Enterprise Use Cases
                      SRM 5.5      SRM 5.8
Protection            1,500 VMs    5,000 VMs
Concurrent Recovery   1,000 VMs    2,000 VMs
Performance Improvements
• fsLock global FSS layer semaphore hold time eliminated for faster disk resignature
• rescanAllHba() at VPXA/Hostd and VMKCTL lists SCSI with default skipFsScan set to true
• SRA commands batched through single instance Perl space to minimize discoverarray and other overhead
Just trust me… there are lots of performance improvements
Up to 75% faster RTO
VMware Performance and Scale Testing
• 250 Protection Groups
• 2000 VMs with IP Customization ON

             Total Time    Storage Time
Old Method   29 h          17 h 15 m
New Method   13 h 53 m     4 h 13 m
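As a quick check of how these figures relate to the "up to 75% faster" claim, the reduction can be worked out directly from the table. The helper names below are illustrative only.

```python
def to_minutes(hours, minutes=0):
    """Convert an 'H h M m' duration from the table into minutes."""
    return hours * 60 + minutes


def reduction_percent(old, new):
    """Percentage of time saved going from the old to the new method."""
    return (old - new) / old * 100


total_old, total_new = to_minutes(29), to_minutes(13, 53)
storage_old, storage_new = to_minutes(17, 15), to_minutes(4, 13)

print(f"Total time reduced by {reduction_percent(total_old, total_new):.1f}%")
print(f"Storage time reduced by {reduction_percent(storage_old, storage_new):.1f}%")
```

The storage portion of the recovery shrinks by roughly three quarters, while the end-to-end recovery time drops by a little over half.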
SRM Is Now Even Simpler to Deploy and Manage
• Converged UI with vSphere: new SRM plugin for the vSphere Web Client and enhanced workflows
• Simplified IP address management: rule-based management at the subnet level
• Faster installation: optional embedded vPostgres DB
New SRM Plugin for the Web Interface
No It’s Not Really the MUI
IP Subnet Mapping
Dr-ip-customizer --cfg ..\config\vmware-dr.xml -o c:\example.csv --cmd generate --vc vcenter-recovery
Dr-ip-customizer --cfg ..\config\vmware-dr.xml --csv c:\example.csv --cmd apply --vc vcenter-recovery
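The two commands above work per VM through a CSV file; a subnet-level rule instead describes one mapping for every address in a network. The sketch below illustrates that idea with Python's standard `ipaddress` module, assuming the rule keeps the host portion of the address and swaps only the network portion. `map_to_recovery_subnet` and the example subnets are hypothetical, and this is not SRM's implementation.

```python
import ipaddress


def map_to_recovery_subnet(address, protected_subnet, recovery_subnet):
    """Translate an address by swapping its network part, keeping host bits."""
    src = ipaddress.ip_network(protected_subnet)
    dst = ipaddress.ip_network(recovery_subnet)
    ip = ipaddress.ip_address(address)
    if ip not in src:
        raise ValueError(f"{ip} is not in protected subnet {src}")
    if src.prefixlen != dst.prefixlen:
        raise ValueError("this sketch assumes both subnets have the same prefix length")
    # Offset of the host inside the protected subnet, reapplied at the recovery site.
    host_offset = int(ip) - int(src.network_address)
    return str(dst.network_address + host_offset)


print(map_to_recovery_subnet("10.10.1.25", "10.10.1.0/24", "192.168.50.0/24"))
```

One rule of this kind covers every VM in the subnet, which is why it scales better than maintaining a CSV row per VM.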
VSAN + VR and Site Recovery Manager
• Virtual SAN is compatible with:
– vSphere Replication: SPBM configured as part of replication
– vCenter Site Recovery Manager: SRM configuration based on VR replication
• vSphere Replication & vCenter SRM
– Asynchronous replication: 15 minute RPO
– VM-centric protection
– Provide automated DR operation & orchestration
– Automated failover: execution of user defined plans
– Automated failback: reverse original recovery plan
– Planned migration: ensure zero data loss
– Point-in-Time Recovery: multiple recovery points
– Non-disruptive test: automate test on isolated network
[Diagram: protected site (vCenter Server with VR/SRM, vSphere on VMFS) replicating to a recovery site (vCenter Server with VR/SRM, vSphere + Virtual SAN built from SSDs and hard disks)]
SRM Replication Options
• SRM can utilize BOTH array based AND vSphere Replication
• SRM will “see” existing standalone vSphere Replication protected VMs
• SRM can install vSphere Replication from scratch if needed
[Diagram: multi-tier apps (Web, App, DB) on LUN 1 and LUN 2, protected by storage-based replication and by vSphere Replication]
vSphere Replication 5.8
vSphere Replication
• Per-VM host-based replication integrated with vSphere platform
• Included with vSphere Essentials Plus and higher editions
[Diagram: VMs (OS, App, Data) at Location A on SAN storage and at Location B on Virtual SAN, with vCenter Server]
vSphere Replication
• Easy virtual appliance deployment
• Integration with vSphere Web Client
• Protect any VM regardless of OS and apps
• Flexible recovery point objective (RPO) policies
• Quick recovery for individual VMs
• Replication engine for Site Recovery Manager (SRM)
• Compatible with SAN, NAS, local, and VSAN storage
• Replicate workloads to vCenter Server and vCloud Air
vSphere Replication Use Cases
• Data protection and disaster recovery
• Data center migration
• Replication engine for SRM
• Stand alone replication
• Within the same site
• Across sites - vCenter Server and vCloud Air
[Diagram: replication between data centers and from a data center to vCloud Air]
Replication to vCenter Server and vCloud Air
Replication to vCloud Air
vSphere Replication Components
• vSphere Replication Agent (VRA) built into vSphere
• vSphere Replication Management Server (VRMS)
• vSphere Replication Server (VRS)
• vCenter Server and vSphere Web Client
[Diagram: a source VM (OS, APP, DATA) replicated to a target VM (OS, APP, DATA) through an appliance labeled “Management and VRS”]
vSphere Replication Limitations
• VM must be powered on to replicate
• RPO 15 minutes to 24 hours
• Up to 10 appliances per vCenter Server
– One VRMS (required), up to nine VRS (optional)
• Supported max of 500 replicated VMs per vCenter Server
• VMs protected by vSphere FT not supported
• Physical Raw Device Mapping (RDM) not supported
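The numeric limits on this slide lend themselves to a simple pre-flight check when planning a deployment. The sketch below encodes only the figures stated above; `check_replication_plan` and its parameters are hypothetical and do not correspond to any VMware tool.

```python
MIN_RPO_MINUTES = 15
MAX_RPO_MINUTES = 24 * 60
MAX_REPLICATED_VMS = 500   # per vCenter Server
MAX_APPLIANCES = 10        # one VRMS (required) plus up to nine VRS


def check_replication_plan(rpo_minutes, replicated_vms, additional_vrs=0):
    """Return a list of limit violations; an empty list means the plan fits."""
    problems = []
    if not MIN_RPO_MINUTES <= rpo_minutes <= MAX_RPO_MINUTES:
        problems.append(f"RPO {rpo_minutes} min is outside 15 min to 24 h")
    if replicated_vms > MAX_REPLICATED_VMS:
        problems.append(f"{replicated_vms} replicated VMs exceeds {MAX_REPLICATED_VMS}")
    if 1 + additional_vrs > MAX_APPLIANCES:
        problems.append(f"{1 + additional_vrs} appliances exceeds {MAX_APPLIANCES}")
    return problems


print(check_replication_plan(rpo_minutes=60, replicated_vms=200, additional_vrs=2))
print(check_replication_plan(rpo_minutes=5, replicated_vms=600, additional_vrs=10))
```

The non-numeric limitations (powered-off VMs, FT-protected VMs, physical RDMs) would need inventory data to check and are left out of this sketch.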
vSphere Replication Disk Consistency
• Consistency across the disks within a VM: Yes
• Consistency across VMs: No
[Diagram: source VMDKs replicated to target VMDKs for VMs with OS, APP, and DATA disks]
vSphere Replication Application Consistency
vSphere Replication Reporting
vSphere Replication MPIT Recovery
• Multiple point in time (MPIT) recovery - up to 24 recovery points
– Examples:
• 4 recovery points per day, last 6 days
• 3 recovery points per day, last 5 days
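The two examples can be checked against the 24-point ceiling with simple arithmetic. The sketch assumes each example means "N recovery points per day, kept for the last M days"; `total_recovery_points` is an illustrative helper, not part of any VMware API.

```python
MAX_RECOVERY_POINTS = 24


def total_recovery_points(per_day, days):
    """Number of recovery points a retention policy keeps, or raise if over the cap."""
    if per_day < 1 or days < 1:
        raise ValueError("per_day and days must both be at least 1")
    total = per_day * days
    if total > MAX_RECOVERY_POINTS:
        raise ValueError(f"{total} recovery points exceeds the maximum of {MAX_RECOVERY_POINTS}")
    return total


print(total_recovery_points(4, 6))  # first example on the slide
print(total_recovery_points(3, 5))  # second example on the slide
```

The first example sits exactly at the limit; asking for 5 points per day over 5 days would be rejected.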
vSphere Replication MPIT Recovery
• Recovered as VM with snapshots
– vSphere Replication recovers latest replica
– Use Snapshot Manager to roll back to recovery point
vSphere Replication Recommendations
• Set RPO to longest acceptable value
– 15-minute RPOs: Just because you can, does not mean you should
• Leave VSS quiescing disabled unless really necessary
– Majority of apps recover well from crash-consistent copy
• Configure minimum acceptable number of MPIT recovery points
– Fewer recovery points = faster recovery time, less storage consumption
vSphere Replication Resources
• vSphere Replication Calculator
• Documentation
• Overview white paper
• Frequently Asked Questions
• Capacity Planning Appliance (Fling)
• Blogs
What Have We Covered?
• Site Recovery Manager 5.8
– vCAC integration via VC Orchestrator
– New web based user interface
– Scale and Performance improvement
– IP Subnet Mapping
– vPostgres Database
– VSAN Integration
• vSphere Replication
– Recap
– Use cases and architecture
– Reporting
– Recommendations
– Resources
Supplemental Slides
vSphere Replication Simplifies DR for vSphere
• What is vSphere Replication (VR)?
– VR is VMware’s proprietary technology for hypervisor-based replication of vSphere virtual machines
• What are the key features?
– Asynchronous replication with customizable RPO
– VM-centric management, storage-agnostic
– Multiple point-in-time recovery (MPIT)
• What are the key benefits?
– Simplifies replication of virtual machines
– Eliminates storage hardware lock-in
– Integrated with the VMware product stack
– Included with vSphere at no additional cost
[Diagram: vSphere Replication from Site A (Primary) to Site B (Recovery), each running vSphere]
Workloads NOT Suitable for vSphere Replication?
Shared inter-site components
• AD, DNS, LDAP
• Typically shared across sites
vCenter Server
• Official solution, “Heartbeat”
• Could be done, unsupported
High Churn & VSS
• Usually = aggressive RPO
• VSS Quiescence time overhead
Inter-VM write order fidelity
• Array Based Consistency Groups
Zero RPO
• VR Range is 15min-24hrs
• Use array replication < 15min
Unsupported
• Check release notes!
• pRDMs, multi-writer VMDKs etc.
What Use Is vSphere Replication without SRM?
• Disaster Recovery for smaller environments
• Individual VM recovery
• Local & Intra site replication implementations
• Location migration requirements
[Diagram: an ESXi host (NFC, VRA) and a VR Appliance replicating VMDK3 to a replica (VMDK3) on storage]
Ok… So What Do We Miss out on if We Decide Not to Use SRM?
Failover / Migration Workflow Automation
Non-disruptive Failover Testing
Customizable Orchestration
Built-in Reporting / Alerting & Customization
vSphere Replication Appliance Increase - Advantage for ROBO
[Diagram: one vCenter Server with multiple VR Appliances; ESXi hosts (NFC, VRA) at Branch 1, Branch 2, and the main site replicate VMDK1, VMDK2, and VMDK3 to replicas (VMDK1), (VMDK2), and (VMDK3)]
Multi Point-In-Time Recovery (MPIT)
• Retention of multiple points in time allows reversion to earlier known good states
[Diagram: the VRA on an ESXi host replicates the protected VM to a current replica plus earlier instances N - 1, N - 2, and N - 3]
MPIT Configuration
Retention policy is specified during configuration of replication
MPIT Presented as VM Snapshots after Failover
Use the snapshot manager to revert to earlier points
MPIT Details
• Retention “slots” differ from “replication instances”
• Example:
– RPO = 4 Hours
– Retention Policy = 3 instances for past 1 day(s)
– Slots start at 0:00, 8:00, and 16:00
– Instances: 23:57, 3:44, 12:38, 16:34, 20:28
• Most recent complete instance is *always* preserved. Most recent *might* be the second instance in the slot. (Ensures you can always fail over to the most recent copy)
[Timeline diagram: marks the instance that expires and the latest instance]
MPIT Details
• The oldest instance in any given retention slot is preserved, as is the most recent replication
• Example:
– RPO = 4 Hours
– Retention Policy = 3 instances for past 1 day(s)
– Slots start at 0:00, 8:00, and 16:00
– Instances: 3:44, 12:38, 16:34, 20:28, 0:28
[Timeline diagram: marks the instance that expires and the latest instance]
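The retention rule on these two slides (keep the oldest instance in each slot, plus the most recent instance overall) can be sketched as below. This is a simplified model with fixed slots inside a fixed one-day window, written for illustration; the function name, the example date, and the windowing are assumptions, not vSphere Replication's actual algorithm.

```python
from datetime import datetime, timedelta


def retained_instances(instances, window_start, slot_count, slot_length):
    """Keep the oldest instance per slot, plus the most recent instance overall."""
    window_end = window_start + slot_count * slot_length
    oldest_in_slot = {}
    for ts in sorted(instances):
        if window_start <= ts < window_end:
            slot = (ts - window_start) // slot_length  # integer slot index
            oldest_in_slot.setdefault(slot, ts)        # first seen is the oldest
    kept = set(oldest_in_slot.values())
    if instances:
        kept.add(max(instances))  # latest copy is always preserved
    return sorted(kept)


midnight = datetime(2014, 8, 25)  # arbitrary example date
instances = [
    midnight - timedelta(minutes=3),              # 23:57 on the previous day
    midnight + timedelta(hours=3, minutes=44),
    midnight + timedelta(hours=12, minutes=38),
    midnight + timedelta(hours=16, minutes=34),
    midnight + timedelta(hours=20, minutes=28),
]
kept = retained_instances(instances, midnight, slot_count=3,
                          slot_length=timedelta(hours=8))
print([ts.strftime("%H:%M") for ts in kept])
# -> ['03:44', '12:38', '16:34', '20:28']
```

In this run the 23:57 instance falls outside the window and expires, while 20:28 survives as the second instance in the 16:00 slot only because it is the latest copy.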
vSphere Replication
• Included with vSphere Essentials Plus and higher
• Per VM, async replication
• Replication at the VMDK level
• Fully integrated with vCenter / ESXi 5.x
• Use cases with AND without SRM
vSphere Replication Appliance
• Single Appliance to download
– Same appliance used in SRM and non-SRM deployments
– Initial instance performs two roles
• Role 1 - vSphere Replication Management Server (VRMS)
– Configuration management
– Only one appliance per site performs this role
– Warning if you attempt to configure more than one per site
• Role 2 - vSphere Replication Server (VRS)
– Manages replica instances
– One or more per site (up to 10 max)
Simple Deployment
Deploy and configure VR components
Pair components with vCenter
Configure VM for replication*
* Will need to define RPO, Target Datastore, Target Folder or Resource pool
Configuring vSphere Replication
VR replication is configured per VM in vCenter
Selectable RPO from 15 min up to 24 hours
Selectable destination datastore (per virtual disk)
Four Steps for Full Recovery
Right-click, select “Recover”
Select a target folder
Select a target resource
Click Finish
Will validate your choices as you go
vSphere Replication Use Cases
vSphere Replication Standalone
• Local site replica
– Single site copy
– Works with single vCenter
• Replication between sites
– Each site has its own vCenter
– Replicate in either direction
• Remote office / Branch office
– Support for single vCenter
– Support for single VR appliance
Local Site Replica
[Diagram: a single vCenter Server and VR Appliance; three ESXi hosts (each with NFC and VRA); VMDK1 on one datastore replicated to a replica (VMDK1) on another datastore]
Replication Across Sites
[Diagram: two sites, each with its own vCenter Server, VR Appliance, and three ESXi hosts (each with NFC and VRA); VMDK1 at one site replicated to a replica (VMDK1) on storage at the other site]
Remote Office Branch Office (ROBO)
[Diagram: a single vCenter Server and VR Appliance; ESXi hosts (each with NFC and VRA) at Branch 1 and Branch 2 hold VMDK1, VMDK2, and VMDK3, replicated to replicas (VMDK1), (VMDK2), and (VMDK3) on central storage]
Why Is vSphere Replication Not Using All Available Bandwidth?
Built for “Full” host scheduling
• VM replication optimized per VMDK
• Current implementation scales effectively, many VMs, many VMDKs
• Not ideal for single high churn use case, stay tuned
Thank You
Fill out a survey
Every completed survey is entered into a drawing for a $25 VMware company store gift certificate