private cloud: database-as-a-service with enterprise manager : oow 16

32
Hybrid DBaaS with Oracle Enterprise Manager: Comcast's 7 Goals for the Cloud September 20, 2016 Bala Kuchibhotla, Vice President Engineering, Oracle Jayson Hurd – Database Platform Architect, Comcast Tejas Gohil – DBA Manager, Comcast

Upload: bala-kuchibohtla

Post on 21-Mar-2017

195 views

Category:

Technology


4 download

TRANSCRIPT

DBaaS Comcast OOW16

Hybrid DBaaS with Oracle Enterprise Manager: Comcast's 7 Goals for the CloudSeptember 20, 2016

Bala Kuchibhotla, Vice President Engineering, OracleJayson Hurd Database Platform Architect, ComcastTejas Gohil DBA Manager, Comcast

1

AgendaPresentation title (optional)2

Superior Service with DBaaSComcast Company & Department Vision &futureCurrent State and HistoryDirection, Solutions & Architecture

2

AgendaPresentation title (optional)3

Superior Service with DBaaSComcast Company & Department Vision &futureCurrent State and HistoryDirection, Solutions & Architecture

3

Complete Cloud Control

Optimized, Efficient | |Integrated Cloud & On-premise Stack ManagementAgile, AutomatedComplete Cloud & On-premise Lifecycle Management

Scalable, SecureSuperior Enterprise-Grade Management

Copyright 2016, Oracle and/or its affiliates. All rights reserved. |

4

The Communication Hub of Oracle

Single pane of glass for hardware and software management across cloud & on-premiseCentralized communication & collaboration for diagnostics and troubleshootingDesigned to share critical information while maintaining sandboxesIntegrated named credentials & auditing

5

Copyright 2016, Oracle and/or its affiliates. All rights reserved. |

5

Database as a Service: Agility in DevOpsGive Everyone What They Want Most

DBaaS

Finance

Sales

Manufacturing

Marketing

HRUsers WantEasy self-service Access to data storePerformance optimized for service levels

IT WantsAgility to spin up new instances & provision dataSimplified deployment on standardized platformsLeverage resource optimization techniques like storage snapshotting to save on CAPEX.Less maintenance and better support.

Copyright 2016, Oracle and/or its affiliates. All rights reserved. |

6

IntegrationAPI integration with 3rd party tools , change requests, dev workflows

EfficiencyConsolidated, shared platform maintained with automation

ElasticityAgile scale up, scale down, relocation of services

AccessibilityAutomated request channel via self-service Databaseas-a-Service

AccountabilityMeter usage, quota, compliance and governance

Oracle Database as a ServiceFully functional Database with any Data on any infrastructure

Dedicated Database

Pluggable Database

Database Schema

DatabaseDataInfrastructure

Full Clone

EmptySnap clones

ORORPhysical, Virtual, or Engineered Systems(Hybrid Cloud)On Premise & On Oracle Cloud

Copyright 2016, Oracle and/or its affiliates. All rights reserved. |

7

Agenda

Superior Service with DBaaSComcast Company & Department Vision &futureCurrent State and HistoryDirection, Solutions & ArchitecturePresentation title (optional)8

8

Im sure most of you know Comcast as another Cable company like the others. And that is true, to a point. In many ways we are 9

Vision: Company Strategy22M subscribers$74B revenueNBC UniversalContent/ChannelsComcast ventures

Comcast BusinessXfinity InternetXfnity TVX1 On Demand/PPVXfinity VoiceXfnity home securityNationwide hotspots

Competition is Google & AmazonTechnology focusSilicon Valley, Denver, Philadephia Product supplier

Presentation title (optional)10Spirit of InnovationQuality of Service Delivery

CompanyProductsInnovation

10

Vision: 7 Principles of Database Management

Automated

Self-Serviced

Controlled Delegation

MTTR Reduction

Agile Provisioning

Common Interface

ConsolidatedPresentation title (optional)11

So when we pulled ourselves up to 30000 feet and let the shackles go from our ankles, we narrowed our scope, our mission statement, down to 7 principles.

This is the title of our presentation because it is the very philosophy that drives what we are doing.

We need to be absolutely as automated as possible. Everyone says this, but it is really true. We cannot manage to scale with manual processes. Manual processes are both time consuming and prone to error. Therefore we must commit to develop rather than do so much. It is very easy to start doing instead of thinking how to automate for repetition. This requires commitment to change the very way we think as engineers and operations people.Our environment needs to be come self service. This has many drivers. In our organization we have devops as you may know from your organization. Our job is to provide a shared platform, a cloud, that can be consumed by developers and application support. And the quicker they can consume that, the better. That means cutting out ticketing procedures, tying duties to roles in Oracle Enterprise Manager. That means everything from provisioning, to patching, to maintenance must be self service. It can be limited to our DBAs to use as well, but it needs to be fast and self-service. Getting rid of the cycle of pain, for ticketing and hands offs, is the goal. The more we build in process and sticky procedures to getting things done, the more reviled we become. The more reviled we become, the more we cost the organization, and the more we slow down progress and continuous delivery.Controlled delegation I hinted at this in the previous point. We should be able to delegate the automated and self service solutions to either devops or other DBAs. Our efforts in engineering and architecture should be enabling them and giving the right duties to the right people, packaged, tested and ready to go against predictable and stable platforms.MTTR reduction all this must have the effect of reducing MTTR. That means we provision consistently, we patch regularly and predictably and we resolve issues much faster as a result. MTTR reduction is our key charter for 2016 and beyond from our CIO. This platform must, and will meet that demand admirably.

Agile provisioning We cannot be taking days, weeks or months to provision services. Anything from small development environments to full blown hardware RAC/Dataguard needs to be provisioned in hours. This is a must. We can no longer be a roadblock to our organization.We must have a common interface to all of this. This is obviously achieved with OEM. OEM needs to be a central point, a tier 1 application in its own right, that is the center of all database operations, provisioning, etc. This extends to mean that most back end duties are also done through OEM. We cannot have random manual work being done on a large scale as this risks the stability and viability of our platform.Finally, we must consolidate. 12c multitenant and resource manager bring us past the aforementioned either-or choice between risky schema consolidation and explosions of hardware and oracle homes. Oracle has truly enabled cloud management for us to meet this demand.11

AgendaPresentation title (optional)12

Superior Service with DBaaSComcast Company & Department Vision &futureCurrent State and HistoryDirection, Solutions & Architecture

12

Current State:ChallengesComcast - Oracle OpenWorld 201613

13

Presentation title (optional)142000-2009: Big IronEasy to ManageShared codeConsolidated

High riskResource controlsLess modular

BulkheadedRedundantScaled

UnmanageableExpensive, wastefulSilo-driven

2009-2015: Explosion

History

Lets go backwards a few years, again to understand our history and how we have arrived where we are, and why we are charting our innovative future.

2000-2009 think Big Iron. This was yesteryears way to manage a lot of applications in a smaller infrastructure footprint. Most of our application groups were schemas in large 9i and 10g databases running on very powerful Sun hardware. While this did offer capacity and simplicity of management, it did mean we had too many eggs in one basket. When one application sneezed, many others caught a cold. This meant some very long nights and high pressure conference calls.

It also meant that brining in new applications and code became very risky ventures the antithesis of our modern day continuous delivery. Being so inflexible meant we could not innovate.

And so the pendulum swung the other way14

Current State:Oracle Footprint

Presentation title (optional)15

IT Department - APS

Provisioning ServicesField AutomationOracle across the organization700+ applications supported

Oracle Footprint

20000 managed targets2200+ databases1300 virtual machines1800 hosts

Within Comcast we belong to the largest IT branch known as APS - Application Platform Services. Our division handles all provisioning and field services application. Hence we are the most customer facing of all IT departments. Our Oracle services, however, services the entire Comcast organization, or roughly 80-90% of all Oracle databases at Comcast.

This accounts in terms of our OEM targets, 20000 managed targets, 2000+ databases including 1300 virtual machines and 1800 hosts.

Our environment is almost entirely Linux x86 RAC with Dataguard, and most applications are 3-node RAC with a corresponding dataguard. Very little, if any, is currently consolidated. 90%+ of the environment is 11.2, with our preferred version of 11.2.0.4.

Of course, all this is quickly becoming legacy15

AgendaPresentation title (optional)16

Superior Service with DBaaSComcast Company & Department Vision &futureCurrent State and HistoryDirection, Solutions & Architecture

16

Direction

Comcast - Oracle OpenWorld 201617

17

Solutions

Comcast - Oracle OpenWorld 201618

18

Presentation title (optional)19

Self Service DBaaS PortalOracle OEM Cloud Pack PortalProvisioningCDB/PDB creationCloning OEM Service Catalog Active DirectoryPatching and UpgradesConfiguration ChangesTroubleshootingPerformance Tuning Metrics Reporting

OEM Automation & Monitoring3-Tier DeploymentMultitenant

Pluggable DB

Maximum DensityLower EnvironmentsSilverDedicated DB

RAC/DataGuard

Resource Controls

Highly AvailableProduction SharedGoldDedicated HW

RAC/DataGuard

RAC/Golden GateDedicated ProductionGold/PlatinumDensityPerformanceSelf Service L2 & DevOpsUsage TrackingHigh AvailabilityFault ToleranceArchitecture

From these principles we have developed a master framework for managing Oracle through Enterprise Manager and the Cloud Pack.

As I mentioned, and from the top. A common interface, tied into our LDAP Active Directly, accesses through the cloud pack portal. All routine duties for provisioning and patching etc. are done through a controlled service catalog no manual work. All troubleshooting and performance metrics are done through the OEM interface or fed to other internal interfaces through the API a central point source-of-truth for all Oracle related operations.

Underlying this is a 3-tier deployment model, a truly efficient balance between density and performance, meeting the demands of our customers while also not creating silos of resources that cost us millions.

All VMs are converted to multitenant self-service pluggable databases sitting on performant 3-node RAC. All production that is not of heavy utilization will be a CDB not PDB due to the Dataguard requirements of being able to fail over individual applications and all will be RAC/DataGuard. Effectively, you should be choosing dedicated hardware due to RTO/RPO, or business requirements. All our shared environments offer the highest level of availability. You should be selecting dedicated vs. cloud/shared hardware based on your load and that is it. As we are able to consolidate, we have resource manager on our CDBs to cage resources and guarantee service levels.Finally for high load applications we can still use dedicated hardware. We can stage hardware in datacenters throughout our footpring so as to deliver to the customer in a matter of hours.

As all this is tied together we can track resource utilization by active directory group, cloud zones and pools. This means we can tell our senior management and accounting people, who is using what and where. And of course, they can self-service this data for themselves!

19

Build and Migration

Comcast - Oracle OpenWorld 201620

Migrating to the cloud how?

Let me summarize

First we must build out a new architecture framework.

The framework must be easy to stand up and then built gradually in order to minimize up front hardware costs.

Then we have to migrate

Target is to migrate 1300 VMs to our Oracle private cloud for all our lower environments dev/qa to between 9 and 18 hosts (3 6 clusters)Create criteria for migrating lesser used production DBs to shared environments (1:10 consolidation ratio) since we were 100% RAC/DataGuard for nearly every schema (many applications had multiple rac/dataguard cluster combinations. This makes my our results look fantastic! Easy pickings to densifyEVERYTHING to be managed by DBaaS including platforms with dedicated hardware (zones)

20

Implementation ApproachEasy Entry

All this theory is great, and Im sure the architects and directors in the room have enjoyed this. But our job is to then take this and implement into reality.

Again, as I said earlier, this may not be as hard as you think.

We build a framework. We were able to build a framework, from planning to readiness, in under 5 months. Now, our OEM footprint had already been built out and needed only some modifications to meet this. But keep in mind, the duty is to build out a framework first, then move to the model as your time and budget fits.Next we must engineer standards so that as the proverbial bathtub is filled, things just gradually fall into line over the coming yearsResource management and subscription limits on shared clusters must be strictly enforced in order to maintain the integrity of the systemService catalog items using OEM must be the only point of provisioning, patching and other routine maintenance. Anything outside of this threatens the ease of use and stability of the cloudThings like naming conventions and clear/concise end user documentation must be available. The key is that end users do not need to come to DBAs to request that things be done. As confusion is encountered with devops engineers, documentation and communications must be tightened and enhancedHardware zones must be built out strategically. Comcast has 5 major datacenters with various network zones (firewall areas). We had to strategically plan the rollout and migrations while at the same time accommodating net new buildouts. This is difficult to do without spending $millions up front in hardware. Think of just in time inventory and supply chain management that is what we are effectively doing.Build a clear and concise user intake process. Users must easily obtain access to the self service platform21

Solution:Implementation ApproachOEM Level 4 HA/DR OEM ArchitecturePresentation title (optional)22Oracle Enterprise ManagerCloud PackLifecycle ManagementPublic Cloud Gateway

There were two major parts to our framework buildout:

Making OEM into the tier 1 platform it has to be to support our 700 applications, operations and BAU people.Underlying hardware footprint architecture and buildout.

Let me first discuss the Level 4 HA DR architecture here. OEM was put in two datacenters, 3 servers each. Underlying that is RAC/dataguard 12c across the same datacenters.

Then Active Directory was enabled so that we can leverage AD groups for role based access to OEM. This means that application teams control who gets access once we assign the groups. More self service!

We added the cloud pack and lifecycle management to our oracle licensed product portfolio to manage the environment and provide self service to our customers.

This architecture is the highest level of OEM architecture as prescribed by Oracle. Leverage their documentation and sales/service expertise for help here.22

Presentation title (optional)23

65 TB Storage SavingsMax CPU/Memory90%+ Dev Host Count Reduction10%+ Prod Host Count Reduction

Architecture:Initial Hardware Zones12c Multitenant

Talking about the three consolidation models I laid out in the high level architecture..

Our lower environments will likely see the greatest benefit because we are able to use multitenant to its fullest extent.

In effect we are looking currently at migrating our 1300 VMs to just 3 RACs!

Granted, these are much heavier hardware footprints:

HP DL 560 machines (our XL hosts), beefed up each to 512GB memory and full 10Gbps network cards to handle interconnect, backup and end user traffic.

If we need more capacity we will simply add more RAC clusters as new cloud zones.

And the savings as I mentioned are HUGE. We can accurately save roughly 65TB of usable, high and SAN storage, fully consolidate CPU and memory usage.

And patching becomes MUCH easier as we will create new CDBs and migrate pluggable databases this way, one by one. 23

Initial Hardware Zones Lower Environments12c Multitenant

1300+ VMs to 9 HostsPresentation title (optional)24

Here is just a graphical demonstration of how we plan to migrate literally hundreds of PDBs into given RAC clusters. Multitenant allows us to do this!24

Shared Production CDB/PDB RAC/DataGuardPresentation title (optional)25

Using the same hardware for ease of provisioning, we are building out shared production zones for lower utilization applications.

Instead of using PDBs, we will deploy individual CDBs per application. Now, those CDBs may contain multiple PDBs but the architecture would ensure each CDB is treated as a failure group meaning it could be failed over or switched over using DataGuard as one unit. Each CDB would be deployed as a service catalog option most likely by our own internal DBAs. Self service to devops teams might be possible in the future if we are able to put some controls in for subscription levels in particular zones.

In effect we need to limit utilization in two ways aggregate CDBs on each cluster and then how much each CDB uses. We will do this by carefully managing the subscription levels and having the right resource management plans attached to each CDB. We also plan to undersubscribe these clusters in order to have buffer capacity if databases grow in usage. Again, as we need, we simply scale out by adding more zones and clusters.

Let me also for a moment just touch on the topic of dedicated hardware. Many of our applications will have dedicated hardware, no doubt. The concept here however will be the same in terms of cloud management. Rather than shared zones, we have dedicated zones that is all. When we manage through OEM cloud pack, we manage EVERYTHING. In our organization there is no maybe we move to the cloud, maybe we dont. This model encompasses ALL our oracle footpring, period.

25

Migration PathsComcast - Oracle OpenWorld 201626

Golden Gate

RMAN

Expdp/Impdp

26

Hybrid Cloud (2017)Presentation title (optional)27

As we move to this in house private cloud we are also going to begin experimenting with a hybrid model

Through this same OEM footprint we will be adding access to Oracles Public Cloud. We can then make these resources available to our end users transparently.

We will be exploring the business justification for this and shifting resources between private and public accordingly.

Your choice between underlying private/hybrid/public cloud resources is really going to be driven by your own economic, scale and business requirements.

But in either case, the model we laid out is the same, and the philosophy and principles equally applicable.

27

Future RoadmapOEM PlatformEliminate VMsNet new to DBaaS2-3 year process2000+ DatabasesAutomation through OEM

13cShowbackSnap ClonesRelocations (plug/unplug)Devops patchingDevops full deploymentCapacity planning (IT Analytics)Comcast - Oracle OpenWorld 201628Migrations

28

ConclusionThere is no other future!Comcast - Oracle OpenWorld 201629

29

30

Dont Miss the SessionsTitleDayTimeExpanding DBaaS Beyond Data Centers: Hybrid Cloud Onboarding via Oracle Enterprise Manager [CON6985]Wednesday11:00 AM *Deep Dive: Snap Clone and Data Refresh Solutions using Oracle Enterprise Manager [CON6982]Wednesday1:30 PM *Best Practices for Upgrading to Oracle Enterprise Manager 13c: Learn from Customers [CON6989]Thursday1:15 PM *

Location: Hotel Nikko - Peninsula (25th Floor)DayTimeRelief from Chronic Patching PainOracle Enterprise Manager Fleet Maintenance [HOL7632]Tuesday11:30 AMHarnessing the Value of Hybrid CloudComplete Management of Cloud Services [HOL7631]Wednesday1:15 PM

Do it Yourself! (Hands on Lab)* Moscone South - 305

Copyright 2016, Oracle and/or its affiliates. All rights reserved. |

31

Additional ResourcesRelated VideosYoutube.com/OracleEnterpriseMgr

Join the Conversation

Twitter.com/oracle_em

Oracle.com/newsletters

Facebook.com/oracleenterprisemanager

Blogs.Oracle.com/OEMEnterprise Manager Page on O.com

Database as a Service Page on OTN

Cloud Administration Guide (Documentation)

MOS Note 1549855.1 : Enterprise Manager Cloud Control Recommended Plug-Ins and Patches for Database as a Service (DBaaS)

Copyright 2016, Oracle and/or its affiliates. All rights reserved. |

32