informatica executive summit nov. 3, 2010€¦ · enterprise data governance other processes...
TRANSCRIPT
1© Copyright 2010 EMC Corporation. All rights reserved.
Informatica Executive Summit Nov. 3, 2010
2© Copyright 2010 EMC Corporation. All rights reserved.
Managing Data Growth in the 21st
Century: Leveraging Virtualization & Cloud Technology
Tony PagliaruloVice President of IT, EMC
3© Copyright 2010 EMC Corporation. All rights reserved.
Agenda
• EMC Focus & Strategy
• EMC IT Journey to the Private Cloud
• Data Virtualization Roadmap
• Information Management Governance
4© Copyright 2010 EMC Corporation. All rights reserved.
About EMC
Fortune 500 Rank: 166
Revenues (2010 estimate): > $16.9 billion
Employees (end Q3 2010 worldwide): ≈ 47,000
Countries where EMC does business: > 80
Total Cash and Investments (year to date): $10.5 billion
Quarterly Free Cash Flow (year to date): $2.2 billion
Market Value (October 2010): > $44 billion
Founded: 1979
5© Copyright 2010 EMC Corporation. All rights reserved.
EMC’s focus is
IT Infrastructure
EMC is aTECHNOLOGY
company
EMC’s Focus
6© Copyright 2010 EMC Corporation. All rights reserved.
EMC’s Complementary Strategies
Information VirtualInfrastructure Infrastructure
Information Storage
Information Management
Information Protection
Information Security
Information Intelligence
Virtualization (VMware) the Cloud OS
7© Copyright 2010 EMC Corporation. All rights reserved.
EMC IT at a GlanceUser Profiles 48,000 “internal” users
400,000+ customers and partners
IT Environment 5 data centers, 7 PB storage
Business Applications 400+ applications and tools
Virtualization 6,000+ OS images (worldwide)71% of all virtualized85% of Intel virtualized
Global Support 80+ countries and 20 languages
8© Copyright 2010 EMC Corporation. All rights reserved.
Globalization
Business Value
Security
Performance
Functionality Cost of Ownership
Interoperability
Manageability
We have the
same
challenges as
our
customers
EMC IT Current Challenges
9© Copyright 2010 EMC Corporation. All rights reserved.
2009:0.8 Zettabytes
Source: IDC Digital Universe Study, sponsored by EMC, May 2010
GROWINGby a factor of
2020:35.2 Zettabytes
44
The Digital Universe 2009 - 2020
10© Copyright 2010 EMC Corporation. All rights reserved.
IT Infrastructure today …
ComplexInefficientInflexibleCostly
72%Maintain
28%Invest
11© Copyright 2010 EMC Corporation. All rights reserved.
Enter The Cloud
Enter The Cloud.
12© Copyright 2010 EMC Corporation. All rights reserved.
What isCloud Computing?
13© Copyright 2010 EMC Corporation. All rights reserved.
The Cloud is . . .
BuiltDIFFERENTLY:
Dynamic pools of virtualized resources
OperatedDIFFERENTLY:
End-to-end service delivery
ConsumedDIFFERENTLY:
Convenient for IT and for those they support
Private Cloudis one that IT controls
14© Copyright 2010 EMC Corporation. All rights reserved.
Trusted
Controlled
Reliable
Secure
Multiple IncompatibleArchitectures
Implications for Today’s Data Centers
15© Copyright 2010 EMC Corporation. All rights reserved.
Dynamic
Cost-Efficient
On-Demand
Controlled
Secure Flexible
Reliable
Trusted
Multiple IncompatibleArchitectures
Homogeneousx86 Architecture
Implications for Today’s Data Centers
16© Copyright 2010 EMC Corporation. All rights reserved.
Dynamic
Cost-Efficient
On-Demand
Flexible
Dynamic
Cost-Efficient
On-Demand
Trusted
Controlled
Reliable
Secure Flexible
Trusted
Controlled
Reliable
Secure
Dynamic
Cost-Efficient
On-Demand
Flexible
Implications for Today’s Data Center
PrivateCloud
PublicCloud
Compute
Storage
Network
Cloud OS
17© Copyright 2010 EMC Corporation. All rights reserved.
Operating System
Information
Security
Federation
VirtualApplications
Virtualization Enables Cloud Computing
The Goal: Global Workload Deployment
PrivateCloud
PublicCloud
18© Copyright 2010 EMC Corporation. All rights reserved.
Virtualization (vSphere)
Information
Security
Federation (vMotion + VPLEX)
Private
CloudPublic
Cloud
EMC IT’s Cloud Strategy
Traditional Apps “Next-Gen” Cloud Apps SaaS
19© Copyright 2010 EMC Corporation. All rights reserved.
Our Journey to the Private Cloud
% Virtualized
15%
30%
50%
95%
IT-as-a-ServiceIT Production Business ProductionImprove agilityLower costs Improved quality of service
GovernanceCloud enablement
Service management
VDCOptimization
Standardization Virtualization
GoldPlatinum
85%
We are here
20© Copyright 2010 EMC Corporation. All rights reserved.
The Journey to the Private Cloud
% Virtualized
15%
30%
50%
85% 95%
IT-as-a-ServiceImprove Agility
IT ProductionLower Costs
Business ProductionImprove Quality of Service
PlatinumGold
Applications Application portfolio rationalization
Application selection
Virtualization of CIO owned applications
Infrastructure Data center consolidation
Virtualization strategy
Virtualization factory
Governance Establish PMO
Design and implement transformation dashboard
Implement IT management policies
Establish service catalog
21© Copyright 2010 EMC Corporation. All rights reserved.
EMC Enterprise Information Architecture
Rapid
Prototyping
Global Data Warehouse
Enterprise Data
Subject Oriented Marts
H
RRevenue
PI Tool TBD…
Source Systems
Catalyst Etc…PeopleSof
t
SAPOracle 11i
End User Query Tools
BI as a Service
POC 1 POC 2
BU App 1 BU App 2
Data Integration LayerInformatica PowerCenter Informatica Data Services
Data Federation
Master Data
Customer
Master
22© Copyright 2010 EMC Corporation. All rights reserved.
Enter The Cloud
Data VirtualizationRoadmap
23© Copyright 2010 EMC Corporation. All rights reserved.
Guiding PrinciplesApplication/ Database Layer
• Maintain as few copies of data as possible– Master Data Management (Informatica Siperian) as single source of
truth– Informatica Data Services to enable data federation– Subset data using Informatica Applimation
• Transform and replicate data if needed– Informatica PowerCenter used to feed the Global Data Warehouse and
the subject marts
• Archive data– Archive database data using Informatica Applimation– Email archiving using EMC SourceOne– Filesystem archiving using EMC Rainfinity
24© Copyright 2010 EMC Corporation. All rights reserved.
Guiding PrinciplesStorage Layer
• Better utilization using storage optimization techniques– File virtualization using EMC Rainfinity– Block virtualization using EMC vPlex– Virtual provisioning
• De-duplication technology– Source de-duplication using EMC Avamar– Target de-duplication using EMC Data Domain
• Object technology for primary storage/backup– EMC Atmos as durable distributed object storage
25© Copyright 2010 EMC Corporation. All rights reserved.
Data ILM- Complete Lifecycle of Data
Nearline Database
3. Archive for Application Retirement – archive data to on-line content addressable storage.– Retire legacy application and eliminate application and RDBMS license and
server costs– Maintain application independent access to archived data via ODBC/JDBC– Search, browse, view archived data through Informatica Data Discovery portal
1. Archive for Performance – archive (relocate) production data to less expensive and virtualized infrastructure.– Improve core application performance and operational efficiency– Lower total application infrastructure cost– Maintain seamless application access to data
2. Archive for Compliance – archive data to on-line content address storage.– Meet compliance requirements while reduce risk and infrastructure cost– Maintain application independent access to archived data in compressed file via
ODBC/JDBC– Search, browse, view archived data in compressed file through Informatica Data
Discovery portal
26© Copyright 2010 EMC Corporation. All rights reserved.
Remove Large Amounts of Data using Data Archive and Application RetirementEMC Management for Oracle Applications with Informatica
• Archive engine relocates all data within identified tables and entities based on the archiving policy definition
• Decrease capital and operating costs by reducing storage volume of rarely-used data
• Retired application data stored in highly compressed immutable file archive format
ALM
Server
File
Archive
Server
Data Discovery
BI ToolsEMC Centera
Production
Informatica Data Archive
Staging Files
27© Copyright 2010 EMC Corporation. All rights reserved.
Reduce Storage Footprint with Data SubsetEMC Management for Oracle Applications with Informatica
• Review the effect of subsetting before removing the data
• Data integrity and immediate availability for subsetted instances
• Reduce the footprint of module storage by 78%
Test/Development:
EMC Symmetrix DMX-4
Data Subset Filter
Useful data
Productionreplica
28© Copyright 2010 EMC Corporation. All rights reserved.
Oracle 11i Applications – EMC IT Use Case
• Poor Performance
• Infrastructure Costs
• Resource Costs
29© Copyright 2010 EMC Corporation. All rights reserved.
Tape Backup of Prod
90 TB
Backup of (Prod, Splx, Dev, Test, etc) onto EDL with RAID
28 TB
3 TB - Dev, Test, Training, Perf, etc RAID
12 TB - Dev, Test, Training, Perf
5 TB - Prod, Splx, SBY, ACT, Bkup Mirror
5 TB - Prod, Splx, SBY, ACT, Bkup DR
5 TB - Prod, Splx, SBY, ACT, Bkup Mirror
5 TB - Prod, Splx, SBY, ACT, Bkup
Oracle 11i Multiplier
1TB of Data
Storage Multiplier Effect (circa 2008)
10TB
5TB
15TB
20TB
32TB
35TB
63TB
153TB
30© Copyright 2010 EMC Corporation. All rights reserved.
Deduplication
Data ILM Journey
2008
2009
2010
2011
Oracle 11i Multiplier Effect – 1 TB
153 TB• Tape Backup (90)
• EDL Backup (30)
• Non Prod (12)
• Disaster
Recovery (5)
• Production (5)
64TB• EDL Backup (30)
• Non Prod (9)
• Disaster
Recovery (5)
• Production (5)
40TB• EDL Backup (30)
• Non Prod (9)
• Disaster
Recovery (5)
• Production (5)
20TB• Deduplication
• Archiving
• Retirement
Decommission of 3 Envs.
Elimination of Tape Backups
Reduce Backup Retention
Archiving
Subsetting
31© Copyright 2010 EMC Corporation. All rights reserved.
Enter The Cloud
Information Mgmt Governance
32© Copyright 2010 EMC Corporation. All rights reserved.
However Impact Will Be Limited Without Enterprise Data Governance
Other processes (partner, vendor, etc)Sustain: Ongoing adjustment of business rules, data cleansing and sourcing
3
Data Governance
2 Integrate: Consistent data integration across multiple processes driving enterprise-wide analytics and insights
Lead gene-ration
Lead mgt
Oppty mgt
Order to cash
Service and support
Customer lifecycle process
Feasibility DesignQualifi-cation
General Avail-ability
End of life
Product lifecycle process
Define: Consistent data definitions across a singleprocess (over multiple geos and functions)
1
33© Copyright 2010 EMC Corporation. All rights reserved.
Data Governance Best Practice
Data is an enterpriseasset, and should be governed and secured at the enterprise level
Business ownership of data has to come top-down from the highest executives
– IT is a key enabler, but not the owner
Business users are the data stewards and content architects
CRM
• Customer Accounts
• Partner Accounts
HR
• Employee• Contractor
ERP
• Product• Item
• Orders
Eng/Svc
• Product Quality
• Total Customer
Experience
Other
• Shadow• NDA
• Personal
MDM
Business Intelligence
Role-based Access
Compliance & reporting
DATA
34© Copyright 2010 EMC Corporation. All rights reserved.
www.EMC.com/emcit
EMC IT Journey to the Private Cloud: A Practitioner's Guidehttp://www.emc.com/collateral/software/white-papers/h7298-it-journey-private-cloud-wp.pdf
35EMC CONFIDENTIAL—INTERNAL USE ONLY
Q&A
36© Copyright 2010 EMC Corporation. All rights reserved.
THANK YOU