Hortonworks & IBM Cloud Event
TRANSCRIPT
1 © Hortonworks Inc. 2011–2018. All rights reserved.
Hortonworks & IBM Cognitive
The Future of Data Science
Hortonworks & IBM Better Together
Hortonworks & IBM Cloud
Data has gravity. Manage it right with a hybrid data architecture for Big Data
Thiago Santiago, Solution Engineer – LATAM
Gravity: ...the force that attracts a body toward any other physical body having mass.
Data gravity: ...the ability of bodies of data to attract applications, services and other data.
Technology Trends: Shifting the Data Paradigm
Artificial Intelligence, Internet of Things, Cloud Computing, Streaming Data
- Industrial Internet, Connected Business, Consumer Devices
- Smart Devices, Autonomy, Prescriptive Analytics
- SaaS/PaaS Applications, Ephemeral Use Cases, Operational Efficiency, Collaboration
- Real-time Applications, Targeted Retail Recommendations, Industrial Applications
Do These Problems Sound Familiar?
Data, Analytics, Governance
- Don’t have the right data
- Data resides in silos
- High volumes of data
- Lack of unstructured data
- Difficult to access
- Data is hard to find and access
- Self-service is limited
- No centrally managed security system
- Data source not trusted
- Impossible to audit or track data lineage
- Lack of skills and tools
- Departmental silos
- Need for “fail fast” strategy
- Technology has limitations
What is Hortonworks DataPlane Service?
From the edge, through movement, to rest
Hortonworks DataPlane Service is a foundational platform for the delivery of data solutions that will:
• Support enterprise hybrid deployment strategy and adoption of cloud
• Provide common metadata, security and governance across all tiers and types of data
• Simplify enterprise data asset management
• Be extensible to new services: a services enablement layer for rapidly bringing new solutions to market
• Bring all data under management
[Diagram: HORTONWORKS DATAPLANE SERVICE (Manage, Secure, Govern) over MULTIPLE CLUSTERS AND SOURCES, MULTIHYBRID; DATA AT REST: Hortonworks Data Platform; DATA IN MOTION: Hortonworks DataFlow]
Global Data Management With Hortonworks
MULTIPLE CLUSTERS AND SOURCES, MULTIHYBRID
DATAPLANE SERVICE (DPS): MANAGE, GOVERN, SECURE
EXTENSIBLE SERVICES: DATA LIFECYCLE MANAGER, DATA STEWARD STUDIO, IBM DSX*, CLOUDBREAK*, DATA ANALYTICS STUDIO, Etc...
*not yet available, coming soon
CONNECTED DATA PLATFORMS:
- HORTONWORKS DATA PLATFORM (HDP®): DATA-AT-REST
- HORTONWORKS DATAFLOW (HDF™): DATA-IN-MOTION
MODERN DATA USE CASES: EDW OPTIMIZATION, CYBER SECURITY, DATA SCIENCE, ADVANCED ANALYTICS, PARTNER SOLUTIONS, IOT/STREAMING ANALYTICS
HORTONWORKS CONNECTION: ENTERPRISE SUPPORT, PREMIER SUPPORT, EDUCATIONAL SERVICES, PROFESSIONAL SERVICES, COMMUNITY CONNECTION
HORTONWORKS PLATFORM SERVICES: OPERATIONAL SERVICES, SMARTSENSE™
Global Data Management With Hortonworks
DATAPLANE SERVICE (DPS): MANAGE, GOVERN, SECURE
EXTENSIBLE SERVICES:
- DATA LIFECYCLE MANAGER (DLM)
- DATA STEWARD STUDIO (DSS)
- IBM DSX
- CLOUDBREAK (CB)
- STREAMS MESSAGING MANAGER (SMM)
- DATA ANALYTICS STUDIO (DAS)
CONNECTED DATA PLATFORMS:
- HORTONWORKS DATA PLATFORM (HDP®): DATA-AT-REST
- HORTONWORKS DATAFLOW (HDF™): DATA-IN-MOTION
DPS Console: One place to visualize Virtual Data Lakes
DPS Platform: DataPlane Service Console
Data Lifecycle Manager (DLM)
• First extensible service delivered on the DPS Platform – GA Oct 2017
• Manage the data lifecycle:
– Replication/failback to another cloud/on-prem site for disaster recovery
– Auto tiering of hot/warm/cold data to cloud object storage/on-prem for TCO reduction
– Backup and recovery of critical business data
• Maintain common security and governance policies across multiple data sources/environments
SERVICE: DATA LIFECYCLE MANAGER
[Diagram, Replication & Disaster Recovery (Generally Available): REPLICATION from Prod Cluster to Backup Cluster, with FAILBACK in the reverse direction]
[Diagram, Auto Tiering (Coming Soon): data MOVEs across three clusters. P(use): high, Cost: $$$; P(use): medium, Cost: $$; P(use): low, Cost: $]
[Diagram, Backup & Restore (Coming Soon): timeline of day 1, day 2, day 3 with a full backup followed by cumulative incremental backups; an accidental delete (X) is followed by RESTORE]
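The tiering and backup panels on this slide can be illustrated with a minimal Python sketch. It is not DLM code and does not use any DLM API: the function names, the `Backup` type and the numeric thresholds are all hypothetical, chosen only to mirror the slide's high/medium/low P(use) labels and its full-plus-cumulative-incremental backup timeline.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical thresholds: the slide only labels tiers as high/medium/low P(use).
HOT_THRESHOLD = 0.7
WARM_THRESHOLD = 0.3


def choose_tier(p_use: float) -> str:
    """Map an estimated probability of use to a storage tier."""
    if not 0.0 <= p_use <= 1.0:
        raise ValueError("p_use must be between 0 and 1")
    if p_use >= HOT_THRESHOLD:
        return "hot"   # slide: P(use) high, cost $$$
    if p_use >= WARM_THRESHOLD:
        return "warm"  # slide: P(use) medium, cost $$
    return "cold"      # slide: P(use) low, cost $


@dataclass(frozen=True)
class Backup:
    day: int
    kind: str  # "full" or "cumulative"


def restore_set(backups: List[Backup], incident_day: int) -> List[Backup]:
    """Pick the backups needed to restore the state before incident_day.

    A cumulative incremental holds every change since the last full backup,
    so at most two backups are needed: the latest full backup and the latest
    cumulative incremental taken after it.
    """
    usable = [b for b in backups if b.day < incident_day]
    fulls = [b for b in usable if b.kind == "full"]
    if not fulls:
        raise ValueError("no full backup available before the incident")
    base = max(fulls, key=lambda b: b.day)
    later = [b for b in usable if b.kind == "cumulative" and b.day > base.day]
    return [base] + ([max(later, key=lambda b: b.day)] if later else [])
```

With a full backup on day 1 and cumulative incrementals on days 2 and 3, an accidental delete on day 4 would be recovered from the day 1 and day 3 backups only; the day 2 incremental is not needed because the day 3 one already contains its changes.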
DLM: Pair clusters and manage data replication flows
Data Lifecycle Manager (DLM)
DLM: Replicate data between on-prem HDP and Cloud.
Data Lifecycle Manager (DLM)
DLM: Replication policies and instances
Data Lifecycle Manager (DLM)
Data Steward Studio
Understand, secure & govern data across enterprise data lakes
DSS: Understand the shape of Hive column data with the statistical profiler. Example: the profile shows a box plot and a histogram for the distribution of column values.
Data Steward Studio (DSS)
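To make the profiler idea concrete, the sketch below computes the numbers behind a box plot (minimum, quartiles, maximum) and a fixed-width histogram for one numeric column. It is only an illustration in plain Python: `profile_column` is a hypothetical helper, not part of DSS, and the values are assumed to have already been read from a Hive column into a list.

```python
import statistics
from typing import Dict, Sequence


def profile_column(values: Sequence[float], bins: int = 5) -> Dict[str, object]:
    """Return box-plot statistics and histogram counts for a numeric column."""
    if len(values) < 2:
        raise ValueError("need at least two values to profile a column")
    if bins < 1:
        raise ValueError("bins must be at least 1")

    # Quartiles for the box plot (inclusive method treats the data as the
    # whole population rather than a sample).
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    lo, hi = min(values), max(values)

    # Fixed-width bins between min and max; fall back to width 1.0 when the
    # column is constant so the division below is safe.
    width = (hi - lo) / bins or 1.0
    counts = [0] * bins
    for v in values:
        idx = min(int((v - lo) / width), bins - 1)  # clamp max into last bin
        counts[idx] += 1

    return {
        "min": lo,
        "q1": q1,
        "median": median,
        "q3": q3,
        "max": hi,
        "histogram": counts,
    }
```

A call such as `profile_column(column_values, bins=10)` returns a dictionary whose quartiles can drive a box plot and whose `histogram` list can drive a bar chart.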
DSS: Data lineage shows the complete chain of custody and downstream dependencies for an asset.
Data Steward Studio (DSS)
DSS: Audit Profiler shows both summarized views & patterns of access for a data asset.
Data Steward Studio (DSS)
DAS: Data Analytics Studio provides a database heatmap to quickly discover which parts of your cluster are utilized most
Data Analytics Studio (DAS)
DAS: Built-in batch operations
No more scripting needed for day-to-day operations
Data Analytics Studio (DAS)
DAS: Full-featured auto-complete, direct download of results, quick data preview and many other quality-of-life improvements.
Data Analytics Studio (DAS)
DAS: Pre-defined searches to quickly narrow down problematic queries in a large cluster
Data Analytics Studio (DAS)
DAS: Heuristic recommendation engine
Fully self-service query and storage optimization
Data Analytics Studio (DAS)
Operations: SMM
SMM (Topic Profile / Detail Page)
CB: Deploy clusters on the Cloud
DPS Platform: Cloudbreak (CB)
CB: Workload-specific prescriptive clusters
DPS Platform: Cloudbreak (CB)
Why Customers Choose Hortonworks
Global Data Management
• Hybrid
• Multi-cloud
• End-to-end security and governance
100% Open Source – “We are the Linux of Big Data”
• Innovation
• Interoperability
• No vendor lock-in
• Rapid community innovation
Proven Business Model:
• 1,300 enterprise customers
• First to IPO
• Fastest to $100M
• First to profitability
Most Comprehensive Platform
• Data at Rest and Data in Motion
• Any style of workload
• Centralized management, security, governance
Why Do Customers Migrate to Hortonworks?
Open source model aligned to customer success with no vendor lock-in
Cost advantage
Strong references from customers
Best practices captured from 1,300 customer engagements
Dozens of successful large-scale migrations to HDP
Latest and greatest Hadoop stack
A Continuous Track Record of Leading Innovation
- DATA-AT-REST (HADOOP 1.0): 100% open
- YARN (HADOOP 2.0): enable multiple workloads
- DATA-IN-MOTION (HDP & HDF): out to the edge
- CONNECT DATA PLATFORMS (cloud/on-prem): performance and cost control
- Hortonworks 1.0: Hadoop as an enterprise viable data platform
- Hortonworks 2.0: Bring this to the edge with connected platforms
- Hortonworks 3.0
- HDP IPO, Dec 2014: first OSS IPO in 10 years
- Fastest company to $100M in revenue, August 2015
- First to cash flow breakeven
- Ecosystem consolidation: IBM
- First to multi-cloud: DataPlane Service
[Chart: innovation over time, with the axis marked 2011, 2013, 2015, 2016, 2017, 2018]
Thank you