hortonworks & ibm cloud event · governance-don’t have the right data-data resides in...

41
1 © Hortonworks Inc. 2011–2018. All rights reserved. Hortonworks & IBM Cognitive The Future of Data Science Hortonworks & IBM Better Together

Upload: others

Post on 17-Jun-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

1 © Hortonworks Inc. 2011–2018. All rights reserved.

Hortonworks & IBM CognitiveThe Future of Data ScienceHortonworks & IBM Better Together

Page 2: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

2 © Hortonworks Inc. 2011–2018. All rights reserved.

Hortonworks & IBM Cloud

Data has gravity. Manage it right with a hybrid data

architecture for BigData

Thiago SantiagoSolution Engineer – Latam

Page 3: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

3 © Hortonworks Inc. 2011–2018. All rights reserved.

Gravity: ...the force that attracts a body toward any other physical body

having mass.

Page 4: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

4 © Hortonworks Inc. 2011–2018. All rights reserved.

Page 5: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

5 © Hortonworks Inc. 2011–2018. All rights reserved.

Page 6: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

6 © Hortonworks Inc. 2011–2018. All rights reserved.

Page 7: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

7 © Hortonworks Inc. 2011–2018. All rights reserved.

...the ability of bodies of data to attract applications, services andother data.

Data gravity:

Page 8: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

8 © Hortonworks Inc. 2011–2018. All rights reserved.

Technology Trends: Shifting the Data Paradigm

Artificial IntelligenceInternet of Things Cloud Computing Streaming Data

Industrial InternetConnected BusinessConsumer Devices

Smart DevicesAutonomy

Prescriptive Analytics

SaaS/PaaS ApplicationsEphemeral Use CasesOperational Efficiency

Collaboration

Real-time ApplicationsTargeted Retail

RecommendationsIndustrial Applications

Page 9: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

9 © Hortonworks Inc. 2011–2018. All rights reserved.

Do These Problems Sound Familiar?

Data Analytics

Governance

- Don’t have the right data- Data resides in silos- High volumes of data- Lack of unstructured data - Difficult to access

- Data is hard to find and access- Self-Service is limited - No centrally managed security system

Data source not trusted -Impossible to audit or track data lineage -

Lack of skills and tools -Departmental silos -

Need for “fail fast” strategy -Technology has limitations -

Page 10: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

10 © Hortonworks Inc. 2011–2018. All rights reserved.

From the edge, through movement, to rest

Hortonworks DataPlane Service a foundational platform for the delivery of data solutions that will:

• Support enterprise hybrid deployment strategy and adoption of cloud

• Common Metadata, Security and Governance across all tiers and types of data

• Simplified enterprise data asset management

• Extensible to new services: Services enablement layer for rapidly bringing new solutions to market

• Brings all data under management

HORTONWORKSDATAPLANE

SERVICE

MULTIPLE CLUSTERS AND SOURCES

MULTIHYBRID

Manage, Secure, GovernDATA AT REST

HortonworksData Platform

DATA IN MOTION

HortonworksData Flow

What is Hortonworks DataPlane Service?

Page 11: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

11 © Hortonworks Inc. 2011–2018. All rights reserved.

MULTIPLE CLUSTERS AND SOURCES

MULTIHYBRID

DATAPLANE SERVICE (DPS)MANAGE, GOVERN, SECURE

DATALIFECYCLEMANAGER

DATA STEWARD

STUDIOEtc...

*not yet available, coming soon

EXTENSIBLE SERVICES

IBM DSX*CLOUD-BREAK*

DATAANALYTICS

STUDIO

CONNECTED DATA PLATFORMS

HORTONWORKSDATA PLATFORM (HDP®)DATA-AT-REST

HORTONWORKSDATAFLOW (HDF™)DATA-IN-MOTION

MODERN DATA USE CASESEDW

OPTIMIZATION CYBER SECURITY DATA SCIENCE ADVANCEDANALYTICS

PARTNERSOLUTIONS

IOT/ STREAMING ANALYTICS

HORTONWORKSCONNECTION

ENTERPRISE SUPPORT

PREMIER SUPPORT

EDUCATIONAL SERVICES

PROFESSIONAL SERVICES

COMMUNITY CONNECTION

HORTONWORKSPLATFORM SERVICES

OPERATIONAL SERVICES

SMARTSENSE™

Global Data Management With Hortonworks

Page 12: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

12 © Hortonworks Inc. 2011–2018. All rights reserved.

DATAPLANE SERVICE (DPS)MANAGE, GOVERN, SECURE

DATALIFECYCLEMANAGER

- DLM -

DATA STEWARD

STUDIO- DSS -

IBM DSX

EXTENSIBLE SERVICES

CLOUDBREAK- CB -

STREAMS MESSAGING MANAGER

- SMM -

DATAANALYTICS

STUDIO- DAS -

CONNECTED DATA PLATFORMS

HORTONWORKSDATA PLATFORM (HDP®)DATA-AT-REST

HORTONWORKSDATAFLOW (HDF™)DATA-IN-MOTION

Global Data Management With Hortonworks

Page 13: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

13 © Hortonworks Inc. 2011–2018. All rights reserved.

DPS Console: One place to visualize Virtual Data Lakes

DPS PlatformData Plane Service Console

Page 14: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

14 © Hortonworks Inc. 2011–2018. All rights reserved.

DATAPLANE SERVICE (DPS)MANAGE, GOVERN, SECURE

DATALIFECYCLEMANAGER

- DLM -

DATA STEWARD

STUDIO- DSS -

Etc...

EXTENSIBLE SERVICES

CLOUDBREAK- CB -

STREAMS MESSAGING MANAGER

- SMM -

DATAANALYTICS

STUDIO- DAS -

CONNECTED DATA PLATFORMS

HORTONWORKSDATA PLATFORM (HDP®)DATA-AT-REST

HORTONWORKSDATAFLOW (HDF™)DATA-IN-MOTION

Global Data Management With Hortonworks

Page 15: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

15 © Hortonworks Inc. 2011–2018. All rights reserved.

• First Extensible Service Delivered on DPS Platform –

GA Oct, 2017

• Manage the Data Lifecycle:

– Replication/failback to another cloud/on-prem

site for Disaster Recovery

– Auto Tiering of hot/warm/cold data to cloud

object storage/on-prem for TCO reduction

– Backup & Recover Critical Business Data

• Maintain Common Security and Governance Policies

Across Multi Data Sources/ Environments

Data Lifecycle Manager (DLM)

SERVICE: DATA LIFECYCLE MANAGER

REPLICATION &DISASTER

RECOVERY

Cluster Cluster ClusterMOVE MOVE

AUTO TIERING

BACKUP &RESTORE

P(use): highCost: $$$

P(use): mediumCost: $$

P(use): lowCost: $

Fullbackup

day 1 day 2 day 3

Cumulative incrementalbackups

Accidentdelete

X

FAILBACK

REPLICATION

RESTORE

ProdCluster

BackupCluster

Generally Available

Coming Soon

Coming Soon

Page 16: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

16 © Hortonworks Inc. 2011–2018. All rights reserved.

DLM: Pair clusters and manage data replication flows

Data Lifecycle Manager (DLM)

Page 17: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

17 © Hortonworks Inc. 2011–2018. All rights reserved.

DLM: Replicate data between on-prem HDP and Cloud.

Data Lifecycle Manager (DLM)

Page 18: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

18 © Hortonworks Inc. 2011–2018. All rights reserved.

DLM: Replication policies and instances

Data Lifecycle Manager (DLM)

Page 19: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

19 © Hortonworks Inc. 2011–2018. All rights reserved.

DATAPLANE SERVICE (DPS)MANAGE, GOVERN, SECURE

DATALIFECYCLEMANAGER

- DLM -

DATA STEWARD

STUDIO- DSS -

Etc...

EXTENSIBLE SERVICES

CLOUDBREAK- CB -

STREAMS MESSAGING MANAGER

- SMM -

DATAANALYTICS

STUDIO- DAS -

CONNECTED DATA PLATFORMS

HORTONWORKSDATA PLATFORM (HDP®)DATA-AT-REST

HORTONWORKSDATAFLOW (HDF™)DATA-IN-MOTION

Global Data Management With Hortonworks

Page 20: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

20 © Hortonworks Inc. 2011–2018. All rights reserved.

Data Steward StudioUnderstand, secure & govern data across enterprise data lakes

Page 21: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

21 © Hortonworks Inc. 2011–2018. All rights reserved.

DSS: Understand shape of Hive column data with statistical profiler, example: Profile shows box plot and histogram for distribution of column values

Data Steward Studio (DSS)

Page 22: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

22 © Hortonworks Inc. 2011–2018. All rights reserved.

DSS: Data lineage shows complete chain of custody and downstream dependencies for an asset!

Data Steward Studio (DSS)

Page 23: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

23 © Hortonworks Inc. 2011–2018. All rights reserved.

DSS: Audit Profiler shows both summarized views & patterns of access for a data asset.

Data Steward Studio (DSS)

Page 24: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

24 © Hortonworks Inc. 2011–2018. All rights reserved.

DATAPLANE SERVICE (DPS)MANAGE, GOVERN, SECURE

DATALIFECYCLEMANAGER

- DLM -

DATA STEWARD

STUDIO- DSS -

Etc...

EXTENSIBLE SERVICES

CLOUDBREAK- CB -

STREAMS MESSAGING MANAGER

- SMM -

DATAANALYTICS

STUDIO- DAS -

CONNECTED DATA PLATFORMS

HORTONWORKSDATA PLATFORM (HDP®)DATA-AT-REST

HORTONWORKSDATAFLOW (HDF™)DATA-IN-MOTION

Global Data Management With Hortonworks

Page 25: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

25 © Hortonworks Inc. 2011–2018. All rights reserved.

DAS: Data Analytics Studio gives database heatmap, quickly discover and see what part of your cluster is being utilized more

Data Analytics Studio (DAS)

Page 26: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

26 © Hortonworks Inc. 2011–2018. All rights reserved.

DAS: Built-in batch operationsNo more scripting needed for day-to-day operations

Data Analytics Studio (DAS)

Page 27: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

27 © Hortonworks Inc. 2011–2018. All rights reserved.

DAS: Full featured Auto-complete, results direct download, quick-data preview and many other quality-of-life improvements.

Data Analytics Studio (DAS)

Page 28: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

28 © Hortonworks Inc. 2011–2018. All rights reserved.

DAS: Pre-defined searches to quickly narrow down problematic queries in a large cluster

Data Analytics Studio (DAS)

Page 29: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

29 © Hortonworks Inc. 2011–2018. All rights reserved.

DAS: Heuristic recommendation engineFully self-serviced query and storage optimization

Data Analytics Studio (DAS)

Page 30: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

30 © Hortonworks Inc. 2011–2018. All rights reserved.

DATAPLANE SERVICE (DPS)MANAGE, GOVERN, SECURE

DATALIFECYCLEMANAGER

- DLM -

DATA STEWARD

STUDIO- DSS -

Etc...

EXTENSIBLE SERVICES

CLOUDBREAK- CB -

STREAMS MESSAGING MANAGER

- SMM -

DATAANALYTICS

STUDIO- DAS -

CONNECTED DATA PLATFORMS

HORTONWORKSDATA PLATFORM (HDP®)DATA-AT-REST

HORTONWORKSDATAFLOW (HDF™)DATA-IN-MOTION

Global Data Management With Hortonworks

Page 31: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

31 © Hortonworks Inc. 2011–2018. All rights reserved.

Operations: SMM

Page 32: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

32 © Hortonworks Inc. 2011–2018. All rights reserved.

SMM(Topic Profile / Detail Page)

Page 33: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

33 © Hortonworks Inc. 2011–2018. All rights reserved.

SMM(Topic Profile / Detail Page)

Page 34: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

34 © Hortonworks Inc. 2011–2018. All rights reserved.

DATAPLANE SERVICE (DPS)MANAGE, GOVERN, SECURE

DATALIFECYCLEMANAGER

- DLM -

DATA STEWARD

STUDIO- DSS -

Etc...

EXTENSIBLE SERVICES

CLOUDBREAK- CB -

STREAMS MESSAGING MANAGER

- SMM -

DATAANALYTICS

STUDIO- DAS -

CONNECTED DATA PLATFORMS

HORTONWORKSDATA PLATFORM (HDP®)DATA-AT-REST

HORTONWORKSDATAFLOW (HDF™)DATA-IN-MOTION

Global Data Management With Hortonworks

Page 35: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

35 © Hortonworks Inc. 2011–2018. All rights reserved.

CB: Deploy clusters on the Cloud

DPS PlatformCloudbreak (CB)

Page 36: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

36 © Hortonworks Inc. 2011–2018. All rights reserved.

CB: Workload specific prescriptive clusters

DPS PlatformCloudbreak (CB)

Page 37: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

37 © Hortonworks Inc. 2011–2018. All rights reserved.

DATAPLANE SERVICE (DPS)MANAGE, GOVERN, SECURE

DATALIFECYCLEMANAGER

- DLM -

DATA STEWARD

STUDIO- DSS -

Etc...

EXTENSIBLE SERVICES

CLOUDBREAK- CB -

STREAMS MESSAGING MANAGER

- SMM -

DATAANALYTICS

STUDIO- DAS -

CONNECTED DATA PLATFORMS

HORTONWORKSDATA PLATFORM (HDP®)DATA-AT-REST

HORTONWORKSDATAFLOW (HDF™)DATA-IN-MOTION

Global Data Management With Hortonworks

Page 38: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

38 © Hortonworks Inc. 2011–2018. All rights reserved.

Why Customers Choose Hortonworks

Global Data Management • Hybrid• Multi-cloud• End-to-end security and governance

100% Open Source –“We are the Linux of Big Data”

• Innovation• Interoperability• No vendor lock-in• Rapid community innovation

Proven Business Model:• 1,300 enterprise customers• First to IPO• Fastest to $100M• First to profitability

Most Comprehensive Platform • Data at Rest and Data in Motion• Any style of workload• Centralized management, security,

governance

Page 39: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

39 © Hortonworks Inc. 2011–2018. All rights reserved.

Why Do Customers Migrate to Hortonworks?

Open source model aligned to customer success with no vendor lock-in

Cost advantage

Strong references from customersBest practices captured from 1,300 customer engagements

Dozens of successful large-scale migrations to HDP

Latest and greatest Hadoop stack

Page 40: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

40 © Hortonworks Inc. 2011–2018. All rights reserved.40 © Hortonworks Inc. 2011–2018. All rights reserved.

A Continuous Track Record of Leading Innovation

DATA-AT-REST HADOOP 1.0100% Open

YARN HADOOP 2.0Enable multiple workloads

DATA-IN-MOTIONHDP & HDFOut to the edge

CONNECT DATA PLATFORMSCloud/On-Prem

Performance and cost control

Hortonworks 3.0

HDP IPODec 2014First OSS IPO in 10 years

Inno

vatio

n

Hortonworks 1.0Hadoop as an enterprise viable data platform

Fastest Companyto $100M in RevenueAugust 2015

Hortonworks 2.0Bring this to the edge with connected platforms

Ecosystem consolidationIBM

First to Cash Flow Breakeven

2011 2013 2015

2016

2017

2018

First to Multi-CloudDataPlane Service

Page 41: Hortonworks & IBM Cloud Event · Governance-Don’t have the right data-Data resides in silos-High volumes of data ... • Support enterprise hybrid deployment strategy and adoption

41 © Hortonworks Inc. 2011–2018. All rights reserved.

Thank you