Microsoft and Hortonworks Deliver the Modern Data Architecture for Big Data

© Hortonworks Inc. 2014 Hybrid Modern Data Architecture with Microsoft and Apache Hadoop

Upload: hortonworks

Post on 26-Jan-2015

Category:

Technology


DESCRIPTION

Joint webinar with Microsoft and Hortonworks on the power of combining the Hortonworks Data Platform with Microsoft’s ubiquitous Windows, Office, SQL Server, Parallel Data Warehouse, and Azure platform to build the Modern Data Architecture for Big Data.

TRANSCRIPT

Page 1: Microsoft and Hortonworks Delivers the Modern Data Architecture for Big Data

© Hortonworks Inc. 2014

Hybrid Modern Data Architecture with Microsoft and Apache Hadoop

Page 2:

Your Presenters

• Oliver Chiu (twitter name)
  – Title
  – Years of experience
  – Fun Fact

• John Kreisa (@marked_man)
  – VP Strategic Marketing, Hortonworks
  – Over 20 years in data management as a developer and a marketer
  – Avid camper

Page 3:

Poll 1: What stage are you at in looking at Hadoop?

• Research
• Evaluation
• Trial
• Haven’t started research

Page 4:

Today’s Topics

• Introduction
• What is a Hybrid Modern Data Architecture (MDA)?
• Apache Hadoop in the Hybrid MDA
• The Hybrid MDA and Microsoft
• Q&A

Page 5:

Existing Data Architecture

[Diagram layers]
• APPLICATIONS: Business Analytics, Custom Applications, Packaged Applications
• DATA SYSTEM: REPOSITORIES (RDBMS, EDW, MPP)
• SOURCES: Existing Sources (CRM, ERP, Clickstream, Logs)

Source: IDC
• 2.8 ZB in 2012
• 85% from New Data Types
• 15x Machine Data by 2020
• 40 ZB by 2020

Page 6:

Modern Data Architecture Enabled

[Diagram layers]
• APPLICATIONS: Business Analytics, Custom Applications, Packaged Applications
• DATA SYSTEM: REPOSITORIES (RDBMS, EDW, MPP)
• OPERATIONAL TOOLS: MANAGE & MONITOR
• DEV & DATA TOOLS: BUILD & TEST
• SOURCES: Existing Sources (CRM, ERP, Clickstream, Logs); Emerging Sources (Sensor, Sentiment, Geo, Unstructured)

Page 7:

Hadoop Powers Modern Data Architecture

Apache Hadoop is an open source project governed by the Apache Software Foundation (ASF) that allows you to gain insight from massive amounts of structured and unstructured data quickly and without significant investment.

[Diagram: a Hadoop Cluster made up of many "compute & storage" nodes]

Hadoop clusters provide scale-out storage and distributed data processing on commodity hardware.
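To make "scale-out storage and distributed data processing" concrete, here is a minimal single-process sketch of the map-and-reduce pattern that Hadoop popularized. It is only an illustration, not Hadoop code: the function names, the round-robin placement, and the sample log lines are all assumptions made for this example, and a real cluster would run the map step in parallel on the nodes that hold each block.

```python
from collections import Counter
from functools import reduce


def split_into_blocks(records, num_nodes):
    """Spread records round-robin over hypothetical nodes (stands in for scale-out storage)."""
    blocks = [[] for _ in range(num_nodes)]
    for i, record in enumerate(records):
        blocks[i % num_nodes].append(record)
    return blocks


def map_block(block):
    """Map step: each node counts the words in its own local block."""
    counts = Counter()
    for line in block:
        counts.update(line.lower().split())
    return counts


def reduce_counts(partials):
    """Reduce step: merge the per-node partial counts into one result."""
    return reduce(lambda a, b: a + b, partials, Counter())


def word_count(records, num_nodes=4):
    blocks = split_into_blocks(records, num_nodes)
    # On a real cluster these map calls would run in parallel, one per node.
    partials = [map_block(block) for block in blocks]
    return reduce_counts(partials)


# Hypothetical sample data for illustration only.
logs = ["error disk full", "warn disk slow", "error network down"]
totals = word_count(logs, num_nodes=2)
print(totals["error"], totals["disk"])
```

The point of the sketch is that the final counts do not depend on how many nodes the data is spread across, which is what lets this style of processing scale out by adding commodity machines.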

Page 8:

3 Requirements for Hadoop’s Role in the Modern Data Architecture

Requirements for Hadoop Adoption

• Integrated: Interoperable with existing data center investments
• Key Services: Platform, operational and data services essential for the enterprise
• Skills: Leverage your existing skills: development, operations, analytics

Page 9:

© Hortonworks Inc. 2013

Use Cases for the MDA

Industry | Use Case | Type of Data
---------|----------|-------------
Financial Services | New Account Risk Screens | Text, Server Logs
Financial Services | Trading Risk | Server Logs
Financial Services | Insurance Underwriting | Geographic, Sensor, Text
Telecom | Call Detail Records (CDRs) | Machine, Geographic
Telecom | Infrastructure Investment | Machine, Server Logs
Telecom | Real-time Bandwidth Allocation | Server Logs, Text, Social
Retail | 360° View of the Customer | Clickstream, Text
Retail | Localized, Personalized Promotions | Geographic
Retail | Website Optimization | Clickstream
Manufacturing | Supply Chain and Logistics | Sensor
Manufacturing | Assembly Line Quality Assurance | Sensor
Manufacturing | Crowdsourced Quality Assurance | Social
Healthcare | Use Genomic Data in Medical Trials | Structured
Healthcare | Monitor Patient Vitals in Real-Time | Sensor
Pharmaceuticals | Recruit and Retain Patients for Drug Trials | Social, Clickstream
Pharmaceuticals | Improve Prescription Adherence | Social, Unstructured, Geographic
Oil & Gas | Unify Exploration & Production Data | Sensor, Geographic & Unstructured
Oil & Gas | Monitor Rig Safety in Real-Time | Sensor, Unstructured
Government | ETL Offload in Response to Federal Budgetary Pressures | Structured
Government | Sentiment Analysis for Government Programs | Social

Page 10:

Microsoft in the Modern Data Architecture

[Diagram layers]
• APPLICATIONS: Microsoft Applications
• DATA SYSTEM
• OPERATIONAL TOOLS
• DEV & DATA TOOLS
• INFRASTRUCTURE
• SOURCES: Existing Sources (CRM, ERP, Clickstream, Logs); Emerging Sources (Sensor, Sentiment, Geo, Unstructured)

New! Power BI (Public Preview)

Page 11:

Today’s Topics

• Introduction
• What is a Hybrid Modern Data Architecture (MDA)?
• Apache Hadoop in the Hybrid MDA
• The Hybrid MDA and Microsoft
• Q&A

Page 12:

Hortonworks and Microsoft

• Engineering alignment
• Corporate alignment
• Field alignment

Page 13:

End-to-End Data Platform

• PDW vNext (PDW + HDInsight)
• Windows Azure HDInsight
• Hortonworks Data Platform
• PDW
• SQL Server for DW in Azure
• SQL Server

Page 14:

Hadoop Solutions From Microsoft

• PDW vNext (PDW + HDInsight)
• Windows Azure HDInsight
• Hortonworks Data Platform

Page 15:

Hortonworks Data Platform for Windows

Hortonworks Data Platform

Page 16:

Parallel Data Warehouse Next with HDInsight

PDW vNext (PDW + HDInsight)

Page 17:

PolyBase

[Diagram: a single "Select…" query reads Hadoop Data and Relational Data and returns one Result Set]
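The idea on this slide, one query that spans Hadoop data and relational data and returns a single result set, can be sketched in a few lines. The sketch below is a conceptual stand-in only: it does not use PolyBase or its syntax. It assumes an in-memory SQLite database for the relational side and a small delimited string in place of a file stored in Hadoop; the table names, column names, and sample rows are all hypothetical.

```python
import csv
import io
import sqlite3

# Hypothetical delimited file content, standing in for data stored in Hadoop (HDFS).
HADOOP_CLICKS = "customer_id,page\n1,/home\n2,/pricing\n1,/pricing\n"


def query_across_stores(hadoop_text):
    """Join relational rows with file-based rows and return one result set."""
    conn = sqlite3.connect(":memory:")

    # Relational side: an ordinary table (hypothetical schema and rows).
    conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")
    conn.executemany(
        "INSERT INTO customers VALUES (?, ?)", [(1, "Ada"), (2, "Grace")]
    )

    # Hadoop side: expose the delimited file as a table so SQL can reach it.
    conn.execute("CREATE TABLE clicks (customer_id INTEGER, page TEXT)")
    rows = [
        (int(r["customer_id"]), r["page"])
        for r in csv.DictReader(io.StringIO(hadoop_text))
    ]
    conn.executemany("INSERT INTO clicks VALUES (?, ?)", rows)

    # One SELECT over both kinds of data.
    result = conn.execute(
        "SELECT c.name, COUNT(*) AS page_views "
        "FROM customers c JOIN clicks k ON k.customer_id = c.id "
        "GROUP BY c.name ORDER BY c.name"
    ).fetchall()
    conn.close()
    return result


print(query_across_stores(HADOOP_CLICKS))
```

Unlike this sketch, which copies the file rows into the database first, the slide presents the Hadoop data as something the query reaches where it lives; the example only shows the shape of the result the user sees.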

Page 18:

Scale out technologies in SQL Server Parallel Data Warehouse

Page 19:
Page 20:

Windows Azure HDInsight

Page 21:

Master Chief meets Big Data

• In-game analysis detects cheaters and improves experience for everyone
• Enables targeted campaigns that improve customer retention

Page 22:
Page 23:

Hadoop Solutions From Microsoft

• PDW vNext (PDW + HDInsight)
• Windows Azure HDInsight
• Hortonworks Data Platform

Page 24:

Development and Data Tools

Hortonworks & Microsoft Reference Architecture

[Diagram labels]
• SOURCE DATA: JMS Queues, Servers & Mainframe, Files, Databases, Sensor data, Social
• LOAD: SQOOP, FLUME, Web HDFS
• AMBARI, MAPREDUCE, YARN, TEZ, HDFS
• DATA SERVICES: HIVE, HBASE, PIG, HCATALOG
• INTERFACE: ODBC, JDBC, JAVA RPC
• HADOOP Data Services: Governance, Exchange, Replication
• Query/Visualization/Reporting/Analytics
• Enterprise Repositories
• Management and Monitoring
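Of the LOAD options named in the diagram, WebHDFS is the one that is plain HTTP, so its request shape is easy to show. The sketch below only builds request URLs in the documented `/webhdfs/v1/<path>?op=...` form and sends nothing over the network. The helper name, the host name, and the paths are hypothetical, and the default port of 50070 is an assumption based on the NameNode HTTP default in Hadoop releases of that era; check your cluster's configuration.

```python
from urllib.parse import quote, urlencode


def webhdfs_url(host, path, op, port=50070, params=None):
    """Build a WebHDFS REST URL (hypothetical helper; performs no network I/O)."""
    if not path.startswith("/"):
        raise ValueError("HDFS path must be absolute, e.g. /data/logs")
    # 'op' selects the operation, e.g. LISTSTATUS, OPEN, or CREATE.
    query = urlencode({"op": op, **(params or {})})
    return f"http://{host}:{port}/webhdfs/v1{quote(path)}?{query}"


# Hypothetical host and paths for illustration only.
list_url = webhdfs_url("namenode.example.com", "/data/logs", "LISTSTATUS")
open_url = webhdfs_url(
    "namenode.example.com", "/data/logs/app.log", "OPEN",
    params={"user.name": "hdfs"},
)
print(list_url)
print(open_url)
```

LISTSTATUS and OPEN are issued as HTTP GET requests, while CREATE is issued as a PUT, so a client built on this helper would still need to pick the right HTTP method for each operation.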

Page 25:
Page 26:
Page 27:

The Question & Answer session will be conducted electronically, using the panel to the right of your screen.

More about Microsoft and Hortonworks: http://hortonworks.com/labs/Microsoft

Get started with Hortonworks Sandbox: http://hortonworks.com/hadoop-tutorial/partner-tutorial-microsoft/

Follow us: @hortonworks @MicrosoftBI