big data closing the analytics gap - sas institute€¦ · copyright © 2011, sas institute inc....

31
Copyright © 2011, SAS Institute Inc. All rights reserved. make connections • share ideas • be inspired Big Data – Closing the analytics gap André Münger Greenplum [email protected]

Upload: others

Post on 10-May-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

make connections • share ideas • be inspired

Big Data – Closing the analytics gap André Münger Greenplum [email protected]

Page 2: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

Ben Mac

Mouth

Money Bang

B

a

n

d

Brother

Lebowski

F

i

v

e

Fish

Business

Was ist all diesen Begriffen gemeinsam?

Page 3: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

DATA MINING & BIG DATA Mythos

+ =

+

1. Überhaupt nichts, ist am Preiswertesten

2. Entschuldigungsbrief schreiben (minimaler Aufwand, minimale Kosten)

3. Gutschein über 300 Franken für den nächsten Flug

4. Eine Jahresmitgliedschaft in der VIP-Lounge

5. 25.000 geschenkte Frequent-Flyer-Meilen

Page 4: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

7 THESEN ZU BIG DATA

Page 5: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

1. Man kann vor Big

Data wegrennen,

sich aber nicht

verstecken.

Page 6: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

THRIVING IN THE BIG DATA ERA

Volume

Variety

Velocity

Value

The

Analytics

Gap

Page 7: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

2. Big Data wird

DAS

entscheidende

Differenzierungs-

kriterium.

Page 8: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

SAS Press Release im April 2011

“In early April 2011, SAS demonstrated the power of high

performance analytics at its Global Forum meeting. In the

first case, two racks (16 nodes) of Greenplum's Data

Computing Appliance (DCA) were used to run a logistic

regression of bank loan defaults across a database with a

billion records, applying just a few variables. The regression

was able to complete in less than 80 seconds (as compared

to 20 hours for an unspecified serial implementation).”

0.11% of time needed

unspecified serial implementation takes 900 times longer

SAS Code: proc logistic changed to proc HPlogistic

Page 9: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

3. Big Data braucht

neue

Fähigkeiten.

Page 10: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

Statistik / Mathematik

Business Verständnis

Hacking

Entdecker-geist

Team – und Kommuni-kations-fäjhigkeit

Offenheit

Neugierde

Engineering

Data Scientist

Page 11: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

4. Big Data meint,

man soll Social

Media nutzen.

Page 12: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

Im Web wird Realität erzeugt

„Nichts passiert, was nicht genügend gedacht,

gesagt oder geschrieben worden ist.“

Facebook alleine ist das

3. grösste Land auf der Welt.

Page 13: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

5. Big Data

benötigt neue

Werkzeuge und

Technologie.

Page 14: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

TDWI Study Big Data Analytics

Page 15: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

20 Jahre Evolution

Page 16: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

SAS High Performance

Analytics (Product) Lösungs-Architektur – MPP Datenbank Appliance

Server N Server 2 Server 1

Teradata- oder

EMC² Greenplum-DB

Speicher für SAS

In-Memory Analytics

SAS High Performance

Deployment

SAS Embedded

Process

SAS High Performance

Analytics Plattform

MPI MPI

proc hplogistic data=MPPLib.MyTable; class A B C D ; model y = a b c

b*d x1-x100; output

out=MPPlib.logout pred=p; run;

Multiple

Threads

Multiple

Threads

Multiple

Threads

Database Storage

Database Server

Embedded

Process

Database Storage

Database Server

Embedded

Process

Database Storage

Database Server

Embedded

Process

SAS STAT Client

SAS Enterprise Miner

www.sas.com/hpa

Page 17: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

Traditionelle Architekturen – Scale Up

Ihre Aufgabe:

Bringen Sie 300 Passagiere in ein Flugzeug.

1 Eingang.

Wie lange dauert es?

Page 18: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

MPP – Massiv Parallele Verarbeitung

Ihre Aufgabe:

Bringen Sie 300 Passagiere in einen Zug.

20 Wagen, 2 Eingänge pro Wagen.

Wie lange dauert es?

Page 19: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

SAS AND EMC GREENPLUM UAP INTEGRATED ARCHITECTURE

Data

Scientist

Data

Engineer

Data

Analyst

Bl

Analyst LOB

User

Data

Platform

Admin

DA

TA

SC

IEN

CE

TE

AM

Greenplum Chorus - Analytic Productivity Layer

SAS Analytics

Private/Hybrid Cloud Infrastructure or Appliance

SAS Business Intelligence

SAS Information Management

Greenplum Database Greenplum Hadoop

Data Access & Query Layer (SAS ACCESS, SQL, MapReduce)

Page 20: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

2011 EMC Data Hero Award Winner

Silver Spring Networks – Energy Category

6. Big Data

transformiert

ganze Industrien.

Page 21: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

Alte Big Data Weisheit

„Wenn starker Wind aufkommt,

dann bauen die Einen Mauern,

die Anderen errichten Windmühlen.“

Page 22: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

All Data

Faster Answers

Elastic & Scalable

1 The Journey To Big Data

2 Data Science

Collaboration

Self-Service

Agile Analytics People & Productivity Focus

3 Real-Time Decisions

New Applications

Data Monetization

Analytic Productivity Platform

Agile Process & Tools

Predictive Enterprise Application Focus

Big Data Enabled Apps

Big Data Analytics Technology Focus

Analytics Engines

Cloud Infrastructure

Analytic Engines

Page 23: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

7. Big Data

verändert Leben.

2011 EMC Data Hero Award Winner Global Virus Forecasting – Public Service Category

Page 24: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

www.datascienceseries.com

Die Data Science Series

sind ein Forum für Menschen,

die die Welt verändern möchten.

Page 25: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

Page 26: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

Page 27: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

The Human Face of BIG DATA

Finding Islands of Understanding in a Sea of Information.

www.thehumanfaceofbigdata.com

Page 28: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

Zusammenfassung

Big Data ist heute

Big Data differenziert

Big Data macht Spass

Menschen kommen vor Technologie

Page 29: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

Page 30: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2011, SAS Institute Inc. All rights reserved.

Data Is Digital Air

“Data is increasingly digital air:

the oxygen we breathe and the

carbon dioxide that we exhale.

It can be a source of both,

sustenance and pollution.”

Danah Boyd, Microsoft Researcher

Page 31: Big Data Closing the analytics gap - Sas Institute€¦ · Copyright © 2011, SAS Institute Inc. All rights reserved. SAS High Performance Analytics (Product) Lösungs-Architektur

Copyright © 2010, SAS Institute Inc. All rights reserved.

make connections • share ideas • be inspired

André Münger

+41 79 708 85 99

[email protected]

EMC Computer Systems AG

Hardturmstrasse 181

8005 Zürich / Schweiz