
Post on 14-Mar-2020


Page 1: Bob Montemurro Chief Architect Think Big Analytics · 3 So far we have delivered 200+ successful projects for 100+ clients worldwide. Global delivery model to balance needs (on-site,

1

Bob Montemurro

Chief Architect – Think Big Analytics

Analytic Ecosystem Trends

2

Objective of this Session

Analytic Ecosystem Architecture Trends

• Outline key global emerging ecosystem trends and patterns

• Break through the technology hype

• Understand the changing analytic personas

• Challenge technology-first thinking

3

Who Is Think Big?

1st Big Data provider 100% focused around open source.

So far we have delivered 200+ successful projects for 100+ clients worldwide.

Global delivery model to balance needs (on-site, near-shore, off-shore).

Vendor-neutral with an ecosystem focus.

Full-spectrum consulting: business, architecture, data engineering, data science & BI, and managed services.

Apache Hadoop and cloud ecosystem integration.

Founded in 2010; industry thought leader.

Fixed-fee offerings for data science and engineering.

4

What Every Company Wants

Take advantage of open source

Embrace the cloud

Enable real time analytics for everybody

The Next Big Thing: AI

Shorten time to value

Build a Data Lake

No Downtime

Always On!

Secure Everything

5

Too Much Choice?

6

What Every Company Needs

• Business Driven Architecture & Enablement Strategy

• Information Architecture

• Architecture and Data Strategy Governance

• A Business Continuity Continuum

• An Ecosystem That Includes a Data Lake

• DevOps and Analytic Ops Capabilities

7

Architecture Trend #1

Data Lake 2.0

• Restart of Data Lakes, establishing provenance and trust in data

• Establishing a foundation of raw data in the image of the source

• Enabling downstream facilities to further process and analyze data
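
As a rough illustration of "provenance and trust" over raw data kept "in the image of the source", the sketch below wraps an untouched payload in a small metadata envelope. The field names (`source_system`, `ingested_at`, `sha256`, `raw`) and the sample payload are assumptions made for this example, not part of the original deck.

```python
import hashlib
from datetime import datetime, timezone


def land_raw_record(payload: bytes, source_system: str) -> dict:
    """Wrap an untouched source payload with provenance metadata.

    The payload is kept byte-for-byte; only the envelope is added on landing.
    All field names here are hypothetical.
    """
    return {
        "source_system": source_system,
        "ingested_at": datetime.now(timezone.utc).isoformat(),
        "sha256": hashlib.sha256(payload).hexdigest(),  # lets consumers verify integrity
        "raw": payload,                                  # never modified after landing
    }


def verify(record: dict) -> bool:
    """Return True if the raw payload still matches its recorded checksum."""
    return hashlib.sha256(record["raw"]).hexdigest() == record["sha256"]


record = land_raw_record(b'{"booking_id": 1, "fare": "129.00"}', "bookings_db")
print(verify(record))
```

Downstream facilities can then check `verify(record)` before processing, and trace any derived value back to `source_system` and `ingested_at`.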

8

Think Big Reference Information Architecture

Sources: Any system or process that provides data that can or may be used for analysis. Sources may be highly structured, such as a database, or multi-structured or unstructured, such as log data or machine-generated data.

Acquisition: Acquire data from one or more sources and do very minimal work to make some of it consumable. Relevant terms: Ingest; Landing; Staging; Buffer; Data Lake

Integration: Perform transformations on some of the data to aid generalization and reuse, and to promote cross-functional usage through common relationships. Relevant terms: Preparation; Transformation; Generalized Core; Extensible Layer; Data Product; Foundation

Access: Optimize the data for access in cases where it is frequently used, to make access easier and faster. Relevant terms: Business Specific; Semantic Layer; Data Mart; Presentation; Consumption

User Defined: Allow the user to draw upon any other tier to aid in rapid creation of temporary data sets to fulfill an analytical task. Relevant terms: Discovery; Data Labs; Sandboxes; Sandpits; Agile

Delivery: The usage of data for various types of analysis: Descriptive, Diagnostic, Predictive, or Prescriptive.

[Diagram labels: Shared; User Defined]
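
The Acquisition, Integration, Access and User Defined tiers described on this slide can be sketched as a small pipeline. This is only a minimal illustration under assumed sample data; the function names (`integrate`, `build_access_summary`) and the record fields are hypothetical and do not come from the deck.

```python
from collections import defaultdict

# Acquisition: land source rows as-is (hypothetical sample data, untyped strings).
landed = [
    {"cust": " 42 ", "amount": "10.50", "channel": "web"},
    {"cust": "42", "amount": "4.50", "channel": "store"},
    {"cust": "7", "amount": "3.00", "channel": "web"},
]


def integrate(rows):
    """Integration: standardize keys and types so rows relate across sources."""
    return [
        {
            "customer_id": int(r["cust"].strip()),
            "amount": float(r["amount"]),
            "channel": r["channel"],
        }
        for r in rows
    ]


def build_access_summary(rows):
    """Access: precompute a frequently used summary (spend per customer)."""
    totals = defaultdict(float)
    for r in rows:
        totals[r["customer_id"]] += r["amount"]
    return dict(totals)


integrated = integrate(landed)
spend_by_customer = build_access_summary(integrated)

# User Defined: an analyst draws on any tier to build a temporary data set.
web_only = [r for r in integrated if r["channel"] == "web"]

print(spend_by_customer)
```

The point of the layering is that the landed rows stay untouched, the integrated rows are reusable by everyone, and only the access summary is shaped for one frequent question.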

9

Reference Architecture Components

10

Reference Information Architecture Usage Patterns

[Diagram: REFERENCE INFORMATION ARCHITECTURE]

Cross-cutting: Security, Workload Management

Tiers: Acquisition; Integration; Access

Components: Landing; Shared Views & Services; Optimized Structures; Common Keys; Derived Values; Common Summaries; User Defined Data Sets; Standardization

Consumers: Marketing; Executives; Operational Systems; Customers; Partners; Knowledge Workers; Business Analysts; Data Scientists; Engineers

11

Architecture Trend #2

Users are Demanding a Variety of Capabilities

• Skills evolving

• Tools offer more capability

• Moving to a culture of analytics

12

Analytic Ecosystem – Styles, Engines, and Consumers

Consumers: Internal; Data Analysts; Power Users; Data Scientists; Customers; Autonomous Applications

Analytic Engines: SQL; Adaptive & Machine Learning; Advanced Analytics

Examples - Styles of Analytics

• Operational Reporting; Dashboard Reporting; Ad-Hoc Interactive Reporting; Query & Drill Down; Alerts

• Data Mining & Forecasting; Behavioral Analysis; Pattern Analysis; Predictive Modeling; Graph & Relationship Analytics; Streaming Analytics; Optimization; Rapid Hypothesis Testing

• Text Processing; Natural Language Processing; Image Processing; Machine Learning; Deep Learning; Adaptive Learning

Decision Factors for Right Engine?

Interoperability; Extensibility; Portability; Supportability; Industry Accepted; Processing Scale; Service Level; Trust; Data Protection

13

Reference Information Architecture Usage Patterns

[Diagram: REFERENCE INFORMATION ARCHITECTURE, with usage patterns overlaid]

Cross-cutting: Security, Workload Management

Tiers: Acquisition; Integration; Access

Components: Landing; Shared Views & Services; Optimized Structures; Common Keys; Derived Values; Common Summaries; User Defined Data Sets; Standardization

Consumers: Marketing; Executives; Operational Systems; Customers; Partners; Knowledge Workers; Business Analysts; Data Scientists; Engineers

Usage patterns: Power Users; Data Scientists and Engineers; BI Users

14

Coffee?

15

Architecture Trend #3

Yesterday’s News Is No Longer Acceptable

• Remember when you waited until the morning newspaper to see the sports scores?

• How is that acceptable for your business?

• Support streaming ingest
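
To make the sports-score analogy concrete, here is a minimal sketch of streaming ingest: each event updates the result as it arrives instead of waiting for a nightly batch. The generator `score_events` is a stand-in for a real stream such as a message queue, and all names and data are assumptions for illustration.

```python
from collections import Counter
from typing import Iterable, Iterator


def score_events() -> Iterator[dict]:
    """Stand-in for an event stream; the events are hypothetical."""
    yield {"team": "home", "points": 2}
    yield {"team": "away", "points": 3}
    yield {"team": "home", "points": 1}


def streaming_scoreboard(events: Iterable[dict]) -> Iterator[dict]:
    """Update the scoreboard per event and emit the current state each time."""
    totals = Counter()
    for event in events:
        totals[event["team"]] += event["points"]
        yield dict(totals)  # consumers see a fresh view after every event


snapshots = list(streaming_scoreboard(score_events()))
for snapshot in snapshots:
    print(snapshot)
```

A batch design would produce only the final snapshot, once, after all events had been collected; the streaming version makes every intermediate state available immediately.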

16

[Diagram: analytic ecosystem from Sources to Analytics and Consumers]

Operational, Technical & Business Metadata

Sources: Business Generated; Human Generated; Interaction Generated; Machine Generated

Zones: Landing Zone; Raw; Suspense; User Defined; BU1..n

Data: Master Data; Reference Data; Transaction & Event Data; Behavioral Data; Media Data

Views and stores: Regional and Departmental Views; Operational Analytics & Hot Views; ADS; Graph; Datamart

Analytics: Descriptive; Diagnostic; Prescriptive; Adaptive; Predictive

Consumers: Marketing; Executives; Operational Systems; Customers; Partners; Knowledge Workers; Business Analysts; Data Scientists; Engineers

17

[Diagram: analytic ecosystem from Sources to Analytics and Consumers, with platform labels added]

Operational, Technical & Business Metadata

Sources: Business Generated; Human Generated; Interaction Generated; Machine Generated

Zones: Landing Zone; Raw; Suspense; User Defined; BU1..n

Data: Master Data; Reference Data; Transaction & Event Data; Behavioral Data; Media Data

Views and stores: Regional and Departmental Views; Operational Analytics & Hot Views; ADS; Graph; SQL/NoSQL Datamart

Additional labels: DW; Hadoop; Other; DI

Analytics: Descriptive; Diagnostic; Prescriptive; Adaptive; Predictive

Consumers: Marketing; Executives; Operational Systems; Customers; Partners; Knowledge Workers; Business Analysts; Data Scientists; Engineers

18

[Diagram: analytic ecosystem from Sources to Analytics and Consumers, with platform and store labels added]

Operational, Technical & Business Metadata

Sources: Business Generated; Human Generated; Interaction Generated; Machine Generated

Zones: Landing Zone; Raw; Suspense; User Defined; BU1..n; Exploratory Zone

Data: Master Data; Reference Data; Transaction & Event Data; Behavioral Data; Media Data

Views and stores: Regional and Departmental Views; Operational Analytics & Hot Views; ADS; Graph; SQL/NoSQL Datamart

Additional labels: DW; Hadoop; Other; DI; Data Lake; Data Warehouse; Master Data Store; Data Projection

Analytics: Descriptive; Diagnostic; Prescriptive; Adaptive; Predictive

Consumers: Marketing; Executives; Operational Systems; Customers; Partners; Knowledge Workers; Business Analysts; Data Scientists; Engineers

19

Architecture Trend #4

Data Governance

Balancing Application Centricity vs. Data Centricity

• Your architecture must support both: a solid core foundation, along with business- and application-specific views of data
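
One way to picture "a solid core foundation" plus application-specific views is a shared table with views layered on top. The sketch below uses Python's built-in `sqlite3` module; the schema, view names and sample rows are all assumptions chosen for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Data-centric core: one shared, integrated customer table (hypothetical schema).
    CREATE TABLE core_customer (
        customer_id    INTEGER PRIMARY KEY,
        name           TEXT,
        region         TEXT,
        lifetime_spend REAL
    );
    INSERT INTO core_customer VALUES
        (1, 'Ada',   'EMEA', 1200.0),
        (2, 'Grace', 'AMER',  300.0),
        (3, 'Linus', 'EMEA',   50.0);

    -- Application-centric views: each application sees only the shape it needs.
    CREATE VIEW marketing_high_value AS
        SELECT customer_id, name FROM core_customer WHERE lifetime_spend >= 1000;
    CREATE VIEW emea_sales AS
        SELECT customer_id, lifetime_spend FROM core_customer WHERE region = 'EMEA';
""")

high_value = conn.execute("SELECT name FROM marketing_high_value").fetchall()
emea_total = conn.execute("SELECT SUM(lifetime_spend) FROM emea_sales").fetchone()[0]
print(high_value, emea_total)
```

Both views read from the same governed core, so a correction to `core_customer` is reflected in every application view without copying data.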

20

Data Centricity and Integration

21

Given the rise in churn rates, identify specific loyalty customers that have had a baggage claim with an unplanned change of routing, and where a cancellation or IROP (irregular operations) has occurred.

Communicate with customers that might be at risk for churn to proactively retain them before they share their experience on social media.

[Diagram labels: 2855; BOOKINGS; SCHEDULE; SOCIAL MEDIA; ACCOUNT; COMPLAINT; CUSTOMER; CHURN RATES]

Data sets: Bag Routing; Bookings; Customer Care; Baggage; Customer; Flight; IROP
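
The airline question above only becomes answerable when bookings, baggage, customer and IROP data are integrated on common keys. The following sketch shows the idea with `sqlite3`; the table layouts, column names and rows are hypothetical and greatly simplified.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical, simplified subject areas sharing common keys.
    CREATE TABLE customer (customer_id INTEGER, loyalty_tier TEXT);
    CREATE TABLE booking  (booking_id INTEGER, customer_id INTEGER, flight_id INTEGER);
    CREATE TABLE baggage_claim (booking_id INTEGER, rerouted INTEGER);
    CREATE TABLE irop (flight_id INTEGER, kind TEXT);

    INSERT INTO customer VALUES (1, 'gold'), (2, NULL), (3, 'silver');
    INSERT INTO booking  VALUES (10, 1, 100), (11, 2, 100), (12, 3, 200);
    INSERT INTO baggage_claim VALUES (10, 1), (11, 1), (12, 0);
    INSERT INTO irop VALUES (100, 'cancellation');
""")

# Loyalty customers with a rerouted-bag claim on a flight that had an IROP event.
at_risk = [
    row[0]
    for row in conn.execute("""
        SELECT DISTINCT c.customer_id
        FROM customer c
        JOIN booking b        ON b.customer_id = c.customer_id
        JOIN baggage_claim bc ON bc.booking_id = b.booking_id AND bc.rerouted = 1
        JOIN irop i           ON i.flight_id = b.flight_id
        WHERE c.loyalty_tier IS NOT NULL
        ORDER BY c.customer_id
    """)
]
print(at_risk)
```

In this toy data only customer 1 qualifies: customer 2 is not a loyalty member, and customer 3 had neither a rerouted bag nor an affected flight.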

22

Architecture Trend #5

Agility is more than methodology

• Self-service is a given

• Re-use promotes agile decisioning

• DevOps to automate testing, training and operationalization

23

A Story on Agility

24

Challenge With Analytic Discovery

25

Architecture Trend #6

The lines between Analytics and Operations are becoming blurred

• Analytical Ecosystems are no longer the last action in operational projects

• Perform processing and analysis closer to the source

26

Analytic Architecture Trend #7

API Enabled Analytics

• Creation of data products that can easily be consumed by calling applications or processes

• Service-based, with appropriate levels of security and obfuscation applied
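
A data product exposed as a service might look roughly like the function below, which a calling application would reach through an API layer. This is a sketch only: the score table, the `care_agent` role and the masking rule are assumptions invented for the example, and a real service would add authentication and transport.

```python
import hashlib

# Hypothetical scored records produced by an analytic model.
_CHURN_SCORES = {
    "C-1001": {"email": "ada@example.com", "churn_risk": 0.82},
    "C-1002": {"email": "grace@example.com", "churn_risk": 0.10},
}


def mask_email(email: str) -> str:
    """Obfuscate an email: keep the domain, replace the local part with a short hash."""
    local, _, domain = email.partition("@")
    digest = hashlib.sha256(local.encode("utf-8")).hexdigest()[:8]
    return f"{digest}@{domain}"


def get_churn_risk(customer_id: str, caller_role: str) -> dict:
    """Service-style entry point for calling applications.

    Only callers with the (assumed) 'care_agent' role see the clear email;
    every other caller receives an obfuscated value.
    """
    row = _CHURN_SCORES.get(customer_id)
    if row is None:
        return {"status": 404, "error": "unknown customer"}
    email = row["email"] if caller_role == "care_agent" else mask_email(row["email"])
    return {
        "status": 200,
        "customer_id": customer_id,
        "email": email,
        "churn_risk": row["churn_risk"],
    }


print(get_churn_risk("C-1001", caller_role="marketing_app"))
```

The calling application gets the analytic result (`churn_risk`) without needing direct access to the underlying store, and sensitive attributes are obfuscated according to who is asking.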

27

Analytic Architecture Trend #8

Data Virtualization (Federation) needs a plan!

• Lead with business use cases

• Consider whether the pattern is a query or a pipe

• Measure the key factors of volume of data, frequency of access, and delta of the data
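
The three factors named above (volume, frequency, delta) can be combined into a rough data-movement estimate. The heuristic below is an assumption made for illustration, not a rule from the deck: a federated "query" moves data on every access, while a replicating "pipe" moves only what changed.

```python
def daily_bytes_moved(volume_per_access: float, accesses_per_day: float,
                      delta_per_day: float) -> dict:
    """Rough daily data movement for each pattern (hypothetical cost model)."""
    return {
        "query": volume_per_access * accesses_per_day,  # federated: pay on every access
        "pipe": delta_per_day,                           # replicated: pay once per change
    }


def cheaper_pattern(volume_per_access: float, accesses_per_day: float,
                    delta_per_day: float) -> str:
    """Return the pattern that moves less data per day under this model."""
    moved = daily_bytes_moved(volume_per_access, accesses_per_day, delta_per_day)
    return min(moved, key=moved.get)


# Hypothetical workloads.
rare_lookups = cheaper_pattern(5e6, 2, 2e9)      # small, rare reads on fast-changing data
heavy_reads = cheaper_pattern(5e8, 200, 1e9)     # large, frequent reads on slower-changing data
print(rare_lookups, heavy_reads)
```

A real plan would also weigh latency, freshness requirements and source-system load, which this sketch deliberately leaves out.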

28

Data Virtualization Tier Pattern

External metadata processing layer above multiple source systems

– Denodo, Cisco Data Virtualization (Composite), Presto, etc.

Pros:

– Isolation of dependency on processes from any “other” source system

– Common access point

Cons:

– Limited optimization based on cost, predicate pushdown

– Additional processing layer for every access

[Diagram label: DATA VIRTUALIZATION]

© 2017 Teradata

29

Data Virtualization Pass-Thru Pattern

Pass-through integration from primary source to other system(s)

– QueryGrid, SAP HANA Vora, etc.

Pros:

– Advanced optimization techniques, predicate pushdown

– Uses minimal systems without execution tier

Cons:

– Inter-system dependency on shipping results

– Additional processing layer for every access
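
Predicate pushdown, listed in the pros and cons of the two patterns, means applying a filter at the source system so that fewer rows are shipped. The sketch below simulates this with an in-memory "remote" table; the data and function names are assumptions, and no real virtualization product is being modeled.

```python
# Hypothetical remote table held by a source system (every 4th row is EMEA).
REMOTE_ROWS = [{"id": i, "region": "EMEA" if i % 4 == 0 else "AMER"} for i in range(1000)]


def remote_scan(predicate=None):
    """Simulate the source system; returns (rows, number_of_rows_shipped)."""
    rows = [r for r in REMOTE_ROWS if predicate is None or predicate(r)]
    return rows, len(rows)


def is_emea(row):
    return row["region"] == "EMEA"


# Without pushdown: ship everything, then filter in the virtualization layer.
all_rows, shipped_without = remote_scan()
result_without = [r for r in all_rows if is_emea(r)]

# With pushdown: the filter runs at the source, so only matches are shipped.
result_with, shipped_with = remote_scan(predicate=is_emea)

print(shipped_without, shipped_with)
```

Both approaches return the same rows; the difference is how much data crosses the boundary between systems, which is why limited pushdown counts as a cost of the tier pattern.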

30

Architecture Oversight #1

Business Continuity Planning

• Define & test your recovery strategy

• Integrate it with your architecture governance practice

• Continually re-evaluate business impact and recovery priorities

• Cover both systems and applications

• Think beyond system to site and city

Conclusion & Takeaways

Data and Analytics is an ongoing evolution

User skills are evolving, and users have a variety of needs

Focus must be on business enablement

32 © 2014 Teradata

Questions and Answers