lean data architecture: minimize investment, maximize value brandon byars [email protected]...

42
Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars [email protected] m @BrandonByars Manvir Grewal [email protected] @ManvirGrewal

Upload: natalie-bennett

Post on 31-Dec-2015

222 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Lean Data Architecture:Minimize Investment,

Maximize Value

Brandon [email protected]@BrandonByars

Manvir [email protected]@ManvirGrewal

Page 2: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Brave New World

Every 2 days we create as much informationas we did from the dawn of civilization to 2003

- Eric Schmidt, 2011

Big Data• Volume• Velocity• Variety

NoSQL•Schemaless•Non-relational

Page 3: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Future, Backwards

Page 4: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Data Platform

Page 5: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Support variety of analytics/users

Page 6: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Ubiquitous access

Logicwolf.com

Page 7: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Evolves as business changes

Page 8: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Sound good…So what's stopping you?

Page 9: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Companies lose $8.2 Million annually due to bad data – Gartner

Page 10: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Study estimated total cost to US economy of data quality problems at over US$600 billion per annum - Wayne Eckerson

Page 11: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

SpreadMarts

Page 12: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com
Page 13: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Didn’t test well enough…

Page 14: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Upfront and Inflexible design

Vk.com

Page 15: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Bring everything, we may need it

Page 16: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com
Page 17: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Manual Testing is good for…

Page 18: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Not good for data or software

Page 19: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Spreadsheet comparisons…over and over!

Page 20: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Where are your bottlenecks?

Page 21: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com
Page 22: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

The Data Focus Problem

"Data warehousing and BI is one of the largest expenses in CIO budgets, and to-date has had limited value to the business."

- Jeff Smith, CEO Shared Services, Suncorp

Gartner estimates that between 70-80% of BI projects are failures

Page 23: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

How do we get to the future?

Wallchan.com

Page 24: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

App Analysis Modeling DBA ETL Testing Report

2 weeks4 weeks

2 weeks4 weeks

2 weeks4 weeks

2 weeks4 weeks

2 weeks4 weeks

2 weeks4 weeks

2 weeks

Customer

Page 25: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Value Streams

App Analysis Modeling

Page 26: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Systems Thinking

How long would it take you to add a single column to a report?

From Idea to Prod…

Page 27: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Muda

Page 28: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Eight Wastes

Logismarket.de

Page 29: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

BI/DW: Status Quo

Data

Page 30: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Lean BI: Value First

value

Cost & Time

100%

100%

Page 31: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

MVP

Sometimes we accidentally build something that nobody wants, in which case it doesn’t matter if we do it on time and on budget

Prototype reportDashboard populated with static dataRudimentary scoring model

Page 32: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Reduce Batch Size

Dev2ops.org

Page 33: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Reduce WIP

Page 34: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Work In Progress

L = λW

Page 35: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Simplify Architecture – Message Based

ReportsListenerBus

DB

DB

DB DB

Page 36: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Automated Testing

Page 37: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

How do we test?

Test data set based on profilingVolume is not necessaryVariety is necessaryVariability/Complexity is necessaryVelocity is necessary

System testingPlay transactions from sources in UAT/pre-prod environment

Page 38: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Developer Sandboxes

Page 39: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Automated Build

Page 40: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Continuous Integration

Page 41: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

References

Page 42: Lean Data Architecture: Minimize Investment, Maximize Value Brandon Byars bbyars@thoughtworks.com @BrandonByars Manvir Grewal mgrewal@thoughtworks.com

Data Explosion is coming!