Big Data Universe - How we design architectures

How we design data architecture Mate Gulyas

Upload: mate-gulyas

Post on 12-Jan-2017

TRANSCRIPT

Page 1: Big Data Universe - How we design architectures

How we design data architecture

Mate Gulyas

Page 2: Big Data Universe - How we design architectures

CTO & Co-Founder

GULYÁS MÁTÉ

@gulyasm

Page 3: Big Data Universe - How we design architectures

ARCHITECTURE?

●CODE ARCHITECTURE

●GENERAL INFRASTRUCTURE

●DATA INFRASTRUCTURE

Page 4: Big Data Universe - How we design architectures

ON THE NEXT EPISODE OF BIG DATA...

1. WHAT DO WE DESIGN FOR?

2. OUR STORY, OUR FAILURES

Page 5: Big Data Universe - How we design architectures

WHAT DO WE DESIGN FOR?

Page 6: Big Data Universe - How we design architectures

WHAT DO WE DESIGN FOR?

●SCALABILITY

●MAINTAINABILITY

●COST

Page 7: Big Data Universe - How we design architectures

SCALABILITY AND MAINTAINABILITY ARE RESULTS OF A GOOD DESIGN

Page 8: Big Data Universe - How we design architectures

WHAT DO WE REALLY DESIGN FOR?

●SIMPLICITY

●RESILIENCY

●SMALL ITERATIONS

●SELF SERVICE

Page 10: Big Data Universe - How we design architectures

SIMPLICITY

SIMPLE THINGS SCALE WELL

Page 11: Big Data Universe - How we design architectures

SIMPLICITY

SIMPLE THINGS ARE EASY TO UNDERSTAND

Page 12: Big Data Universe - How we design architectures

SIMPLICITY

BORING TECHNOLOGY IS GOOD TECHNOLOGY

Page 13: Big Data Universe - How we design architectures

SMALL ITERATIONS

THE UNKNOWNS

●THE UNKNOWNS

●THE UNKNOWN UNKNOWNS

Page 14: Big Data Universe - How we design architectures

SMALL ITERATIONS

Page 15: Big Data Universe - How we design architectures

END RESULT

Page 16: Big Data Universe - How we design architectures

SMALL ITERATIONS

Page 17: Big Data Universe - How we design architectures

SMALL ITERATIONS

Page 18: Big Data Universe - How we design architectures

SMALL ITERATIONS

Page 19: Big Data Universe - How we design architectures

SMALL ITERATIONS

Page 20: Big Data Universe - How we design architectures

SMALL ITERATIONS

Page 21: Big Data Universe - How we design architectures

SMALL ITERATIONS

Page 22: Big Data Universe - How we design architectures

SELF SERVICE

YOUR SOFTWARE/IT INFRASTRUCTURE IMPACTS THE WHOLE ORGANIZATION

Page 23: Big Data Universe - How we design architectures
Page 24: Big Data Universe - How we design architectures
Page 25: Big Data Universe - How we design architectures

ENBRITELY DATA PLATFORM

Page 26: Big Data Universe - How we design architectures

Product placeholder

Page 27: Big Data Universe - How we design architectures

Luigi TOOLS

Luigi + enbrite.ly extensions = Gabo Luigi

WORKFLOW ENGINE
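
As a rough illustration of what a workflow engine does, the sketch below runs a task only after its dependencies and skips tasks that are already complete. It is plain Python written for this transcript, not Luigi's API and not Gabo Luigi; the `Task` class, the `run` function, and the task names are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Set


@dataclass
class Task:
    """A unit of work with named upstream dependencies (hypothetical, not Luigi's Task)."""
    name: str
    action: Callable[[], None]
    requires: List[str] = field(default_factory=list)


def run(target: str, tasks: Dict[str, Task], done: Set[str]) -> List[str]:
    """Run `target` after its dependencies, skipping tasks already in `done`.

    Returns the names of the tasks executed, in execution order.
    """
    executed: List[str] = []

    def visit(name: str, stack: tuple) -> None:
        if name in done:
            return  # already complete, nothing to do
        if name in stack:
            raise ValueError(f"dependency cycle at {name!r}")
        task = tasks[name]
        for dep in task.requires:
            visit(dep, stack + (name,))
        task.action()
        done.add(name)
        executed.append(name)

    visit(target, ())
    return executed


# Hypothetical three-step pipeline: ingest -> clean -> report.
log: List[str] = []
tasks = {
    "ingest": Task("ingest", lambda: log.append("ingest")),
    "clean": Task("clean", lambda: log.append("clean"), requires=["ingest"]),
    "report": Task("report", lambda: log.append("report"), requires=["clean"]),
}

done: Set[str] = set()
first_run = run("report", tasks, done)
second_run = run("report", tasks, done)  # everything is complete, so nothing reruns
```

Asking for `report` pulls in `ingest` and `clean` first; asking again does no work, which is the property that makes reruns after a partial failure cheap.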

Page 28: Big Data Universe - How we design architectures

Tools we created

GABO LUIGI

Page 29: Big Data Universe - How we design architectures

Spark TOOLS

0.5-4TB daily data

1-10B events

Ad-hoc batch queries: 20TB data

Page 30: Big Data Universe - How we design architectures

Spark TOOLS

●SPENT 3 MONTHS OPTIMIZING IT

●20+ NODE CLUSTERS

●UNIT TESTS
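
The slide does not say how the Spark jobs were unit tested. One common approach, shown here as an assumption rather than as Enbritely's method, is to keep per-record logic in plain functions that can be tested without a cluster, and to pass those same functions to Spark transformations such as `map` and `filter` in the real job. The event format and the function names below are hypothetical.

```python
from collections import Counter
from typing import Dict, Iterable, Optional


def parse_event(line: str) -> Optional[Dict[str, str]]:
    """Parse one tab-separated 'timestamp, client, event_type' line.

    Returns None for malformed lines (wrong field count or an empty field).
    """
    parts = line.rstrip("\n").split("\t")
    if len(parts) != 3 or not all(parts):
        return None
    timestamp, client, event_type = parts
    return {"timestamp": timestamp, "client": client, "event_type": event_type}


def count_by_type(lines: Iterable[str]) -> Counter:
    """Count well-formed events per event type, ignoring malformed lines."""
    events = (parse_event(line) for line in lines)
    return Counter(e["event_type"] for e in events if e is not None)


# Small in-memory sample standing in for a day's worth of input (hypothetical data).
sample = [
    "2016-02-01T10:00:00\tclient-a\tclick\n",
    "2016-02-01T10:00:01\tclient-a\tview\n",
    "broken line\n",
    "2016-02-01T10:00:02\tclient-b\tclick\n",
]
counts = count_by_type(sample)
```

Because `parse_event` has no Spark dependency, its edge cases can be checked in milliseconds on a laptop before the job ever touches a 20+ node cluster.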

Page 31: Big Data Universe - How we design architectures

AWS TOOLS

●16 services

●110+ machines

●1-4 EMR clusters (1-20 nodes)

●100TB+ on S3

●Each client has separate infrastructure

Page 32: Big Data Universe - How we design architectures
Page 33: Big Data Universe - How we design architectures

HOW DID WE GET HERE?

2014: MONOLITHIC PYTHON ANALYTICS

2015 JAN: EVALUATE BIG DATA TECHNOLOGIES

2015 SEP: STARTED WORK ON DP (DATA PLATFORM)

2016 FEB: DP PRODUCTION READY

2016 JUL: SAAS DP

Page 34: Big Data Universe - How we design architectures

HAVE FUN!

Page 35: Big Data Universe - How we design architectures

PRACTICE AT HOME

Page 36: Big Data Universe - How we design architectures

WE ARE HIRING!

Page 38: Big Data Universe - How we design architectures

MATE GULYAS

@gulyasm @enbritely

THANK YOU!