amanda casari, senior data scientist, concur at mlconf sea - 5/20/16
TRANSCRIPT
SCALING DATA SCIENCE PRODUCTS NOT DATA SCIENCE TEAMS
long winded views of scaling up @amcasari MLconf seattle, 2016may20
nasa
@
data science via random walks
senior product manager +
senior data scientist
@ Concur Labs
control systems
engineering +
robotics + legos
officer in USN
operations research
analyst
wandering dirtbag +
conservation volunteerEE +
applied math
+ complex systems
underwater robotics
consultant
extraordinaire
SAHM
@
?
• Is this a Product Design problem?
• Is this a Mathy ML problem?
• Is this a Software Engineering problem?
@
data science therapy: let’s talk about your problems
how do we keep new customers happy?
@
product design problem: managing user expectation
@
how are we going to maintain so many more models?
@
software engineering problem: scale a code base
@
how can we use the same code when we have different customer bases?
@
mathy ML problem: feature design
@
how accurate will your product be in the new market?
@
software engineering problem: test test test
@
how are you going to be personalized for a customer base you don’t know?
@
mathy ML problem: cold start
@
how do we know you are right?
@
product design problem: identifying feedback loops
@