recommender trends 2014

36
@torbenbrodt #recsys Recommender Trends ACM RecSys 2014 Silicon Valley USA Torben Brodt plista GmbH inspired by .. Stammtisch Nov 13th 2014

Upload: torben-brodt

Post on 27-Jun-2015

851 views

Category:

Engineering


2 download

DESCRIPTION

Coming right from the Recommender Systems conference in San Francisco, I present some latest developments in the field of large scale recommendation engines and machine learning.

TRANSCRIPT

Page 1: Recommender Trends 2014

@torbenbrodt #recsys

Recommender TrendsACM RecSys 2014Silicon Valley USA

Torben Brodtplista GmbH

inspired by ..StammtischNov 13th 2014

Page 2: Recommender Trends 2014

@torbenbrodt #recsys

Silicon Valley

Image by New Media at the University of Maine

Page 3: Recommender Trends 2014

@torbenbrodt #recsys

RecSys 2014 was ..

● 1 day workshop● 3 day tech conference (see )● 1 day conference

Page 4: Recommender Trends 2014

@torbenbrodt #recsys

biased with my experience

● Head of Data Engineering● > 6y plista

○ News, advertising, real-time● Open!

DevOps MathCore

Page 5: Recommender Trends 2014

@torbenbrodt #recsys

Contents

1. Product2. Algorithms3. Metrics4. Openness5. Crazy Stuff6. Missing

Page 6: Recommender Trends 2014

@torbenbrodt #recsys

Product, ”Data Driven Decisions”

“We take a proposal for an original production or for a piece of content we’re going to buy and we plug in all the data we can abou tit into our models. We’re able to predict reach and hours for that piece of content even before it exists with reasonable precision in a way that helps us to say, ‘this is worth funding’ or ‘that’s not worth funding,’ ”

NEIL HUNT Netflix

Page 7: Recommender Trends 2014

@torbenbrodt #recsys

Product, “Search & Recommendationshould (not?) converge”

HECTOR GARCIA-MOLINAProfessor, Stanford University

DEBORA DONATOStumbleUpon

Page 8: Recommender Trends 2014

@torbenbrodt #recsys

Product, “Use Human Experts”

ERIC COLSONStitch Fix

Humans send you customized outfits. Machines suggest clothes and judge stuff.

Page 9: Recommender Trends 2014

@torbenbrodt #recsys

Product, “Explain your knowledge”

● Xbox explains why their recommendations are utile

● Cortana builds ML model of user and still allows to change it

Build Trust!

Page 10: Recommender Trends 2014

@torbenbrodt #recsys

Product, “Care about Privacy”

once you lose your customer because of privacy, you will never get him back

solutions● store user history on client side● ..

Page 11: Recommender Trends 2014

@torbenbrodt #recsys

Product, ”Allow User Interaction”

HECTOR GARCIA-MOLINAProfessor, Computer Science and Electrical Engineering Departments of Stanford University

Page 12: Recommender Trends 2014

@torbenbrodt #recsys

Product, “active learning”

Why do vague passive learning when you can ask the user?

.. implicitly or explicitly

http://en.wikipedia.org/wiki/Active_learning_(machine_learning)

SMRITI BHAGATTechnicolor

Page 13: Recommender Trends 2014

@torbenbrodt #recsys

Algorithms

Page 14: Recommender Trends 2014

@torbenbrodt #recsys

Algorithms, ”Matrix Factorization”

[...] faster by replacing inner product with PCA trees

NOAM KOENIGSTEINMicrosoft R&D

Page 15: Recommender Trends 2014

@torbenbrodt #recsys

Algorithms, “Ensembles”

● Multi Armed Bandits● Ensemble Methods● Global Optimization

https://github.com/Yelp/MOE

Page 16: Recommender Trends 2014

@torbenbrodt #recsys

Algorithms, “How does MOE work”

DR. SCOTT CLARKYelp

1. Build Gaussian Process (GP) with points sampled so far

2. Optimize covariance hyperparameters of GP3. Find point(s) of highest Expected Improvement

within parameter domain4. Return optimal next best point(s) to sample

https://github.com/Yelp/MOE

Page 17: Recommender Trends 2014

@torbenbrodt #recsys

Algorithms, “Topic Modelling”

● LDA is standard● datascience tasks

○ where to cut○ how many topics

● where to use?

http://en.wikipedia.org/wiki/Topic_model

Page 18: Recommender Trends 2014

@torbenbrodt #recsys

Algorithms, “Content”

● Sense identifiers (int) instead of keywords● Word sense disambiguation

Page 19: Recommender Trends 2014

@torbenbrodt #recsys

Metrics, “Stakeholders”

● Business Value● Consumer Value● Conflicting goals?● Diversity?

NEIL HUNT Netflix

Page 20: Recommender Trends 2014

@torbenbrodt #recsys

Metrics, “Dwell Time”

● Client Side implementation

● Yahoo ensures dwell-time is comparable across different context (device, etc)

● it correlates to clicks, but is more meaningful XING YI

Yahoo Labs

Page 21: Recommender Trends 2014

@torbenbrodt #recsys

Metrics, “Increasing signals”

Get the full lifetime journey● reservation● rating● billing / tipping

JEREMY SCHIFFOpenTable

Page 22: Recommender Trends 2014

@torbenbrodt #recsys

Openness, “Software Side”

Companies share software● credits to Twitter, Yelp, others

Finally Paper results can be reused (github)

Page 23: Recommender Trends 2014

@torbenbrodt #recsys

Openness, “Data Side”

Wikipedia, DBPedia, common crawl

Companies share Data & Challenges● credits to Netflix, Tmall, Criteo

Page 24: Recommender Trends 2014

@torbenbrodt #recsys

Openness, “Connectivity”

Everything is possible!To Me and to You

● Connect to Facebook○ access open graph

● Get Fulltext without 10k servers● Use Apache Mahout, Azure ML, etc

Page 25: Recommender Trends 2014

@torbenbrodt #recsys

Openness, “Connectivity”

● Give students the chance to learn

● CoLaboratory Notebook

http://venturebeat.com/2014/08/08/google-whips-up-a-chrome-app-to-let-data-scientists-work-together/

Page 26: Recommender Trends 2014

@torbenbrodt #recsys

Openness, “Connectivity”

● Azure Marketplace allows to exchange machine learning models

● RapidMiner makes workflows reproducable

https://datamarket.azure.com/browse/data

Page 27: Recommender Trends 2014

@torbenbrodt #recsys

Crazy Stuff

Industry Sessions…● Facebook News● Shopkick● Stumble Upon● climate institute● ...

Page 28: Recommender Trends 2014

@torbenbrodt #recsys

Crazy Stuff, “music genome project”

1 song = 450 musical characteristics from trained music analyst

ERIK M. SCHMIDTPandora

Page 29: Recommender Trends 2014

@torbenbrodt #recsys

Crazy Stuff, “LinkedIn A/B testing”

● XLNT Platform● Key Component !● Continuous Deployment

YA XULinkedInhttps://engineering.linkedin.com/ab-testing/xlnt-

platform-driving-ab-testing-linkedin

Page 30: Recommender Trends 2014

@torbenbrodt #recsys

Crazy Stuff, “Google Deep Learning”

● Application?○ Pixels, Audio, Searches,

Translation● Embeddings● Language Models● Scalability

JEFF DEANGoogle

Page 31: Recommender Trends 2014

@torbenbrodt #recsys

Missing? “Uncovered Topics”

Page 32: Recommender Trends 2014

@torbenbrodt #recsys

Missing, “Probabilistic Data Structures”

probabilistic counting, hyperLogLog, etc

http://research.neustar.biz/https://streamdrill.com/

Page 33: Recommender Trends 2014

@torbenbrodt #recsys

Missing

Large Scale?● Computational Costs● Real-Time Recs

Page 34: Recommender Trends 2014

@torbenbrodt #recsys

Questions?

Torben Brodtplista GmbH

Page 35: Recommender Trends 2014

@torbenbrodt #recsys

● hard to convince mgmt (?!)● start measuring

example● coupons 1/week might

decrease revenueJEREMY SCHIFFOpenTable

Metrics, “Long Term Satisfaction”

Page 36: Recommender Trends 2014

@torbenbrodt #recsys

Resume, ”we enhance services”

Large Size Companies cannot exist without data science● Netflix● Zalando● etc