Transcript
Page 1: Machine Learning and Big Data at Foursquare

Machine Learning and Big Data at FoursquareBlake Shaw, PhDData Scientist @ Foursquare@metablake

Page 2: Machine Learning and Big Data at Foursquare
Page 3: Machine Learning and Big Data at Foursquare

What is foursquare?

An app that helps you explore your city and connect with friends

A platform for location based services and data

Page 4: Machine Learning and Big Data at Foursquare

What is foursquare?

People use foursquare to:• check in to places• discover new places• share w/ friends• get tips about places • get deals• earn points and badges• keep track of visits

Page 5: Machine Learning and Big Data at Foursquare

What is foursquare?

Mobile Social

Local

Page 6: Machine Learning and Big Data at Foursquare

10,000,000+ people

25,000,000+ places

1,000,000,000+ check-ins

10,000+ actions/second

Stats

Page 7: Machine Learning and Big Data at Foursquare

Growth

Page 8: Machine Learning and Big Data at Foursquare

Growth

Page 9: Machine Learning and Big Data at Foursquare

Growth

Page 10: Machine Learning and Big Data at Foursquare

Learning with location data

• Check-ins are a rich source of data that describe human behavior

• We apply machine learning algorithms to the collective movement patterns of millions of people to build exciting new services

Page 11: Machine Learning and Big Data at Foursquare

Recommendation engine

•foursquare explore provides realtime recommendations using:• location• time of day• check-in history• friends preferences• venue similarities

Page 12: Machine Learning and Big Data at Foursquare

Signals about places

Page 13: Machine Learning and Big Data at Foursquare

Networks of people

Page 14: Machine Learning and Big Data at Foursquare

Networks of people

Brooklyn

SFManha-an

Australia

Page 15: Machine Learning and Big Data at Foursquare

Open questions

• How to measure similarity between people and places?• How to determine influence in large

networks of people and places?• What statistics can we use to describe

people’s behavior in the real-world?• How do we predict what information will be

timely and relevant to a user?

Page 16: Machine Learning and Big Data at Foursquare

Our data stack

• MongoDB• Amazon S3, Elastic Mapreduce• Hadoop• Hive• Flume• R and Matlab

Page 17: Machine Learning and Big Data at Foursquare

Join us!foursquare is hiring! 85+ people and growing

foursquare.com/jobs

Blake Shaw@[email protected]


Top Related