applications of r (dataweek 2014)
DESCRIPTION
Adoption of the R language has grown rapidly in the last few years, and is ranked as the number-one data science language in several surveys. This accelerating R adoption curve has been driven by the Big Data revolution, and the fact that so many data scientists — having learned R at university — are actively unlocking the secrets hidden in these new, vast data troves. In more than 6 years of writing for the Revolutions blog, I’ve discovered hundreds of applications of R in business, in government, and in the non-profit sector. Sometimes the use of R is obvious, and sometimes it takes a little bit of detective work to learn how R is operating behind the scenes. In this talk, I'll recount some of my favourite applications of R, and show how R is behind some amazing innovations in today’s world.TRANSCRIPT
Applications of RHow companies use data science to succeed
David Smith @revodavid
DataWeek San Francisco, September 17 2014
Chief Community OfficerRevolution Analytics
What is R?
Most widely used data analysis software• Used by 2M+ data scientists, statisticians and analysts
Most powerful statistical programming language• Flexible, extensible and comprehensive for productivity
Create beautiful and unique data visualizations• As seen in New York Times, Twitter and Flowing Data
Thriving open-source community• Leading edge of analytics research
Fills the talent gap• New graduates prefer R
www.revolutionanalytics.com/what-r
3
R’s popularity is growing rapidly
R Usage GrowthRexer Data Miner Survey, 2007-2013
• Rexer Data Miner Survey • IEEE Spectrum, July 2014
#9: R
Language PopularityIEEE Spectrum Top Programming Languages
4
R is among the highest-paid IT skills in the US
• Dice Tech Salary Survey, January 2014
• O’Reilly Strata 2013 Data Science Salary Survey
Applications of R
5
• Exploratory Data Analysis
• Experimental Analysis
“Generally, we use R to move fast when we get a new data set. With R, we don’t need to develop custom tools or write a bunch of code. Instead, we can just go about cleaning and exploring the data.” — Solomon Messing, data scientist at Facebook
• Big-Data Visualization
“It resonated with many people. It's not just a pretty picture, it's a reaffirmation of the impact we have in connecting people, even across oceans and borders.” — Paul Butler, data scientist, Facebook
8
“The great beauty of R is that you can modify it to do all sorts of things.” — Hal VarianChief Economist, Google
• Advertising Effectiveness
“R is really important to the point that it's hard to overvalue it.” — Daryl Pregibon Head of Statistics, Google
• Economic forecasting
9
Calculating ROI for Marketing campaigns
CausalImpact: Bayesian structural time-series models
10
The New York Times
Interactive Features
• Election Forecast• Dialect Quiz
Data Journalism
• NFL Draft Picks• Wealth distribution in USA
11
The New York Times
Data Visualization
• Facebook IPO• Baseball legends
12
• Data Visualization • Semantic clustering
“A common pattern for me is that I'll code a MapReduce job in Scala, do some simple command-line munging on the results, pass the data into Python or R for further analysis, pull from a database to grab some extra fields, and so on, often integrating what I find into some machine learning models in the end” — Ed Chen, Data Scientist, Twitter
13
Pub
lic A
ffairs
• Casualty estimation in Warzones • Political Analysis
14
Wea
ther
and
Clim
ate
• Flood Warnings• Climate change forecasts
15
Video Gaming
• Multiplayer Matchmaking
• Player Churn• Game design
• Difficulty curve• Level trouble-spots
• In-game purchase optimization• Fraud detection• Player communities
• Game Analysis
Vid
eo G
ames
16
Fin
ance
and
Ban
king
• Credit Risk Analysis • Financial Networks
17
Com
pani
es U
sing
RSocial media
GoogleFacebook
TwitterFoursquareKickstartereHarmony
Finance
American Century
ANZCredit SuisseNationwide
LloydsBofA
Media
New York Times
EconomistNew Scientist
XBox
Software Vendors
Revolution AnalyticsRstudio
ZementisAlteryxSAPIBMHP
SASTeradataTIBCOOracle
OneTickDataCamp
Services
MangoAccenture
DeloitteScientific RevenueOpenBI
Coursera
Analytics
ZillowTrulia
DataSongExelate
X+1PredictWise
Government
FDACPFB
City of ChicagoNOAANIST
Public Affairs
HRDAGSunlight
FoundationBenetech
RealClimate
Other
FordJohn DeereMonsantoNordstrom
UberEtsy
www.revolutionanalytics.com/companies-using-r
18
OUR COMPANY
The leading providerof advanced analytics software and services
based on open source R, since 2007
OUR SOFTWARE
The only Big Data, Big Analytics software platform based on the data science
language R
SOME KUDOS
VisionaryGartner Magic Quadrantfor Advanced Analytics
Platforms, 2014
Thank YouDavid Smith
Download these slides from:blog.revolutionanalytics.com/2014/09/dataweek.html
[email protected], @revodavid