use the right tool -- r in infrastructure and business analytics
TRANSCRIPT
R in Infrastructure and Business AnalyticsAacutegnes Salaacutenki
ALWAYS USE THE RIGHT TOOL
ALWAYS USE THE RIGHT TOOL
ALWAYS USE THE RIGHT TOOLndash UNLESS ndash
YOU KNOW ONLYR
ndash THEN ndashALWAYS USE R
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Time
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
datatableRPostgreSQL
Rvest
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
ALWAYS USE THE RIGHT TOOL
ALWAYS USE THE RIGHT TOOL
ALWAYS USE THE RIGHT TOOLndash UNLESS ndash
YOU KNOW ONLYR
ndash THEN ndashALWAYS USE R
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Time
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
datatableRPostgreSQL
Rvest
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
ALWAYS USE THE RIGHT TOOL
ALWAYS USE THE RIGHT TOOLndash UNLESS ndash
YOU KNOW ONLYR
ndash THEN ndashALWAYS USE R
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Time
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
datatableRPostgreSQL
Rvest
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
ALWAYS USE THE RIGHT TOOLndash UNLESS ndash
YOU KNOW ONLYR
ndash THEN ndashALWAYS USE R
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Time
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
datatableRPostgreSQL
Rvest
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Time
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
datatableRPostgreSQL
Rvest
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Time
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
datatableRPostgreSQL
Rvest
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
Infrastructure Analytics
Business Analytics
Infrastructure Analytics
Business Analytics
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Time
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
datatableRPostgreSQL
Rvest
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
Infrastructure Analytics
Business Analytics
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Time
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
datatableRPostgreSQL
Rvest
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Time
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
datatableRPostgreSQL
Rvest
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
PhD student
Fault Tolerant Systems Research Group
Systems with availability of 9999
Data Analyst in Product Management
Secret Sauce Partners
Data-driven fashion industry
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Time
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
datatableRPostgreSQL
Rvest
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Time
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
datatableRPostgreSQL
Rvest
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
Continuous sampling Stream of events
User1
User2
Metric1
Metric2
time
time time
time
Time
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
datatableRPostgreSQL
Rvest
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
datatableRPostgreSQL
Rvest
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
Archived data csv
Read-filter-join
Archived data databases
Joins in SQL already
Scraping
datatableRPostgreSQL
Rvest
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
Long format
when where what value
Wide format
lots of additional information
User1 time
CPU
Memory
1 CPU
2 CPU
3 CPU
1 Mem
2 Mem
3 Mem
CPU Mem
1
2
3
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
Long format
when where what value
Wide format
lots of additional information
reshape2dplyr
User1 time
plyr
CPU
Memory
tidyr
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
ggplot2
geom_point geom_line
transformation linear scale
ggplot2
geom_tile geom_density
transformations log
ggplot2
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
Static pdf reports ExcelGoogle spreadsheets
Package development
rmarkdown
knitr
xlsxgooglesheets
devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr devtools
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
rmarkdownknitr
xlsxgooglesheetsggplot2
datatable
RPostgreSQL
Rvest
reshape2
dplyr
plyrtidyr
shiny
devtools
purrcaret stringr
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=
Confidence in data mungingDomain knowledgecuriosity Sense of achievement
+=