![Page 1: Skillshare - Let's talk about R in Data Journalism](https://reader038.vdocuments.us/reader038/viewer/2022100508/55d39e0fbb61eb05278b4836/html5/thumbnails/1.jpg)
Let’s Talk About R
An Introduction for Data-Driven Journalism
Presented by David Selassie Opoku (@sdopoku)
11 August 2015
Outline
1. TaRget audience
2. About R: What is R?
3. Example Use Case & Best PRactices
4. Setup & RStudio
5. Resources
Target Audience
R is a great tool for anyone who works with data
● Data journalists
● School of Data fellows
● Open Data enthusiasts
● People curious about or new to R
● Statisticians
About
What is R?
1. Open source
2. A programming language and environment for statistical computing & graphics
3. More than just statistics and graphics
4. A wealth of functionality through add-on packages
5. RStudio: a powerful integrated development environment (IDE)
R vs. Spreadsheet-like software
1. More powerful data manipulation capabilities
2. It reads almost any type of data
3. Easier automation & faster computation
4. It supports larger data sets
5. Advanced statistical capabilities
6. State-of-the-art graphics with packages such as ggplot2
7. It runs on many platforms
8. Anyone can contribute packages to improve its functionality
See: 14 Reasons Why R is Better Than Excel
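Points 1 and 3 above can be illustrated with a minimal sketch in base R. The data frame, its column names and the helper function `summarise_by_region` are made up for illustration and are not taken from the talk.

```r
# Hypothetical data: budget figures by region and year (invented for illustration).
budgets <- data.frame(
  region = c("North", "North", "South", "South", "East", "East"),
  year   = c(2014, 2015, 2014, 2015, 2014, 2015),
  budget = c(120, 150, 80, 60, 200, 210),
  stringsAsFactors = FALSE
)

# A reusable function replaces a manual copy-and-paste spreadsheet routine:
# total budget per region, sorted from largest to smallest.
summarise_by_region <- function(df) {
  totals <- aggregate(budget ~ region, data = df, FUN = sum)
  totals[order(-totals$budget), ]
}

region_totals <- summarise_by_region(budgets)
print(region_totals)
```

Because the steps live in a script, the same summary can be rerun unchanged whenever an updated data set arrives.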
Setup R & RStudio
Live demo: R & RStudio installation and the RStudio environment
R in the Data pipeline
Popular R Packages in the Data Pipeline
❖ Find & Obtain
➢ Quandl (finance & economics) | foreign (SAS, SPSS) | RODBC, RMySQL, RPostgreSQL, RSQLite (databases) | XLConnect, xlsx (Excel)
➢ Maps: sp, maptools, maps, ggmap
➢ Web: XML, jsonlite, httr
❖ Clean & Verify
➢ dplyr, tidyr (data manipulation) | stringr (regular expressions & strings) | lubridate (dates and times)
❖ Analyze
➢ car, randomForest, glmnet, caret
❖ Visualise
➢ ggplot2, ggvis, rgl, leaflet, htmlwidgets, shiny, googleVis
❖ Report
➢ shiny, R Markdown, xtable, knitr
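As a rough sketch of how a few of these packages fit together, the example below walks a tiny data set through the clean, analyze and visualise stages with dplyr and ggplot2. It assumes both packages are installed. The `enrolment` data and its column names are invented for illustration and do not come from the talk.

```r
# Assumes the dplyr and ggplot2 packages are installed:
# install.packages(c("dplyr", "ggplot2"))
library(dplyr)
library(ggplot2)

# Hypothetical data standing in for the "Find & Obtain" step.
enrolment <- data.frame(
  district = c("A", "A", "B", "B", "C", "C"),
  year     = c(2013, 2014, 2013, 2014, 2013, 2014),
  pupils   = c(1000, 1100, NA, 950, 400, 380),
  stringsAsFactors = FALSE
)

# Clean & Verify, then Analyze: drop rows with missing values and
# compute the average number of pupils per district.
district_means <- enrolment %>%
  filter(!is.na(pupils)) %>%
  group_by(district) %>%
  summarise(mean_pupils = mean(pupils)) %>%
  arrange(desc(mean_pupils))

# Visualise: a simple bar chart of the summary.
chart <- ggplot(district_means, aes(x = district, y = mean_pupils)) +
  geom_col() +
  labs(x = "District", y = "Average pupils",
       title = "Average enrolment by district")

print(district_means)
print(chart)
```

For the Report stage, the same code could be placed in an R Markdown document and rendered with knitr.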
Example Use Case
Resources
Resources - Individuals & Organisations
1. R Project
2. RStudio
3. DataCamp
4. Hadley Wickham - @hadleywickham
5. R-bloggers
6. Nathan Yau’s Flowing Data Tutorials
Resources - Tutorials, Articles & Books
Article: Data Analysts Captivated by R’s Power
Tutorials & Webinars
1. http://www.r-tutor.com/r-introduction
2. Code School’s Try R
3. 5 data visualizations in 5 minutes: each in 5 lines or less of R
4. RStudio Webinars
Cheatsheets: https://www.rstudio.com/resources/cheatsheets/
Books
1. R Cookbook (O'Reilly Cookbooks) by Paul Teetor
2. R Graphics Cookbook by Winston Chang
3. RStudio List of Training Books
References
1. What is R?
2. Beginner's guide to R: Introduction
3. How SAS, R & SPSS compare [infographic]
4. Comparison of R, Matlab, SciPy, Excel, SAS, SPSS, Stata
5. Garrett Grolemund’s Quick list of useful R packages
6. 14 reasons why R is better than Excel
7. An overview of RStudio Features