lecture 14: visualization - github pages

Post on 23-Oct-2021


Harvard IACS

CS109A

Pavlos Protopapas, Kevin Rader, and Chris Tanner

A vehicle for EDA and communication

Lecture 14: Visualization

As the matplotlib thickens …

CS109A, PROTOPAPAS, RADER, TANNER 3

ANNOUNCEMENTS

• Homework 3 is due Oct 7 (Wed) @ 11:59pm

• Didn’t submit Milestone 1? Want to change project selection? E-mail the helpline at cs109a2020@gmail.com

• After lecture, please update your Zoom to the latest version (5.3.1).

Learning Objectives

• Understand why visualization/plotting is important

• Learn aspects that tend to make visualizations effective and ineffective

• Feel comfortable designing plots that best convey your message

• Gain experience in producing plots with Python

Extra Goal: be more cognizant of broader design choices (e.g., typography, spacing, colors)

Agenda

Communication

Motivation

Key Considerations

EDA Refresher

Effective Visualization

Graphical Integrity

Scope

Displays

Sensible Design


Assume you know a given dataset is credible, complete with the info you want, and has no missing values.

Why do further EDA?

Purposes of EDA:

• Maximize insight into a dataset

• Uncover underlying structure

• Detect outliers

• Test underlying assumptions

• Develop parsimonious models


EDA Refresher: English Premier League

name               club     age  position  market value
Alexis Sanchez     Arsenal  28   LW        65
Mesut Ozil         Arsenal  28   AM        50
Petr Cech          Arsenal  35   GK        7
Theo Walcott       Arsenal  28   RW        20
Laurent Koscielny  Arsenal  31   CB        22

from www.transfermarkt.us
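A first EDA pass on these rows could be a simple labeled scatter plot. The sketch below is only an illustration, not the lecture's own code: it uses just the five rows listed above, the names `players` and `plot_age_vs_value` are made up for this example, and the market value axis is left without a unit because the table does not state one.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend, so no display is required
import matplotlib.pyplot as plt

# The five rows from the table: (name, club, age, position, market value).
# Market value is in whatever unit the source table uses (not stated there).
players = [
    ("Alexis Sanchez", "Arsenal", 28, "LW", 65),
    ("Mesut Ozil", "Arsenal", 28, "AM", 50),
    ("Petr Cech", "Arsenal", 35, "GK", 7),
    ("Theo Walcott", "Arsenal", 28, "RW", 20),
    ("Laurent Koscielny", "Arsenal", 31, "CB", 22),
]


def plot_age_vs_value(rows):
    """Scatter market value against age and label each point with the player name."""
    fig, ax = plt.subplots(figsize=(6, 4))
    ages = [row[2] for row in rows]
    values = [row[4] for row in rows]
    ax.scatter(ages, values)
    for name, _club, age, _position, value in rows:
        # Offset each label slightly so it does not sit on top of its marker.
        ax.annotate(name, (age, value), textcoords="offset points",
                    xytext=(5, 5), fontsize=8)
    ax.set_xlabel("Age")
    ax.set_ylabel("Market value")
    ax.set_title("EPL sample: market value vs. age")
    fig.tight_layout()
    return fig, ax


fig, ax = plot_age_vs_value(players)
```

Calling `fig.savefig("epl_sample.png")` afterwards would write the figure to disk; the file name is arbitrary.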


Are the outliers legit?
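One common way to surface candidate outliers before judging whether they are legitimate is the 1.5 × IQR rule. The sketch below assumes that rule; the slides do not say which method was used, and the function name `iqr_outliers` and the second list of numbers are hypothetical, chosen only to show a value being flagged.

```python
import statistics


def iqr_outliers(values, k=1.5):
    """Return the values lying more than k * IQR outside the quartiles."""
    # statistics.quantiles with n=4 returns the three quartile cut points.
    q1, _median, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


# Market values of the five Arsenal players listed in the table.
sample_values = [65, 50, 7, 20, 22]
flagged_sample = iqr_outliers(sample_values)

# Hypothetical numbers (not from the dataset) with one obviously extreme value.
made_up_values = [10, 12, 11, 13, 12, 11, 10, 95]
flagged_made_up = iqr_outliers(made_up_values)
```

A flagged value is only a prompt to check the source: a star player's market value may be extreme and still correct.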


DISCLAIMER. Some of these examples involve political data. In no way should this be taken as a signal of my support or endorsement in any beliefs; the point is merely to convey good and bad choices when it comes to effective visualization.
