exploring data
DESCRIPTION
Exploring Data. G oals of the day: Develop an understanding of NGSS & CCSS as it relates to data and data analysis Work with the data literacy framework Discuss ways to we can work together as a k-12 team to support students and each other to use data regularly and effectively. Problem: - PowerPoint PPT PresentationTRANSCRIPT
Exploring Data
Goals of the day:• Develop an understanding of NGSS & CCSS as it relates to data
and data analysis• Work with the data literacy framework• Discuss ways to we can work together as a k-12 team to
support students and each other to use data regularly and effectively.
Problem:
If you are a Park Ranger stationed at Old Faithful in Yellowstone National Park, how would you answer Park visitors when they arrive at Old Faithful and ask you "How long until the next eruption?" What could you tell visitors hoping to watch the geyser erupt to help them decide whether to wait or to go visit the restroom or get lunch first?
To see if the time between eruptions follows a predictable pattern, graduate students observed Old Faithful around the clock for two weeks in August, 1990, and measured the time in minutes between every eruption. The table presents the minutes they recorded for each 24-hour period during the two weeks.
How would you use these data to give Park visitors a useful answer to their question?
Let’s remind outselves of the Principles of the Framework (and NGSS)
• Children are born investigators• The focus is on core ideas and practices
and crosscutting concepts.• Understanding develops over time• Science and engineering require both
knowledge and practice.• Ideas have to connect to students’
interests and experiences• We must promote equity
The Acadia Learning Data Literacy Project
A framework for using data in science classroomsDr. Molly Schauffler, University of Maine Climate Change Institute, Dept. of Earth and Climate Sciences, and Center for Research in STEM Education
Hannah Webber, Education Program Manager, Schoodic Institute
Dr. Sarah Nelson, University of Maine Sen. George Mitchell Center for Environmental &Watershed Research and Dept. of Plant, Soil, and Environmental Sciences
Ryan Weatherbee, Research Associate, University of Maine School of Marine Sciences; Graduate student, Center for Research in STEM Education
Bill Zoellick, Director of Education Research, Schoodic Institute
A varied team of grade 6- 12 classroom teacher leaders
Why Data Literacy is Hard
The word
“Data”is plural
He is 180 cm tall
(that is a “fact”!)
How tall is this boy?
180 cm
152 cm146 cm 140 cm
How tall is this group of children?Now we have “data”
(and a more complicated answer)
Which boy is taller?
… a pretty easy question to answer
Which group is taller?
… a more complicated question to answer
Talking about a group usuallymeans dealing with variability
1. Visualize variability:
How data are distributed (dot plot, box and whisker
plot, histogram)
2. Describe variability
in the data using statistical language
(distribution range, shape, measure of center)
3. Display, describe, and interpret
the data in the context of a question
4. Explain how the evidence
supports the interpretation, accounting for variability in
the data
1. Visualize variability:
How data are distributed (dot plot, box and whisker
plot, histogram)
State Average Monthly Consumption (kWh)
Connecticut 731Maine 531Massachusetts 627New Hampshire 615Rhode Island 597Vermont 565New Jersey 691New York 603Pennsylvania 837Illinois 767Indiana 997Michigan 676Ohio 895Wisconsin 703Iowa 873Kansas 945Minnesota 793Missouri 1,060Nebraska 1,000North Dakota 1,091South Dakota 980Delaware 942District of Columbia 721Florida 1,081
(Each dot represents a state)
• The powerful thing about frequency graphs is that they provide a way to show all separate facts in a group of data in a single picture.
• Graphs are a visual language for telling stories about data.
• Success in making sense of data depends on learning to use the “picture language” of graphs.
Visualize Variability
Visualize Data
"How long until the next eruption?"
2. Describe variability
in the data using statistical language
(distribution range, shape, measure of center)
Describe Variability
"Hurricanes in 2012 had wind speeds between 80 and 115 knots. There were 10 hurricanes
in 2012, and their average wind speed was 97 knots. Within the range they were spread out,
not very clumped."
How do these words describe and what do they say about the data?
K-2Compare, more, less, how many, total,
3-5Difference, middle, spread out, clumped
6-8 Mean, median, mode, center, range, spread, shape, variability, pattern, sample, random, outlier, frequency, probability, bivariate, scattered, clustering, positive or negative correlation, linear relationship,nonlinear relationship, slope, closeness to line, categorical, numerical, shifted,biased, even, slightly
Describing Words
9-12Distribution related to center (mean, median), spread (interquartile range, standard deviation), normal, skewed, correlation coefficient, correlation vs. causation, bi-modal
In the first years of this project …we learned that although many students had learned some of the elements of this language …they did not know how to understand it or “speak” it.
Example: Compare two groups
Do rocky streams have more dragonflies in them than mucky streams do?
Example: Show and discuss relationship between two variables
Is mercury concentration correlated with fish weight?
“Their hypothesis is wrong because all the points don’t fall on the line”. Ninth grader
Example: Interpret a scatter plot
Is there a correlation between fish weight and mercury concentration in fish tissue?
The Data Literacy Project has found that some teachers are not fluent in “graph
language.”• Have not really thought about variability in data• Don’t tend to critique students’ graphs but focus on
mechanics• Don’t think about linking the graph to a question• Do not know how to use graphing & spreadsheet
software• Are eager to improve their own understanding & skill
Many teachers
3. Display, describe, and interpret
the data in the context of a question
Scientists and engineers make observations and collect data with a question in mind.
They collect data from more than one example – from as many examples as is feasible. When measuring many examples, scientists and engineers expect that the data they collect will vary.
When answering their question, they have to come to understand why the data vary.
Three important ideas:
Scientists make observations and measurements (collect data) with a question in mind.
a. Which chemical is most lethal to fish? b. Is the number of registered cell phones correlated with
gross national income?c. Has monthly household electricity use in our town
decreased over the last year?d. ?e. ?
There are THREE kinds of questions that students are most likely to encounter in
high school and college science courses!
Compare two or more groups in a single variable
Correlation: See how strongly two variables are correlated with each other
Time series: See how a measurement changes through time
Variability is often forgotten!
1. Which car manufacturer makes the most fuel-efficient fleet of vehicles, Chevrolet or Toyota?
2. Has Penobscot Bay warmed over the last 10 years?
3. Which beam shape supports the most weight?
4. How deep are the lakes in Maine?
5. How tall is this tree?
6. Is there a relationship between wind speed and air pressure?
a. Compares groupsb. Correlationc. Time seriesd. Variabilitye. Not a statistical question
What kind of question is each of the following?
1. Which car manufacturer makes the most fuel-efficient fleet of vehicles, Chevrolet or Toyota? Compares two groups of cars in mpg rating.
2. Has Penobscot Bay warmed over the last 10 years? Time series, (how water temperature (e.g. mean annual or mean monthly temp) changed through time).
3. Which beam shape supports the most weight? Compares groups. An engineer would test several beams of each type. Beam shape is categorical data (e.g. I-beam, or flat beam, or a round beam), and is not continuous numeric data (e.g. 1,2,3, etc.), so it can’t be about a correlation between two numeric factors.
4. How deep are the lakes in Maine? variability
5. How tall is this tree? Not a statistical question
6. Is there a relationship between wind speed and air pressure? Correlation; both wind speed and air pressure are numeric factors.
Answers to: What kind of question is each of the following?
1. How did the heights of the bean plants change over the last two weeks?
2. What was the score of the Red Sox game?
3. Is mercury content in fish related to fish weight?
4. Do cats and dogs have the same resting pulse rate?
5. Which region has the most severe earthquakes, Japan or Alaska?
a. Compares groupsb. Correlationc. Time seriesd. Variabilitye. Not a statistical question
For PRACTICE!What kind of question is each of the following?
Write one question of each type that is about something that interests you.
Variability: how one variable varies
Compare two or more groups in a single variable
Correlation: See how strongly two variables are correlated with each other
Time series: See how a measurement changes through time
(What measurements would you need to investigate each of your questions?)
What kind of graph should I make?
The best kind of graph to use depends on what kind of question you are asking.
1. Variability within a group? frequency plot
2. Compare groups? 2 frequency plots or 1 bar graph
3. Correlation between two variables? scatter plot
4. Change through time? line graph
1. Variability within a groupFREQUENCY PLOTS (dot plot, histogram, box & whisker)
How much rainfall does Bangor get during the month of July?
10.000.501.001.502.002.503.003.504.00
Bangor Mean July precipitation (1953-2009)
JULY
inch
es
Data source: http://www.ncdc.noaa.gov/oa/climate/stationlocator.html
Bar graph
Dot plot
FREQUENCY PLOTSDot plot, histogram, box & whisker plot
How much rainfall does Bangor get during the month of July?
(NOTE: Y-axis = counts)
2. Compare groups in a single variableTwo FREQUENCY PLOTS or a BAR GRAPH
During which month does Bangor receive more precipitation, July or September?
3. Are two variables correlated?SCATTER PLOTS
Is the mileage (miles per gallon) a car gets related to its weight?
Data source: US Environmental Protection Agency, epa.gov
4. Change through timeLINE GRAPHS
How has the rate of Chlamydia infections in Maine changed through time?
1980 1985 1990 1995 2000 2005 2010 20150
50
100
150
200
250
300
350
Rates of Chlamydia in Maine (1984-2009)
Year
case
s pe
r 100
,000
pop
Data source: US Center for Disease Control from http://wonder.cdc.gov/wonder/help/stdm.html
5. How something is divided into partsPIE CHART or STACKED BAR GRAPH
How much of Maine’s electricity is generated by renewable fuels?
Sources of fuel for electricity generation in Maine (2010)
Data source: US Energy Information Agency (eia.gov)
Graph Choice Chart: Link choice of graph type to a question
Pedagogy• Frame a question prior to graphing• Chose graph type based on the question• Begin with Hand drawn graphs • The task does not end with making the graph. Tell
the story of the graph Students need language for describing graphs and variability and for making argument based on evidence
• Students and teachers puzzle over data and graphs routinely (> once a week)
• Check students’ skills frequently• Really scrutinize students’ graphs to understand their
thinking.• Talk about it.
“They always just make a bar graph”
“My students would not be able to put these numbers on the axis.” (e.g. 2.5, 2.05)
“Throw out the outlier!”
“They won’t like it if the point doesn’t fall on the grid line”
Can’t distinguish between categorical & numerical data
Students think of the graph as the endpoint (rather than as a tool for telling a story about how things relate)
Trouble spots (Teacher comments)
4. Explain how the evidence
supports the interpretation, accounting for variability in
the data
Grade 9 Example
Data Story
Another Grade 9 Example
I’ll pause for a moment so you can let this information sink in.”
Assessing Student’s Graphing Skills
The Rubric
Let’s Practice!
What is a question you can ask from this data?
What kind of question did you ask? (compare, correlate, time series, etc)
What kind of graph will help you answer your question
Describe (evidence) and Interpret (claim & reasoning) your graph
• Conducting experiments that collect data• NOAA• Canadian National Forest Data Base• National Phenology Network• Neracoos http://neracoos.org/realtime_map• Giovanni• US Energy Information Agency
Where to get data?
The Date Literacy Website
participatoryscience.org
Debrief with grade span teams
K-2
3-5
6-89-12
Report Out
Technology’s role to support student learning with data?
BIG Data
Next Steps?