advanced methods and analysis for the learning and social sciences

68
Advanced Methods and Analysis for the Learning and Social Sciences PSY505 Spring term, 2012 April 4, 2012

Upload: gefen

Post on 23-Feb-2016

38 views

Category:

Documents


0 download

DESCRIPTION

Advanced Methods and Analysis for the Learning and Social Sciences. PSY505 Spring term, 2012 April 4, 2012. Today’s Class. Discovery with Models. Discovery with Models. Discovery with Models: The Big Idea. A model of a phenomenon is developed Via Prediction Clustering - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Advanced Methods and Analysis for the Learning and Social Sciences

Advanced Methods and Analysis for the Learning and Social Sciences

PSY505Spring term, 2012

April 4, 2012

Page 2: Advanced Methods and Analysis for the Learning and Social Sciences

Today’s Class

• Discovery with Models

Page 3: Advanced Methods and Analysis for the Learning and Social Sciences

Discovery with Models

Page 4: Advanced Methods and Analysis for the Learning and Social Sciences

Discovery with Models: The Big Idea

• A model of a phenomenon is developed • Via – Prediction– Clustering– Knowledge Engineering

• This model is then used as a component in another analysis

Page 5: Advanced Methods and Analysis for the Learning and Social Sciences

Can be used in Prediction

• The created model’s predictions are used as predictor variables in predicting a new variable

• E.g. Classification, Regression

Page 6: Advanced Methods and Analysis for the Learning and Social Sciences

Can be used in Relationship Mining

• The relationships between the created model’s predictions and additional variables are studied

• This can enable a researcher to study the relationship between a complex latent construct and a wide variety of observable constructs

• E.g. Correlation mining, Association Rule Mining

Page 7: Advanced Methods and Analysis for the Learning and Social Sciences

“Increasingly Important…”

• Baker & Yacef (2009) argued that Discovery with Models is a key emerging area of EDM

• I think that’s true, although it has been a bit slower to emerge than I might have expected

Page 8: Advanced Methods and Analysis for the Learning and Social Sciences

Example

• From the reading

Page 9: Advanced Methods and Analysis for the Learning and Social Sciences

Baker & Gowda (2010)

• Models of gaming the system, off-task behavior, and carelessness developed

• Applied to entire year of data from urban, rural, and suburban schools

• Differences in behaviors between different schools studied

Page 10: Advanced Methods and Analysis for the Learning and Social Sciences

10

Students

• Used Cognitive Tutor Geometry during school year 2005-2006

• Approximately 2 days a week• Tutor chosen by teachers– e.g. not all classrooms used software

• However, software used by representative population in each school– e.g. not just gifted or special-needs students

• 3 schools in Southwestern Pennsylvania

Page 11: Advanced Methods and Analysis for the Learning and Social Sciences

11

Urban school

Suburban school

Rural school

% African-American 100% <1% 2%

% White 0% 98% 97%

% Hispanic 0% <1% <1%

Free or Reduced-Price Lunch

99% 4% Not Reported

% Proficient on state math exam

20% 77% Not Reported

Median household income in community

$26,621 $60,307 $32,206

% Children under poverty line

30.8% 2.5% 18.4%

Page 12: Advanced Methods and Analysis for the Learning and Social Sciences

12

Urban school

Suburban school

Rural school

Number of students using software

34 88 435

Average time using software

51 hrs. 35 hrs. 9 hrs.

Total number of actions within software

245, 092 244, 396 484,875

Page 13: Advanced Methods and Analysis for the Learning and Social Sciences

13

Amount of usage

• Not consistent between schools• Clear potential confound• Hard to resolve

• Leaving confound in place is a natural bias (matches how software genuinely used in each setting)

• Setting arbitrary cut-offs on minimum time used and eliminating students, or only looking at the first N minutes of tutor usage, just changes what the selection bias is

Page 14: Advanced Methods and Analysis for the Learning and Social Sciences

14

To address these confounds

• Data was analyzed in two ways– Using all data (more ecologically valid)– Using a time-slice consisting of the 3rd-8th hours

(minutes 120-480) of each student’s usage of the software (more controlled)• This time-slice will not be as representative of the

usage in each school, but avoids the confound of total usage time• 3rd-8th hours were selected, because the initial 2 hours

likely represent interface learning, which are not representative of overall tutor use

Page 15: Advanced Methods and Analysis for the Learning and Social Sciences

Off-task behavior assessed using

• Baker’s (2007) Latent Response Model machine-learned detector of off-task behavior– Trained using data from students using a Cognitive Tutor

for Middle School Mathematics in several suburban schools• Age range was moderately older in this study than in original

training data• But off-task behavior is similar in nature within the two

populations: ceasing to use the software for a significant period of time without seeking help from teacher or software

– Validated to generalize to new students and across Cognitive Tutor lessons

Page 16: Advanced Methods and Analysis for the Learning and Social Sciences

Gaming assessed using

• Baker & de Carvalho’s (2008) Latent Response Model machine-learned detector of gaming the system– Trained using data from age-similar population of

students using Algebra Cognitive Tutor in suburban schools

– Approach validated to generalize between students and between Cognitive Tutor lessons (Baker, Corbett, Roll, & Koedinger, 2008)

– Predicts robust learning in college Genetics (Baker, Gowda, & Corbett, 2011)

Page 17: Advanced Methods and Analysis for the Learning and Social Sciences

Carelessness assessed using

• Baker, Cobett, & Aleven’s (2008) machine-learned detector of contextual slip– Trained and cross-validated using data from year-

long use of same Geometry Cognitive Tutor in suburban schools

– Detectors transfer from USA to Philippines and vice-versa (San Pedro et al., 2011)

Page 18: Advanced Methods and Analysis for the Learning and Social Sciences

18

Application of Models

• Each model was applied to the full data set and time-slice data set from each school

Page 19: Advanced Methods and Analysis for the Learning and Social Sciences

% Off-Task

Urban school

Suburban school

Rural school

Full Data Set 34.1% (18.0%)

15.4% (20.7%)

20.4% (13.3%)

Hours 3-8 25.7% (22.8%)

16.5% (27.5%)

21.0% (16.5%)

Page 20: Advanced Methods and Analysis for the Learning and Social Sciences

% Off-Task

Urban school

Suburban school

Rural school

Full Data Set 34.1% (18.0%)

15.4% (20.7%)

20.4% (13.3%)

Hours 3-8 25.7% (22.8%)

16.5% (27.5%)

21.0% (16.5%)

All differences in color statistically significant at p<0.05, using Tukey’s HSD

Page 21: Advanced Methods and Analysis for the Learning and Social Sciences

% Off-Task

Urban school

Suburban school

Rural school

Full Data Set 34.1% (18.0%)

15.4% (20.7%)

20.4% (13.3%)

Hours 3-8 25.7% (22.8%)

16.5% (27.5%)

21.0% (16.5%)

All differences in color statistically significant at p<0.05, using Tukey’s HSD

Page 22: Advanced Methods and Analysis for the Learning and Social Sciences

% Off-Task

Urban school

Suburban school

Rural school

Full Data Set 34.1% (18.0%)

15.4% (20.7%)

20.4% (13.3%)

Hours 3-8 25.7% (22.8%)

16.5% (27.5%)

21.0% (16.5%)

All purple columns statistically significantly different at p<0.05

Page 23: Advanced Methods and Analysis for the Learning and Social Sciences

Overall Model Goodness

• Full data set: School explains 6.4% of variance in time off-task

• Hours 3-8: School explains 1.2% of variance in time off-task

Page 24: Advanced Methods and Analysis for the Learning and Social Sciences

% Gaming the System

Urban school

Suburban school

Rural school

Full Data Set 7.4% (2.2%)

6.9% (3.1%)

6.6% (1.7%)

Hours 3-8 4.7% (1.9%)

5.9% (7.3%)

6.4% (2.2%)

Page 25: Advanced Methods and Analysis for the Learning and Social Sciences

% Gaming the System

Urban school

Suburban school

Rural school

Full Data Set 7.4% (2.2%)

6.9% (3.1%)

6.6% (1.7%)

Hours 3-8 4.7% (1.9%)

5.9% (7.3%)

6.4% (2.2%)

All differences in color statistically significant at p<0.05, using Tukey’s HSD

Page 26: Advanced Methods and Analysis for the Learning and Social Sciences

% Gaming the System

Urban school

Suburban school

Rural school

Full Data Set 7.4% (2.2%)

6.9% (3.1%)

6.6% (1.7%)

Hours 3-8 4.7% (1.9%)

5.9% (7.3%)

6.4% (2.2%)

Note that the relationship in gaming flips between data sets (but the change is all in the urban school)

Page 27: Advanced Methods and Analysis for the Learning and Social Sciences

% Gaming the System

Urban school

Suburban school

Rural school

Full Data Set 7.4% (2.2%)

6.9% (3.1%)

6.6% (1.7%)

Hours 3-8 4.7% (1.9%)

5.9% (7.3%)

6.4% (2.2%)

All purple columns statistically significantly different at p<0.05

Page 28: Advanced Methods and Analysis for the Learning and Social Sciences

Overall Model Goodness

• Full data set: School explains 1.1% of variance in gaming

• Hours 3-8: School explains 1.7% of variance in gaming

• Student, tutor lesson, and problem all found to predict significantly larger proportion of variance(Baker, 2007; Baker et al, 2009; Muldner et al, 2010)

Page 29: Advanced Methods and Analysis for the Learning and Social Sciences

Slip Probability

Urban school

Suburban school

Rural school

Full Data Set 0.50 (0.07)

0.32 (0.11)

0.27 (0.13)

Hours 3-8 0.53 (0.08)

0.44 (0.17)

0.33 (0.18)

Page 30: Advanced Methods and Analysis for the Learning and Social Sciences

Slip Probability

Urban school

Suburban school

Rural school

Full Data Set 0.50 (0.07)

0.32 (0.11)

0.27 (0.13)

Hours 3-8 0.53 (0.08)

0.44 (0.17)

0.33 (0.18)

All differences in color statistically significant at p<0.05, using Tukey’s HSD

Page 31: Advanced Methods and Analysis for the Learning and Social Sciences

Slip Probability

Urban school

Suburban school

Rural school

Full Data Set 0.50 (0.07)

0.32 (0.11)

0.27 (0.13)

Hours 3-8 0.53 (0.08)

0.44 (0.17)

0.33 (0.18)

All differences in color statistically significant at p<0.05, using Tukey’s HSD

Page 32: Advanced Methods and Analysis for the Learning and Social Sciences

Slip Probability

Urban school

Suburban school

Rural school

Full Data Set 0.50 (0.07)

0.32 (0.11)

0.27 (0.13)

Hours 3-8 0.53 (0.08)

0.44 (0.17)

0.33 (0.18)

All purple columns statistically significantly different at p<0.05; slip probabilities systematically higher in hours 3-8

Page 33: Advanced Methods and Analysis for the Learning and Social Sciences

Overall Model Goodness

• Full data set: School explains 16.5% of variance in slip probability

• Hours 3-8: School explains 11.1% of variance in slip probability

Page 34: Advanced Methods and Analysis for the Learning and Social Sciences

Time-slices agreed

• More off-task behavior and carelessness in the urban school than in the other two schools

Page 35: Advanced Methods and Analysis for the Learning and Social Sciences

Comments

• Significant poverty in both urban and rural schools

• Some factor other than simply socio-economic status explains higher frequency of off-task behavior and carelessness in urban school

Page 36: Advanced Methods and Analysis for the Learning and Social Sciences

Potential hypotheses

• Differences in teacher expertise (urban teachers in USA have higher turnover and are usually less experienced)

• Differences in schools’ facilities, equipment (e.g. computers), and physical environment

• Differences in students’ cultural backgrounds

• Your hypothesis welcome!

Page 37: Advanced Methods and Analysis for the Learning and Social Sciences

Time-slices disagreed

• On how much gaming the system occurred in the urban school

• Less gaming earlier in the year,more gaming later in the year

• Greater novelty effect in urban students?

Page 38: Advanced Methods and Analysis for the Learning and Social Sciences

Another interesting finding

• Carelessness dropped significantly more over the course of the school year in the suburban and rural schools than in the urban school

• Some factor caused the suburban and rural students to become more diligent during the school year– This didn’t happen in the urban school– Not clear why not

Page 39: Advanced Methods and Analysis for the Learning and Social Sciences

Obvious limitations and flaws?

• In the application of discovery with models

Page 40: Advanced Methods and Analysis for the Learning and Social Sciences

Obvious limitations and flaws:How fatal are they?

• One school per category• Different usage pattern in different schools• Detectors only formally validated for suburban

students• Your ideas?

Page 41: Advanced Methods and Analysis for the Learning and Social Sciences

Discovery with Models:Here There Be Monsters

Page 42: Advanced Methods and Analysis for the Learning and Social Sciences

Discovery with Models: Here There Be Monsters

“Rar.”

Page 43: Advanced Methods and Analysis for the Learning and Social Sciences

Discovery with Models: Here There Be Monsters

• It’s really easy to do something badly wrong, for some types of “Discovery with Models” analyses

• No warnings when you do

Page 44: Advanced Methods and Analysis for the Learning and Social Sciences

Think Validity

• Validity is always important for model creation

• Doubly-important for discovery with models– Discovery with Models almost always involves

applying model to new data– How confident are you that your model will apply

to the new data?

Page 45: Advanced Methods and Analysis for the Learning and Social Sciences

Challenges to Valid Application

• What are some challenges to valid application of a model within a discovery with models analysis?

Page 46: Advanced Methods and Analysis for the Learning and Social Sciences

Challenges to Valid Application

• Is model valid for population?• Is model valid for all tutor lessons? (or other

differences)• Is model valid for setting of use? (classroom

versus homework?)• Is the model valid in the first place? (especially

important for knowledge engineered models)

Page 47: Advanced Methods and Analysis for the Learning and Social Sciences

That said• Part of the point of discovery with models is to conduct

analyses that would simply be impossible without the model

• Many types of measures are used outside the context where they were tested– Questionnaires in particular

• The key is to find a balance between paralysis and validity– And be honest about what you did so that others can replicate

and disagree

Page 48: Advanced Methods and Analysis for the Learning and Social Sciences

Example

• Different types of validity concerns

Page 49: Advanced Methods and Analysis for the Learning and Social Sciences

The WPI-ASU squabble

Page 50: Advanced Methods and Analysis for the Learning and Social Sciences

The WPI-ASU squabble

• Baker (2007) used a machine-learned detector of gaming the system to determine whether gaming the system is better predicted by student or tutor lesson

• The detector was validated for a different population than the research population (middle school versus high schools)

• The detector was validated for new lessons in 4 cases, but not for full set

Page 51: Advanced Methods and Analysis for the Learning and Social Sciences

The WPI-ASU squabble

• Muldner et al. (2011) used a knowledge-engineered detector of gaming the system to determine whether gaming the system is better predicted by student or tutor lesson

• The detector was not validated, but trusted because of face validity

Page 52: Advanced Methods and Analysis for the Learning and Social Sciences

The WPI-ASU squabble

• Baker (2007) found that gaming better predicted by lesson

• Muldner et al. (2011) found that gaming better predicted by student

• Which one should we trust?

Page 53: Advanced Methods and Analysis for the Learning and Social Sciences

The WPI-ASU squabble

• Recent unpublished research by the two groups working together

• Found that a human coder’s labeling of clips where the two detectors disagree…

• Achieved K=0.17 with the ML detector• And K= -0.17 with the KE detector

Page 54: Advanced Methods and Analysis for the Learning and Social Sciences

The WPI-ASU squabble

• Which one should we trust?

• (Neither?)

Page 55: Advanced Methods and Analysis for the Learning and Social Sciences

ML and KE don’t always disagree• In Baker et al. (2008)

• Baker and colleagues conducted two studies correlating ML detectors of gaming to learner characteristics questionnaires

• Walonoski and Heffernan conducted one study correlating KE detector of gaming to learner characteristics questionnaires

• Very similar overall results!

Page 56: Advanced Methods and Analysis for the Learning and Social Sciences

Converging evidence increases our trust in the methods!

Page 57: Advanced Methods and Analysis for the Learning and Social Sciences

Models on top of Models

Page 58: Advanced Methods and Analysis for the Learning and Social Sciences

Models on top of Models

• Another area of Discovery with Models is composing models out of other models

• Examples:– Models of Gaming the System and Help-Seeking use Bayesian

Knowledge-Tracing models as components (Baker et al., 2004, 2008a, 2008b; Aleven et al., 2004, 2006)

– Models of Preparation for Future Learning use models of Gaming the System as components (Baker et al., 2011)

– Models of Affect use models of Off-Task Behavior as components (Baker et al., in press)

Page 59: Advanced Methods and Analysis for the Learning and Social Sciences

Models on top of Models

• When I talk about this, people often worry about building a model on top of imperfect models

• Will the error “pile up”?

Page 60: Advanced Methods and Analysis for the Learning and Social Sciences

Models on top of Models

• But is this really a risk?

• If the final model successfully predicts the final construct, do we care if the model it uses internally is imperfect?

• What would be the costs?

Page 61: Advanced Methods and Analysis for the Learning and Social Sciences

What are some

• Other examples of discovery with models analyses?

Page 62: Advanced Methods and Analysis for the Learning and Social Sciences

Examples

• Sweet, can you tell us aboutSan Pedro, M.O.C., Baker, R.S.J.d., Rodrigo, M.M. (2011) The Relationship between Carelessness and Affect in a Cognitive Tutor. Proceedings of the 4th bi-annual International Conference on Affective Computing and Intelligent Interaction.

Page 63: Advanced Methods and Analysis for the Learning and Social Sciences

Examples

• Mike Wixon, can you tell us aboutHershkovitz, A., Wixon, M., Baker, R.S.J.d., Gobert, J., Sao Pedro, M. (2011) Carelessness and Goal Orientation in a Science Microworld. Poster paper. Proceedings of the 15th International Conference on Artificial Intelligence in Education, 462-465.

Page 64: Advanced Methods and Analysis for the Learning and Social Sciences

Examples

• Mike Sao Pedro, can you tell us aboutSao Pedro, M.A., Gobert, J., Baker, R.S.J.d. (in press) The Development and Transfer of Data Collection Inquiry Skills across Physical Science Microworlds. To be presented at the American Educational Research Association Conference.

Page 65: Advanced Methods and Analysis for the Learning and Social Sciences

Comments? Questions?

Page 66: Advanced Methods and Analysis for the Learning and Social Sciences

Asgn. 9

• Questions?• Comments?

Page 67: Advanced Methods and Analysis for the Learning and Social Sciences

Next Class• Wednesday, April 9• 3pm-5pm• AK232

• Missing Data and Imputation

• Readings• Schafer, J.L., Graham, J.W. (2002) Missing Data: Our View of

the State of the Art. Psychological Methods, 7 (2), 147-177.

• Assignments Due: 9. IMPUTATION

Page 68: Advanced Methods and Analysis for the Learning and Social Sciences

The End