praxis-teacher-research.org · web view2021. 2. 12. · different teaching methods have been...
TRANSCRIPT
Neural and behavioural correlates of science reasoning during
adolescence.
Jack White-Foy
Birkbeck, University of London
& University College London (Institute of Education)
ABSTRACTInhibitory control is thought to play a role in conceptual change; whereby a
naïve idea is inhibited in favour of the correct scientific one but is never truly
erased. Activation of the anterior cingulate cortex and dorsolateral prefrontal
cortex have been associated with better performance on misconception tasks
in children and adults and could represent a network of error detection and
response inhibition. Adolescence represents a period of development in brain
structure and function but little research into science reasoning and
conceptual change has been undertaken with this age group. Using modified
versions of the Stroop and Go/No-Go tasks with 20 adolescent participants
aged 11-15 years old, semantic and response inhibition performance were
measured. Using functional magnetic resonance imaging data, the left
dorsolateral prefrontal cortex and pre-supplementary motor area showed
greater activation during unique science tasks but was not associated with
better performance on the misconception questions. Whilst there was no
relationship between inhibitory control and performance on misconception
tasks, better verbal IQ, working memory and a larger reaction time cost were
associated with better performance. Implications for teaching and
opportunities for future research are discussed.
1
1. INTRODUCTIONHeavier objects fall faster than lighter objects; organisms have chosen their
adaptations; gravity pulls objects downwards (Driver, Guesne, & Tiberghien,
1985). These are just some of the many misconceptions that students can
bring with them when they start formal education (Brault Foisy, Ahr, Masson,
Borst, & Houdé, 2015). Some students go on to demonstrate more
sophisticated and accurate understanding of science concepts, whilst others
enter adulthood still with these naïve and incorrect ideas about the world
(Dunbar, Fugelsang, & Stein, 2007). One of the many challenges faced by a
teacher of science is to ensure that misconceptions are abandoned in favour
of scientific ones (Duit & Treagust, 2003). The purpose is not only to help their
students pass public examinations but also to ensure that they leave school
with the skills to understand future scientific discoveries and theories (e.g.
climate change) (diSessa, 2006). Such a task can be described as a
challenge due to the often prevalent and robust nature of misconceptions and
the abstract features of science concepts to be learned (Chi, 2005). Achieving
conceptual change, the movement from a naïve to an expert understanding of
a concept, has long been the focus of a large and still growing body of
research (Brault Foisy, Ahr, et al., 2015). Understanding how conceptual
change happens is therefore of great importance to educators and society.
1.1 Conceptual changeDifferent teaching methods have been proposed, which highlights that there is
no consensus on how to achieve conceptual change or on the underlying
theoretical framework (Dunbar et al., 2007). Wiser & Carey (1983) suggested
that conceptual change is similar to a scientific revolution, whereby a
paradigm that is no longer capable of satisfactorily explaining evidence is
abandoned in favour of one that provides a better explanation. Although a
scientific revolution at a societal level is likely to differ to one at a personal
level (Karmiloff-Smith, 1988), similar conditions need to be met: a student
must be dissatisfied with their existing concept, the replacement concept must
be understandable, believable and promise to help explain or discover new
ideas (Posner, Strike, Hewson, & Gertzog, 1982).
2
To put this within the context of a classroom, teachers would first identify the
naïve ideas their students hold. Then they would present students with a
problem to explain using their ideas. Upon being confronted that their naïve
ideas do not provide a solution, the teacher would introduce the new more
appropriate concept (the scientific one). Following this, students would
explicitly contrast both their naïve and scientific concepts (Chi, 2008). The
hope is that students would recognise that their naïve ideas are not as good
at explaining the problem as the scientific one (Rowell & Dawson, 1985).
Goswami (2008) argued that conceptual change is not static but instead a
continuous cycle between equilibrium (where a concept satisfactorily explains
observations), experience followed by disequilibrium (where the naïve theory
does not explain the new experiences), assimilation of a new concept
followed by the return to equilibrium. In much the same way, Inagaki & Hatano
(2002) suggested that when presented with a scientific theory, a students’
naïve theory is disrupted. To regain stability the student must modify their
concept or replace it altogether. This is not always the case, however, due to
resistance to change; if a student’s naïve concept is robust they may be in
denial that anything requires changing and can instead add to their existing
theory to make it fit better to the perceived disruption (diSessa, 2006).
The difficulty with this explanation is that investigations have found mixed
results when studying conceptual changes. Teaching approaches that work
for some students do not work for others and what may first appear as
conceptual change does not last (Vosniadou, 2002). This throws up the next
major consideration for conceptual change research: when conceptual
change occurs, what happens to the naïve theory? Some argue that for
conceptual change to occur, the naïve theory must be eliminated to make way
for the scientific one, since both cannot be held at the same time; others
argue that the naïve theory and the new scientific one coexist (diSessa,
2006). Evidence from research currently favours the latter argument (Dawson,
2014).
3
1.2 Are naïve theories really eliminated?Misconceptions can originate in children from a very young age, when they
begin to observe the world around them and create simple rules about
themselves, others and their environment. For example, by the age of four or
five months, children develop a concept for object permanence (Dunbar et al.,
2007). When an object is shown to be hidden beneath a cover, the infant will
find it, recognising that the object has not vanished even though it is not
visible to them anymore. This concept is not the result of formal training but is
developed from the infant’s own observations and experience (Gropen, Clark-
Chiarelli, Hoisington, & Ehrlich, 2011). Concepts such as these are referred to
as heuristics: simple and quick rules about various phenomena (Brault Foisy,
Potvin, Riopel, & Masson, 2015). Algorithms, on the other hand, are slower
and more accurate logic-based calculations, relying on conscious processing
(Gropen et al., 2011). Adults have developed greater control over which level
of processing to choose, recognising when the experiential approach
(heuristic) is inappropriate and allowing more time for the analytic (algorithm)
(Gropen et al., 2011).
Early on in development, heuristics may help a child to navigate the world
around them but they can become insufficient to explain more complex
phenomena. Heuristics then become misconceptions and may hinder
understanding and problem-solving (Flavell, 1985). This is particularly
important in secondary school, which exposes children to abstract concepts in
biology, chemistry and physics (Driver, Squires, Rushworth, & Wood-
Robinson, 2015). A classic experiment by Piaget (1997) demonstrated the
‘length-equals-number’ misconception. Children were presented with two
parallel lines each with the same number of dots. In one of the lines the dots
were more spread out (Figure 1B). Children between one and seven years of
age incorrectly stated that the longer line contains more dots, demonstrating
that they were relying on their naïve idea of this relationship (Houdé, 2000).
To complete this task successfully and state that the number of dots is equal,
children needed to abandon the heuristic and count each dot (Borst, Simon,
Vidal, & Houdé, 2013).
4
Figure 1: (A) Children over two years old correctly stated that there are equal numbers of blue and red dots (B) Children aged two to seven years incorrectly stated that there are more blue dots than red dots.
After seven years of age, typically developing children no longer demonstrate
the misconception that length equals number. This could support the idea that
for conceptual change to occur, the naïve idea is deleted and replaced with a
more accurate one. This theory, however, is inadequate to explain further
investigations. Piaget (1997) investigated development of object permanence
using the ‘A-not-B’ experiment (an extension of the object permanence
experiment). Up to a year old, once an infant has learned that an object is
hidden under cover A, they will continue to search under cover A even when
the object is moved under cover B in full view (Figure 2).
Figure 2: The A-not-B task. Infants aged up to a year still search under cover A (brown) despite having seen the object being moved under cover B (grey) (figure adapted from Marcovitch & Zelazo, 1999).
Piaget (1997) argued this was a violation of object permanence whereby the
infant was unable to coordinate and activate the heuristic that would have
allowed them to find the object. Lewandowsky & Li (1995) suggested instead
that there was competition between two coexisting memory traces: the
5
memory of the role of Cover A and the memory of seeing the object move
under Cover B. Diamond (1998) however, argued that both the length-equals-
number and A-not-B tasks show a failure to inhibit the naïve idea, which
would have allowed the algorithm to be processed. Dempster & Brainerd
(1995) supported this conclusion, stating that poor performance on these
tasks is due to the inability to inhibit the interference from the heuristic, rather
than inappropriate selection of the correct conceptual framework to follow.
These two examples of early development highlight the recurring finding that
misconceptions can reappear even after they have supposedly been erased.
Rowell and Dawson (1977) described a series of studies that investigated the
success and duration of conceptual change in adolescents. They found that
some students reverted to naïve ideas after a few weeks or a few months
after demonstrating successful understanding of a new concept. They also
found that that some students correctly answered questions that were
phrased in one context but reverted to misconceptions for another, even if
both questions related to the same concept. This finding is not unique to
children; even professional scientists can demonstrate naïve and incorrect
ideas when reaction time is limited (Masson, Potvin, Riopel, & Brault Foisy,
2014). This adds support to the model that following conceptual change,
naïve and expert concepts can coexist into adulthood.
Further evidence for this model came from studies with Alzheimer’s patients.
Lombrozo, Keleman, & Zaitchik (2007) discovered that participants reverted
back to naïve concepts when their cognitive abilities diminished. If conceptual
change is the deletion and replacement of ideas, either conceptual change did
not take place in these Alzheimer’s patients, or their scientific ideas were
replaced with new naïve ones. Alternatively, their naïve ideas survived
conceptual change and somehow these patients reverted back to them
(diSessa, 2006).
The principles of neural plasticity can go someway to explaining why
memories for naïve concepts can appear to be so robust. Memories are
formed from the strengthening of synapses between neurons within a neural
6
network (Gazzaniga, Ivry, & Mangun, 2009). Synapses are strengthened
following repeated activation from experience, observations and interactions
(Huttenlocher, 2002). Whilst memories can be fragile as they move from the
hippocampus (short term) to the cerebral cortex (long term), once established
they only deteriorate slowly after non-use. The idea that conceptual change
occurs by eradicating an entire concept, in a single lesson or even a short
series of lessons, is not supported by the neural basis for memory formation
and forgetting. Mefoh (2010) argued that the appearance of forgetting is
actually due to competition for retrieval between long-term memory traces.
The question becomes whether conceptual change is the appropriate
activation of the scientific concept, the inhibition of the incorrect naïve
concept, or a combination of the two.
1.3 The role of inhibitory controlThere is strong evidence that for a scientific theory to be expressed, the
misconception must be inhibited. As discussed above, heuristics, which form
the basis of misconceptions, are accessed faster than algorithms.
Unfortunately for learners, the more challenging scientific concepts rely on
algorithms in order to solve problems. Lewandowsky & Li (1995) suggested
that there is competition, likened to a race, between memory traces. If it were
just a simple race that determined which concept was expressed, the
heuristic, being faster, would always win. Furthermore, reaction times are
longer for correct expert responses compared to incorrect novices responses,
which suggests that additional processing is taking place (Babai &
Amsterdamer, 2008). This cannot be easily accounted for by the selection-
activation hypothesis alone. If that were the case the same reaction time on
either a naïve or a scientific answer would be expected. The novice who
expresses the misconception has either not learned the correct concept, or is
unable to inhibit their naïve concept and so the heuristic wins the race. An
expert, on the other hand, is able to inhibit the misconception allowing the
correct view to be expressed. The additional processing involved in inhibitory
control could account for the longer time taken to respond. If inhibitory control
plays a role in conceptual change, a question relevant to education is whether
7
a student’s ability to inhibit responses makes the process of conceptual
change particularly difficult.
The Stroop task and the Go/No-Go task have been used to investigate
inhibitory control. Different performance on these two tasks suggests that
there may be different aspects of inhibitory control; whereby the Stroop task
targets semantic inhibition (ignoring interference to give a correct response)
whereas the Go/No-Go task targets response inhibition (identifying the correct
stimulus that warrants a response) (Morooka et al., 2014). The original Stroop
task presents a participant with a series of words of colours. The participant
must read the word out loud, and ignore the colour of the letters. In the simple
version of the task, the colour of the word is congruent to the word itself (e.g.
‘Blue’). In the complex version of the task, the colour of the letters is
incongruent to the word itself (e.g. ‘Blue’). To be successful, participants must
inhibit their desire to state the colour of the letters and instead read the word.
Children and adults alike take longer to complete the incongruent than the
congruent trials. Inhibitory control generally improves with age, though when
pressed for time, performance is reduced (Morooka et al., 2014).
In the Go/No-Go task, participants are asked to respond to one stimulus (the
‘Go’ trial) but not another (the ‘No-Go’ trial). Accuracy and reaction time
generally improve with age resulting in fewer incorrect responses for the ‘No-
Go’ trial (Cragg & Nation, 2008). Yet some studies have found that
adolescence sees an increase in reaction times, which can be due to simply
slowing down as well as some responses being initiated and then stopped
(Cragg & Nation, 2008). Accuracy still increases but at the expense of faster
reaction times. Neuroimaging studies on inhibitory control tasks have shown
that, in adults, performance on these tasks was associated with greater
activation of the ventrolateral and dorsolateral prefrontal cortex (VLPFC and
DLPFC respectively) and the anterior cingulate cortex (ACC) (Crone & Dahl,
2012; Durston et al., 2002). An adolescent’s brain is very different to that of a
child or an adult: white matter volumes change significantly during
adolescence (O’Hare & Sowell, 2008) and various structural and functional
changes take place, both in a linear and non-linear trajectory (Dumontheil,
8
Houlton, Christoff, & Blakemore, 2010). Mason & Just (2016) have shown that
different brain regions are activated during inhibitory control tasks at different
ages during adolescence. Figure 3 summarises changes in activation through
adolescence during inhibitory control, working memory and switching tasks
(Durston et al., 2002; Velanova, Wheeler, & Luna, 2008; Crone & Dahl, 2012).
Figure 3: Functional changes during adolescence (Crone & Dahl, 2012).
The pattern of changes in activation during inhibitory control tasks is
inconsistent and varies between studies. Several functional magnetic
resonance imaging (fMRI) studies have found an increase in activation of the
prefrontal cortex (PFC) (Adleman et al., 2002; Rubia et al., 2006; Koolschijn,
Schel, de Rooij, Rombouts, & Crone, 2011; Smith, Halari, Giampetro,
Brammer, & Rubia, 2011), the ACC (Koolschijn et al., 2011) and the inferior
frontal gyrus (IFG) (Tamm, Menon, & Reiss, 2002; Smith et al., 2011), relative
to baseline. Others have found a decrease in activation in these regions
(Crone & Dahl, 2012). These findings suggest that adolescence is a period in
development where engagement of the PFC varies in different tasks, which
could represent different strategies to achieve the same goals and
performance as adults (Dumontheil et al., 2010).
9
1.4 Neural correlates of conceptual changeFaced with competing and inconsistent approaches to conceptual change,
researchers have turned to neuroimaging to help identify the underlying
neural processes that occur with conceptual change, and whether the
evidence would support a role of inhibitory control in the resolution of
misconceptions. In doing so, it is hoped that a clearer and agreed theoretical
model can be developed, by accessing processes that are unavailable to
behavioural studies alone (Cacioppo, Berntson, & Nusbaum, 2008).
Fugelsang & Dunbar (2005) demonstrated that information is treated
differently depending on whether it agrees or conflicts with existing ideas.
When participants were shown new data that agreed with a learned rule, the
caudate and parahippocampal gyrus were more activated than baseline.
These regions have previously been associated with learning (Durston et al.,
2002). When the data conflicted, ACC, precuneus and DLPFC were activated
more than baseline (Fugelsang & Dunbar, 2005). The ACC is associated with
error detection and conflict monitoring, whilst the DLPFC is associated with
inhibitory control and response selection (Botvinick, 2007). These brain
regions were found to be still differentially activated even after several trials,
suggesting their activation was not related to learning, which had presumably
already taken place.
Dunbar et al. (2007) went on to show that physics experts showed greater
activation of the ACC compared to baseline when viewing two balls of
different sizes hit the ground at different times (a common misconception),
whereas novices showed greater activation of the ACC when viewing the two
balls hit the ground at the same time (the scientifically correct version).
Dunbar et al. (2007) concluded that the experts had undergone conceptual
change and so the ACC detected conflict between the incorrect stimulus and
their correct scientific belief. Interestingly, half of the novices were still able to
give a correct answer, which suggests that whilst they may not have
undergone conceptual change, they were still able to give a correct answer
without understanding why.
10
Brault Foisy, Potvin, et al. (2015) found that, compared to experts, novices
showed more activation in the ACC when evaluating non-scientific than
scientific stimuli relative to baseline, despite novices scoring lower than
experts. Brault Foisy, Potvin, et al. (2015) proposed that this was because
novices were at a late stage of conceptual change but not sufficiently
advanced to answer the questions correctly. The authors went on to suggest
that their results showed that the participants were able to recognise when an
intuitive response was given, which made them aware of their potential biases
and hence activated the ACC. De Neys, Vartanian, & Goel (2008) argued that
the ACC can be activated for both correct and incorrect answers but the PFC
is only activated when giving correct answers. Several studies have echoed
these findings with different types of task: electricity (Masson, Potvin, Riopel,
Brault Foisy, & Lafortune, 2012), mechanics (Dunbar et al., 2007) and
chemistry (Nelson, Lizcano, Atkins, & Dunbar, 2007). These studies found
greater activation of the ACC and DLPFC in adults when answering
misconception questions correctly.
Brault Foisy, Potvin, et al. (2015) emphasised the difficulty of measuring
inhibitory control at the precise moment of answering a science task. They
argued that by measuring activity in brain areas associated with inhibitory
control, it would reveal to what extent inhibitory control was being engaged.
One issue with previous neuroimaging studies on conceptual change is that
they rely on reverse inference, namely on concluding that inhibitory control is
recruited when ACC and/or DLPFC show increased activation. However the
ACC and DLPFC show activation in a wide range of tasks beyond inhibitory
control or conflict resolution tasks (Duncan, 2010; Crittenden & Duncan,
2014). Variability in activation during inhibitory control tasks compounds the
difficulty in attempting to interpret and generalise brain activation during
science tasks, particularly when comparing one age group with another.
1.5 The present study The findings from the behavioural and neuroimaging studies discussed here
offer valuable insights into the possible mechanisms behind conceptual
change. A limitation arises by the participants’ ages in these studies. Either
11
very young children or adults were used. In terms of conceptual change, there
are very few studies on adolescence, even though as reviewed above it is a
time of development of inhibitory control, in addition to significant changes in
brain structure and brain function more broadly.
The present study followed on from previous behavioural work by the lead
investigator on this project (Brookman, 2015), which investigated in an
adolescent sample the role of inhibitory control in answering questions on
science and mathematics (Brookman, 2015). Results showed that students
who scored higher on inhibitory control tasks were better at answering
science and mathematics misconception questions correctly (Brookman,
2015), suggesting that inhibitory control played a role when answering
questions in science and mathematics, and supporting previous evidence in
children and adults.
The present study is part of a neuroimaging project in adolescents and had
two lines of inquiry. The first was the relationship between inhibitory control
and performance on the science misconception questions. Control tasks were
included which tested working memory, verbal IQ, reasoning IQ and
analogical reasoning. These control tasks were included to assess the
specificity of the relationship between inhibitory control and the science tasks.
Several studies have linked working memory to inhibitory control and
resistance to interference. Engle (2005) argued that working memory is an
attentional control system evolved to prevent interference. Ohlsson (2009)
suggested that conceptual change relies on the ability of learners to recognise
analogies between different domains; whereby understanding can be
transferred from a simple idea to a complex one. The second line of inquiry
investigated activation of the prefrontal cortex, in particular the ACC and
DLPFC as regions of interest, during science misconception questions. The
final line of inquiry was whether activation of these two areas was associated
with higher accuracy and longer reaction times on science misconception
questions. The main hypotheses were:
12
i) Better performance on behavioural tasks (inhibitory control, working
memory and analogical reasoning) would be associated with better
accuracy and reaction time on science misconception questions.
ii) The ACC and DLPFC would be activated more when answering
misconception questions relative to baseline.
iii) Activation of the ACC and DLPFC would be associated with better
accuracy and longer reaction times when answering misconception tasks.
2. METHOD2.1 ParticipantsParticipants were recruited from a selective and fee-paying public secondary
school in East London. A total of 20 participants took part, of these 10 were
male, 10 were female. Ages were between 11 and 15 years (mean = 13.7, SD
= 1.2). All participants were screened for learning difficulties, mental health
issues and behavioural problems, none of which were present.
Ethics approval was granted on 28 January 2016 by the local ethics
committee. All participants and parents provided written consent and were
reminded that they had the right to withdraw at any time during or after the
testing session.
2.2 ProcedureTasks measuring inhibitory control and science understanding were
conducted in two stages: the first stage was practice and was carried out on a
laptop outside of the scanner. The second stage was carried out inside the
scanner and constitutes the data used for analysis. The lead investigator
carried out the scanning, whilst I assisted with the practice phase and other
data collection outside the scanner. To measure working memory, we used a
backward digit task and a visuospatial task (VSWM). To measure inhibitory
control we used the Stroop task (congruent and incongruent) and Go/No-Go
tasks (simple and complex).
13
2.3 Tasksi) Inhibitory Control
a) Go/No-Go
The Go/No-Go task was adapted from Watanabe et al. (2002). This task
followed a block design, made up of three block types: Go blocks (100% Go
blocks), simple Go/No-Go (50% Go, 50% No-Go) and complex Go/No-Go
(50% Go, 50% No-Go) blocks (Figure 4A). Each trial lasted for 1.1 seconds.
There were a total of 20 trials per block, with four repeats of each block type.
Fixation blocks were presented at 10 seconds at the start, 15 seconds at the
middle of the task and 10 seconds at the end of the task. In each block, 50%
of the trials were Go, which always required a response and 50% were No-Go
where no response was expected.
Figure 4: Go/No-Go task. (A) Timing of task blocks. (B) Example trials of the Go, simple Go/No-Go and complex Go/No-Go blocks.
In Go blocks, participants were asked to press a key to indicate on which side
of the screen a beige square was shown, using their left and right index
fingers (Figure 4B). In simple Go/No-Go blocks, participants were again asked
14
to press a key to indicate on which side of the screen the beige square was
shown, while they were told not to press any buttons when they square was
blue (Figure 4B). Complex Go/No-Go blocks followed a 1-back paradigm to
add a working memory load. Participants were asked to press a button using
their index fingers to indicate which side a yellow or pink square was shown
but only if the square was the same colour on the current trial as on the
previous trial (Figure 4B).
Participants were given a practice round before scanning. During this time,
the instructor could explain the instructions, which were presented on screen,
check understanding of the instructions and answer any questions. If the
participant made three or more errors out of 15 trials, the programme would
inform them of such and restart the practice until they made no more than two
errors. Accuracy on Go and No-Go trials were recorded and reaction times
were recorded for Go trials. The test run in the scanner lasted 5.9 min in total.
b) Numerical Stroop task
This variation of the Stroop task was adapted from Brookman (2015), and
followed a block design made up of two alternating block types, which
included either 100% congruent trials or 50% congruent trials and 50%
incongruent trials. Each trial lasted 1.5 seconds. There were a total of 14 trials
per block, with five repeats of each block type (Figure 5A). Fixation baseline
blocks were also presented for 10 seconds at the start; 15 seconds in the
middle of the task and 10 seconds at the end of the task, as in the Go/No-Go
task.
15
Figure 5: Numerical Stroop task. (A) Timing of task blocks. (B) Example trials of the congruent and mixed blocks.
Participants were shown one, two, three or four digits in the middle of the
screen, which were formed of the digits 1, 2, 3 or 4. Participants were asked
to press a key to indicate how many of the digits they could see but to ignore
the digit itself. On congruent trials, the number of digits and the digits
themselves matched, for example, ‘3 3 3’ where the correct answer would be
to press the key corresponding to “3”. On incongruent trials, the number of
digits and the digits themselves did not matched, e.g. ‘4 4’, where the correct
answer would be to press the key corresponding to “2”. Participants
responded with their middle and index fingers of both hands, 1 being the left-
most response, 4 the right-most response. Figure 5B shows example
sequences of trials in congruent and mixed (50% congruent, 50%
incongruent) blocks. The test run in the scanner lasted 4.9 min in total.
ii) Science Knowledge and Misconceptions
This task was modified from a previous study carried out by the lead
investigator in 2015. Feedback on these stimuli was sought from specialist
subject teachers in two schools, covering biology, chemistry, physics and
mathematics. Using this feedback, the stimuli were improved and aimed to
test participants’ knowledge and understanding of concepts in science and
16
mathematics. Concepts were chosen that were thought to access common
misconceptions that students typically demonstrate in lessons at this age
range. This was confirmed using national curriculum data, research by Driver
et al. (2015), analysis of schemes of work across different GCSE courses
(Edexcel, AQA and OCR exam boards) and consulting subject specialist
teachers. The focus of this dissertation is on scientific misconceptions and so
the data for mathematical performance are not included.
The task comprised four runs alternating between science-related and
mathematics-related questions. Participants were pseudo-randomly assigned
to two alternative sequences: i) science-maths-science-maths or ii) maths-
science-maths-science. There were a total of 96 trials; 48 trials that accessed
a common misconception and 48 control questions that were designed to
relate to the same topic but not to a misconception. An equal number of trials
were presented that related to biology, chemistry and physics. There were
four types of trials: i) a misconception trial that was presented as a false
statement, ii) a misconception trial presented as a true statement, iii) a control
trial presented as a false statement, or iv) a control trial presented as a true
statement. Each slide had a grey background to reduce contrast in the
scanner and therefore eye fatigue, and included a mixture of text and
drawings/schemas to make the task more engaging (Figure 6A). Participants
were asked to select whether they thought the statement was definitely true,
probably true, probably false or definitely false, using their middle and index
fingers. These four options allowed an estimation of how confident
participants were in their answer (Figure 6A).
17
Figure 6: Science knowledge and misconception task. (A) Example stimuli. (B) Example sequence of trials.
Each trial lasted for a maximum of 16 seconds. Trials would end upon a
response from a participant, or would proceed automatically after 12 seconds.
Once participant had responded or when 12 seconds had passed the stimulus
disappeared and participants were either presented with a fixation cross or
where asked to press a left/right key with their index fingers to indicate the
direction of arrows presented on the screen, up to 16 seconds after the
science or maths stimulus was initially presented (Figure 6B). Additional
fixation blocks were presented for 10 seconds at the start of run; 15 seconds
in the middle of the run and 10 seconds at the end of the run.
Participants were asked to be as quick as they could without making errors.
Participants were told that the response buttons would be surrounded by a
red box if they had not responded and only had three seconds to do so.
iii) Visuospatial working memory task
To test visuospatial working memory, participants were shown a 4x4 grid on
the screen. A series of dots would appear on the screen one at a time in
different locations in the grid. Participants were asked to recall the positions
and order that the dots appeared in and were then asked to use a computer
18
mouse to indicate the sequence by clicking on each location in turn. If the
participant made an error in reproducing the sequence, the message ‘Wrong’
would flash on the screen and the next sequence would be shown. The length
of the sequences increased from three dots per sequence until a maximum of
eight dots per sequence until the participant got three or more sequences
wrong out of four at a given level (Figure 7A). The visuospatial working
memory score is the total number of correctly recalled sequences.
Figure 7: Example stimuli of the (A) Visuospatial working memory task; (B) Matrix reasoning subtest of the WASI (Wechsler, 2011); (C) Analogical reasoning task.
iv) Verbal working memory task
To test participants’ verbal working memory, they were presented with a
sequence of digits between ‘1’ and ‘9’ verbally by the experimenter and were
asked to repeat the sequence back to the experimenter in reverse order.
Trials increased in length by one digit from two digits upwards until the
participant got two or more sequences wrong out of the four trials of a level.
Reaction time was not recorded. The backwards digit score is the total
number of sequences correctly recalled.
v) Wechsler Abbreviated Scale of Intelligence (WASI)
The Vocabulary and Matrix Reasoning subtests of the WASI (Wechsler, 2011)
were used to assess participants’ general ability. In particular it was important
to assess each participant’s level of verbal IQ to give an indication of their
accessibility of the stimuli in both the science and maths questions and the
analogical reasoning tasks. This was to help identify if any participants were
scoring particularly low on any of these tasks simply because they did not
know what the words meant, rather than due to a misconception or
19
misunderstanding of the analogical relationship between analogical pairs. In
the Vocabulary subtest participants are asked what a series of words mean.
The Matrix Reasoning subtest is a non-verbal reasoning task, which took the
form of shapes and patterns in a sequence. Participants were asked to select
from a list of potential responses as to which would come next in the
sequence, based on the previous ones (Figure 7B).
vi) Analogical reasoning
Analogies were presented to participants using online Google Forms. They
were told that they would be presented with a series of analogies in the form
of A:B::C:D, whereby A has a relationship to B and C has a relationship to D.
Participants were told that the type of relationship between A and B was
similar to the relationship between C and D. Participants were presented with
A:B and C. They were asked to choose D from a list of potential responses
(Figure 7C).
The stimuli were adapted from a study by Leech, Mareschal, & Cooper
(2007). This study tested analogical reasoning in students of a similar age to
the participants, which meant we were less likely to experience a ceiling or
floor effect on performance. There were 24 questions in total. The questions
were modified to make them more accessible in terms of vocabulary, using
more common alternatives (e.g. unjust instead of bigoted). In addition, some
words were outdated in terms of social experience, and were therefore
replaced with similar but more modern alternatives (e.g. CD player instead of
cassette player).
Participants were first given four practice analogies, during which time they
could ask any questions and the experimenter could explain any incorrect
answers. They then answered the questions at their own pace. Participants
were told to be as quick as possible without making any mistakes. The time to
complete all tasks was recorded using an Apple iPhone. Timing started when
participants scrolled down to the first test question and ended when
participants clicked on the ‘submit’ button at the bottom of the page containing
the questions. The number of correct responses was also recorded.
20
2.4 Statistical AnalysisOne-way repeated measures Analysis of Variance (ANOVAs) were performed
to analyse the Go/No-Go task data and paired samples t-tests for the
Numerical Stroop data. Whilst not the main focus of this study, it is useful to
confirm that these tests are presenting a challenge to the participants in line
with previous findings. Due to the sample size, the sample was considered as
a single age group. Whilst it is acknowledged that there may be individual
differences due to age, which is included as a predictor in the regressions,
analysis of performance on the behavioural tests was not carried out with age
as a factor.
Standard analysis of fMRI data was performed by the lead investigator using
Statistical Parametric Mapping software, who provided me with mean
parameter estimates within a pre-supplementary motor area (pre-SMA)
cluster extending into the ACC and a DLPFC cluster corresponding to
activation in each science problem trial type. Scores for each science
(biology, chemistry and physics) were calculated as a percentage, broken
down by question type (control or a misconception) and the format of the
question (true or false). Reaction times were also recorded and presented in
milliseconds. Following statistical analysis of the neuroimaging data
(Appendix 1), regions of interest in the pre-supplementary motor area (pre-
SMA) (cluster extending into the ACC), left DLPFC and the primary and
secondary visual cortex were identified in the contrast for all science trials
versus baseline. For the purposes of this investigation, only the left DLPFC
and pre-SMA were included in the analyses due to their association with
inhibitory control. One-way repeated measures ANOVAs were conducted to
determine whether there was a statistically significant difference in activation
in these regions during different types of science question, whether
misconception (correct or incorrect) or control correct (there were not enough
incorrect control trials for fMRI data analysis of this trial type).
Regression analyses were carried out using the general linear model. The first
of these was to investigate the relationship between the behavioural tasks and
performance (accuracy and reaction time) on the science misconceptions.
21
The impact of inhibitory demand was measured using a cost value; calculated
on the Stroop task as the difference in accuracy and reaction times between
the congruent and incongruent trials. For the Go/No-Go, in addition to the
accuracy scores, two costs were calculated each for reaction time. These
were: (a) the difference between simple Go trials and Go Only trials and (b)
the difference between the complex 1-back Go trials and the simple Go trials.
Regression analyses were also carried out to investigate the relationship
between brain activation of the pre-SMA and the left DLPFC on accuracy and
reaction time when answering science misconception tasks.
Tests for collinearity were carried out but none were present. There was no
collinearity as VIF was below 10 and the tolerance value was above .2 on all
analyses. Exclusionary criteria were put in place, to identify outliers further
than 3.29 SD from the mean. In addition, visual examination of boxplots and
multivariate analysis using Cook’s distance, Mahalanobis distance and
standardised DFFIT were used to identify outliers that could be influencing the
data.
Six multiple regression analyses were carried out to examine the relationships
between variables on either accuracy or reaction time when answering
science misconception tasks, entering behavioural task performance, pre-
SMA activation or DLPFC activation as possible predictors. In the first multiple
regressions with behavioural data only, Block 1 contained age, gender and
accuracy/reaction time on control questions; using the enter method. The
latter predictor was included as performance was considered to reflect
general science knowledge and was assumed to be related to performance
on the subsequent misconception tasks that were on the same topic. The
remaining blocks used the stepwise method and included the following
predictors: Go/No-Go accuracy and reaction time costs (simple-Go Only and
complex-simple), Stroop accuracy and reaction time costs (incongruent-
congruent), WASI matrix IQ and verbal IQ (Wechsler, 2011), backward digit
score, visual working memory score and analogical reasoning score and time
taken. The stepwise method was chosen as no specific hypotheses were held
22
as to which variables would account for the greatest variance in science
accuracy or reaction times.
For the regressions investigating the associations of the pre-SMA and LDPFC
activations, the same control variables were entered in Block 1 using the enter
method (age, gender, accuracy/reaction time on control science questions).
Activation of the pre-SMA and LDPFC when answering the science control
questions and science misconception questions (correctly and incorrectly)
were entered into Block 2 separately using the stepwise method. Again, there
were no specific hypotheses as to which activation event would have the
greatest association.
Figure 8: Activation during Science Control and Misconception trials (lateral view of
the left hemisphere, medial view of the left hemisphere, lateral view of the right
hemisphere) voxel pFWE <0.05. ROIs used in the analyses are highlighted. NB:
pFWE is the family wise error rate, which is the probability that one of the values will
be greater than the alpha (Friston, Ashburner, Kiebel, Nichols, & Penny, 2007).
3. RESULTS3.1 Stroop taskA paired samples t-test was carried out to determine whether participants
scored significantly better when presented with congruent trials than
incongruent trials. Congruent trials of congruent and mixed blocks were
combined (although see Figure 9A for data split according to block type).
Participants were less accurate on incongruent trials (M = .61, SD = .09) than
congruent trials (M = .91, SD = .06) (t(19) = 20.299, p < .001, d = 4.55).
23
Figure 9: Numerical Stroop task behavioural results. (A) Accuracy (M ± SE) as a function of trial and block types. (B) RT (M ± SE) as a function of trial and block types.
Similarly a paired samples t-test was run on Stroop reaction time (RT) data.
Participants responded more slowly on incongruent trials (M = 748, SD = 61
ms) than on congruent trials (M = 641, SD = 48 ms, Figure 9B) (t(19) = -
19.179, p < .001, d = 4.28).
In summary, participants were significantly faster and more accurate on the
congruent compared to the incongruent trials.
3.2Go/No-Go tasksA one-way repeated measures ANOVA was run on the Go/No-Go accuracy
data to compare performance on the five trial types (Go in Go blocks, Go and
No-Go trials in simple blocks, and Go and No-Go trials in complex blocks).
Mauchly's test of sphericity indicated that the assumption of sphericity had
been violated, χ2(9) = 19.27, p = .024. Epsilon (ε) was 0.67 and was used to
correct the ANOVA. Results show a main effect of trial type (F(2.68, 48.22) =
12.79, p = <.001). Planned post-hoc comparisons show that there was no
difference in accuracy between Go trials in Go blocks and in simple blocks (p
> .05) however participants were less accurate in Go trials in complex blocks
than Go trials in Go blocks and in simple blocks (p = <.05). Within the
complex task, accuracy did not differ between Go and No-Go trials (p > 05),
while within the simple task, accuracy was lower in No-Go trials than Go trials
(p = <01) (Figure 10A).
24
Figure 10: Go/No-Go behavioural data. (A) Accuracy (M ± SE) as a function of trial and block types. (B) RT (M ± SE) as a function of trial and block types.
A one-way repeated measures ANOVA was carried out to determine the
effect of trial type (Go Only, simple Go and complex Go) on RT. There was a
significant effect of trial type (F(2, 38) = 74.545, p < .001, partial η2 = .80). RT
increased from Go Only trials (M = 423, SD = 35 ms) to simple Go trials (M =
470, SD = 39 ms) and complex Go trials (M = 508, SD = 44 ms), in that order
(Figure 10B). Post hoc analysis with a Bonferroni adjustment revealed that RT
statistically significantly differed between all three trial types (all p’s < .001)
(Figure 10B).
Overall, participants were faster on the Go trials compared with No-Go trials in
both the simple and complex versions of this task. Performance was higher on
the Go trials compared to the No-Go trials only in the simple version.
3.3Science TaskA paired samples t-test was carried out for accuracy on the science control
and misconception questions. Participants were less accurate on the
misconception questions (M = .86, SD = .15) than on the control questions (M
= .67, SD = .11) (t(19) = 6.287, p < .001, d = 1.41, Figure 11A).
25
Figure 11: Behavioural results of the Science task. . (A) Accuracy (M ± SE) as a function of trial and block types. (B) RT (M ± SE) as a function of trial and block types.
A paired samples t-test was also carried out for RT on the science control and
misconception questions. Participants took longer to answer the
misconception questions (M = 4946, SD = 735 ms) than the control questions
(M = 6081, SD = 913 ms) (t(19) = 9.24, p < .001, d = 2.07, Figure 11B).
Thus, participants were faster and more accurate when answering control
science questions than the misconception questions.
3.4Brain activations: pre-SMA and DLPFCActivation of the pre-SMA was lowest for misconception incorrect (M = 1.10,
SD = .69) but increased for control correct (M = 1.12, SD = .41) with the
highest activation for misconception correct (M = 1.20, SD = .56) (Figure
12A). However, the question type did not elicit statistically significant changes
in activation of the pre-SMA (F(2, 38) = .424, p = .658, partial η2 = .022).
26
Figure 12: Estimated changes in BOLD as a function of question type in the (A) pre-SMA and (B) DLPFC (M ± SE).
Activation of the left DLPFC was lowest for control correct (M = 1.31, SD
= .41) but increased for misconception incorrect (M = 1.32, SD = .54) with the
highest activation for misconception correct (M = 1.39, SD = .50) (Figure
12B). However, the question type did not elicit statistically significant changes
in activation of the DLPFC (F(2, 38) = .416, p = .663, partial η2 = .021). Thus
neither ROI showed differential activation as a function of trial type.
3.5 Regressions: accuracy on science misconception tasksi) Behavioural tasks
A multiple regression was run to determine whether performance on
behavioural tasks explained variance in accuracy on the science
misconception tasks. In addition to the control variables (Model 1), only WASI
verbal IQ and the two RT costs of the Go/No-Go task were selected in the
final model (Model 4). One score for visuospatial working memory (VSWM)
was missing from the data set and so this participant was not included in the
regression.
Visual examination of boxplots and analysis of Cook’s distance revealed an
outlier that was considerably further away from the other plots. This
participant (number 4) was excluded. As a result, the backward digit score
was found to be significant (Model 5), which improved the overall fit of the
model, increasing R2 by 8.3 %.
27
The final model (Model 5) contained age, gender, accuracy on control
questions, verbal IQ, Go/No-Go RT costs and backward digit score as
predictors of accuracy on misconception questions. This was statistically
significant, R2 = .90, F(7,10) = 12.370, p = .001; adjusted R2 = .82. Thus,
Model 5 accounted for 90% of the variance.
Unstandardised Coefficients
Standardised
Coefficients
t pBStd. Error Beta
Model 1R2 = .30,F(3, 14) = 1.993,
(Constant) .153 .305 .500 .625Age -.016 .025 -.176 -.634 .536Gender -.006 .049 -.028 -.125 .902Control Q. Acc. .840 .368 .631 2.281 .039
Model 2R2 = .59,R2 change = .29,F(1, 13) = 9.229,
(Constant) -.637 .355 -1.797 .096Age .004 .021 .040 .175 .864Gender -.002 .039 -.008 -.046 .964Control Q. Acc. .466 .317 .349 1.470 .165Verbal IQ .008 .002 .590 3.049 .009
Model 3R2 = .73,R2 change = .13,F(1, 12) = 5.801,
(Constant) -.561 .305 -1.843 .090Age .013 .018 .138 .683 .507Gender .007 .034 .031 .204 .842Control Q. Acc. .158 .299 .118 .526 .608Verbal IQ .009 .002 .683 4.026 .002RT Cost (comp. – sim.)
-.002 .001 -.408 -2.408 .033
Model 4R2 = .83,R2 change = .10,F(1, 11) = 6.379,
(Constant) -.232 .285 -.814 .433Age .004 .016 .046 .266 .795Gender .011 .028 .051 .396 .700Control Q. Acc. .045 .253 .034 .179 .861Verbal IQ .007 .002 .552 3.666 .004RT Cost (comp. – sim.)
-.002 .001 -.408 -2.897 .015
RT Cost (sim. – Go)
.002 .001 .383 2.526 .028
Model 5R2 = .90,R2 change = .10,F(7,10) = 12.370,Adjusted R2 = .82.
(Constant) -.047 .241 -.194 .850Age -.002 .013 -.022 -.156 .879Gender .013 .023 .059 .572 .580Control Q. Acc. -.117 .213 -.088 -.548 .596Verbal IQ .006 .002 .499 4.048 .002RT Cost (comp. – sim.)
-.002 .000 -.407 -3.573 .005
RT Cost (sim. – Go)
.002 .001 .456 3.622 .005
Back Digit Score .011 .004 .311 2.615 .026Table 1: Regression Model Coefficients – Behavioural Tests as Predictors of Accuracy on Misconceptions (significant predictors are highlighted in bold; RT is
28
reaction time).Higher accuracy on the misconception trials were related with higher scores
on the Vocabulary subtest of the WASI, a smaller cost of the 1-back load on
RT in the Go/No-Go tasks, a larger cost (i.e. slowing down) in mixed simple
Go/No-Go blocks compared to Go only blocks, and greater verbal working
memory scores. The standardised betas were similar across these four
variables, suggesting they accounted for a similar proportion of variance in
Science misconception accuracy.
ii) Activation in pre-SMA and DLPFC
A multiple regression was run to determine whether activation of the pre-SMA
brain region (when answering control or misconception questions correctly or
misconceptions incorrectly) had a relationship with accuracy on the science
misconception tasks. In addition to the control variables (Model 1), none of the
pre-SMA measures were selected into the model.
Visual examination of boxplots and analysis of Cook’s distance revealed an
outlier that was considerably further away from the other plots. This
participant (number 4 again) was excluded from the regression, which was
repeated. This revealed a new outlier based on Cook’s distance (number 19),
which was removed and the regression repeated. This did not improve the
model, which was not significant, R2 = .18, F(3,14) = 1.028, p = .410; adjusted
R2 = .01. Similar results were observed in the DLPFC, whereby brain
activations in neither trial type significantly accounted for variance in science
misconception performance, even when possible outliers were excluded.
3.6 Regressions: RT on science misconception tasksi) Behavioural tasks
Following a multiple regression, only reaction time on control questions was
had a significant relationship with reaction time on the misconception tasks.
One outlier was identified from visual examination of boxplots and analysis of
Cook’s distance. This participant (number 10) was excluded from the
regression, which was repeated. This did not find any additional significant
predictors but did improve the overall fit of the model, increasing R2 by 3.4 %.
29
See Table 2 for full details on the regression model.
The final model contained age, gender, and reaction time on control questions
as predictors for reaction time on misconception questions (Model 1) and was
statistically significant, R2 = .77, F(3,13) = 14.563, p =<.001; adjusted R2
= .72. Thus, the model accounted for 77% of the variance.
Unstandardised Coefficients
Standardised
Coefficientst pB Std. Error Beta
Model 1R2 = .77,F(3,13) = 14.563,Adj. R2 = .72.
(Constant) 2068.303 2253.591 .918 .375Age (years) -116.277 119.648 -.144 -.972 .349Gender code -51.391 257.543 -.027 -.200 .845Science RT (control Qs)
1.142 .212 .812 5.380 .000
Table 2: Regression Model Coefficients – Predictors of Reaction Time on Misconceptions (significant predictors are highlighted in bold; RT is reaction time).
ii) The relationship between activation in pre-SMA and DLPFC and RT
A multiple regression was run to determine if activation of the pre-SMA brain
region (when answering control or misconception questions correctly or
misconceptions incorrectly), were associated with reaction time on the
science misconception tasks. In addition to the control variables (Model 1),
none of the variables were entered into the model. Thus DLPFC activation
was not associated with RT in science misconception trials either.
In summary, accuracy on the science misconception tasks only had a
relationship with the reaction time cost on the Go/No-Go tasks, verbal IQ and
the backward digit score; whereby a higher cost between the simple Go/No-
Go and Go Only task, a higher verbal IQ score and a higher score on the
backward digit task was associated with better performance on the
misconception tasks. Interestingly, the higher the cost between the complex
and simple Go/No-Go task had a negative relationship with performance on
the misconception tasks. Activation of the regions of interest was not
associated with better performance. There was no relationship between the
behavioural tasks or brain activations and faster reaction times on the
misconception questions. The only variable that was associated with faster
reaction times was reaction times on the control questions.
30
4. DISCUSSION4.1 ResultsThe focus of this investigation was the relationship between cognitive skills
(e.g. inhibitory control), brain activation in the DLPFC and ACC and
performance when answering science misconception questions. Few studies
have investigated the neural correlates of inhibitory control in adolescents;
this study being the first one of its kind to investigate this in the context of
conceptual change in science education. The results show that only verbal IQ,
working memory and reaction time on the Go/No-Go task were significantly
associated with performance and only reaction time on control questions was
associated with reaction time on the misconception questions. Therefore, the
first hypothesis was partially accepted but the null hypothesis was accepted
for the second and third hypotheses.
The first hypothesis was that better performance on behavioural tasks
(inhibitory control, working memory and analogical reasoning) would be
associated with better accuracy and reaction time on science misconception
questions. A better verbal IQ, better working memory and longer reaction
times on the inhibitory control component of the Go/No-Go tasks associated
with better performance on the science misconception tasks. The role of
verbal IQ may be in allowing access to the language used in the questions.
Biology, chemistry and physics are dominated by a large number of technical
words that must be known and understood in order to recognise which
concept is being discussed. Science concepts are complex, drawing on laws,
characteristics, behaviours and relationships, which helps to explain why
verbal working memory was also found to be associated with better accuracy.
Solaz-Portolés & Sanjosé-López (2009) suggested that working memory may
be a moderator for inhibitory control, whereby it is responsible for not only
maintaining information but also in selecting what is relevant or not.
Interestingly, visuospatial working memory bore no relationship to improved
accuracy. It is possible that the science tasks did not access that component.
The diagrams that accompanied the trials may have mitigated visuospatial
31
working memory demands, possibly allowing participants with poorer VSWM
to not be disadvantaged.
The association of additional processing time with the Go/No-Go task could
reflect extra processing involved in inhibitory control, or that participants
slowed down to improve accuracy. Slowing down could be a distinct cognitive
process in itself, whereby a participant recognised that a task was more
challenging and so, by slowing down, allowed activation of the inhibitory
control mechanism to choose the correct response. Training students to
improve accuracy may involve encouraging them to slow down. This may give
them more time to activate the inhibitory control network. Performance on the
Stroop task, however, was not found to be significantly associated with
accuracy on science misconception problems. As discussed, Morooka et al.
(2014) found differences in performance on the Go/No-Go and Stroop task,
which suggests inhibitory control has distinct components. It is possible that
the phrasing of the misconception questions did not introduce sufficient
interference to discriminate between participants. It would be interesting if
misconception questions were modified to introduce additional interference,
possibly through the use of multiple choice questions. This may be an
interesting avenue to explore since multiple choice questions are often
undervalued by teachers and students alike, who tend to underestimate their
difficulty (Harrington, 2014; Meadows, 2016).
The second and third hypotheses were that the ACC and DLPFC would be
activated more when answering misconception questions than control
questions and that greater activation would be associated with longer reaction
times and improved accuracy. Whilst the left DLPFC was activated more
during the science tasks, the ACC was not. The pre-SMA, whilst not a
planned region of interest, was also found to be significantly activated relative
to baseline. These are interesting findings, since the ACC is associated with
error recognition in children and adults. This suggests that adolescents may
use the ACC differently to children and adults, instead using a different brain
structure for error detection. As described above, the DLPFC is associated
with inhibitory control, specifically in interference or conflict resolution. It may
32
be the case that in adolescents the ACC does not pick up the possible conflict
of misconception problems (considering control and misconception trials in
the same way) and so does not show increased activation in misconception
than control trials. In turn, the ACC does not call for greater activation in the
DLPFC to resolve the conflict, and therefore neither activation in the ACC nor
in the DLPFC correlates with misconception performance.
Brault Foisy, Potvin, et al. (2015) found that novices showed greater activation
of the pre-SMA when looking at scientific stimuli compared to experts.
Activation in this brain region has been associated with organising and
preparing voluntary movement. It is possible that greater effort was needed by
the participants due to unfamiliarity with the task and the additional challenge
of the more complex questions. Criaud & Boulinguez (2013) carried out a
meta-analysis of fMRI studies into the role of the pre-SMA and concluded that
the pre-SMA was mostly involved with working memory and engagement of
attentional control. It could be that the stimuli resulted in a high demand for
resources, particularly from working memory.
In summary, the results of this study suggest that domain general skills
relating to verbal IQ and working memory, as well as possibly slower and
more careful responses, are associated with better accuracy in misconception
science problems. In the brain there is little difference between control and
misconception trials, with general recruitment of DLPFC and pre-SMA,
extending into the ACC, which were not specifically associated with better
performance on misconception trials.
4.2 LimitationsThe sample size in this study was just 20 adolescents. It is common for fMRI
studies to involve such small sizes but it does present difficulty when
interpreting and generalising results, especially when running correlational
analyses investigating individual differences. Whilst outliers were identified
and removed, in such small sample sizes the influence of each individual
result on the mean is greater than when larger samples are used.
Associations between inhibitory control performance and neural activation and
33
science misconception may still be observed in larger samples. As discussed,
a growing body of research is discovering that in adolescents, there are
significant structural and functional changes throughout this stage of
development. Whilst we considered the sample to represent adolescence in
general, the range of ages covered 11 years to 15 years, which could be
associated with considerable changes in the brain. Ideally future studies
would recruit more participants of each year group to identify age-related
overlaps between the neural networks underlying science reasoning and
inhibitory control.
The previous study by Brookman (2015) echoed the findings of the present
study whereby students’ performance was worse on the misconception
questions than the control questions. This would suggest that these questions
were more difficult because they challenged the participants’ misconceptions
and demanding inhibition of these ideas to allow the scientific ones to be
expressed. The limitation of this is that students were not pre-tested to
confirm which misconceptions they held. Indeed much research in conceptual
change presumes that poor performance reflects a student’s misconception,
rather than simply not knowing the answer. diSessa (2006) argued that even
where no prior misconception exists, learning new science concepts is
challenging and takes time. Pre-testing participants to establish which
misconceptions they held prior to completing the science tasks would give
greater insight into the reasons for poor performance and in which situations
inhibitory control could arguably be unnecessary where no prior
misconception existed.
4.3 Implications and Future ResearchMasson et al. (2014) argued that students need to develop skills necessary to
identify when inhibitory control is an appropriate response. Luna & Sweeney
(2004) argued that inhibitory control is not consistently applied until the brain
has fully matured. This presents a challenge for adolescents. They are
entering a period of intense educational demand in preparation for life-
changing examinations. To be successful, they need to develop a range of
cognitive skills that demand the use of different brain regions. Whilst various
34
factors can influence the trajectories of brain and cognitive development,
adolescents are somewhat constrained by the physical structures and neural
pathways of their brain. This presents an incongruity between the expectation
of conceptual change across science subjects and the limitations of
adolescent developmental trajectories. It is possible that too much is expected
at too young an age. Dawson (2014) found that conceptual change is not an
all-or-nothing event and is not always permanent. Conceptual change occurs
faster and with less training for some topics than others. Houdé (2000) argued
that conceptual change was not simply the acquisition of new knowledge but
was also developing metacognitive skills: the awareness of when it was
appropriate to draw on a particular concept over another in a particular
situation. It is possible that some misconceptions are harder to inhibit than
others for certain students at certain ages. Improving our understanding of the
development of conceptual change may allow the development of more
appropriate curricula across the Key Stages that match the cognitive abilities
of adolescents.
A growing body of research has investigated the potential for inhibitory control
training. As is the common theme in conceptual change research, the results
have been inconsistent. Overall, the greatest effect of training has been for
lower ability students, those with learning difficulties or those from a lower
socioeconomic status (Hackman, Gallop, Evans, & Farah, 2015; Neville et al.,
2013). The participants in the current study, and in many others cited in this
dissertation, demonstrated that inhibitory control of misconceptions is not an
all or nothing event. Importantly, other cognitive abilities not assessed in the
current study may play a role, beyond the suggested role of inhibitory control.
If a student does not recognise conflict to begin with then they may not initiate
appropriate inhibitory control. Successful training programmes may well focus
on techniques for coding the correct scientific concept and the associated
metacognitive skills. Once this has been achieved, the task of inhibiting the
misconception is arguably the same for each concept; it is having the
awareness of which is correct and which is the misconception that may be key
to successful conceptual change across all domains (Houdé, 2000).
35
A longitudinal fMRI study with children as they enter adolescence and
adulthood would potentially reveal the minute changes that take place in
structure and function of the brain. Scanning before, during and after a topic is
taught (both for topics that relate to prior misconceptions as well as brand new
concepts) would help reveal how adolescents at different ages achieve
conceptual change and how the neural strategies correlate with the
development of other cognitive processes. Inhibitory control strategies may be
domain-specific and domain-general.
Whilst some work has been carried out to identify when certain
misconceptions are created, very little is still known about what conditions are
needed for a child to develop these and in what conditions they disappear.
What we still do not know is whether neural architecture differs for different
misconceptions; whether there is any qualitative or quantitative difference in
neural activation for successful conceptual change across different concepts
within and between subjects. Rowell and Dawson (1985) were unable to
predict which students would respond to different types of instruction or how
long the effects of any observed conceptual change would last. Combining
such information with neural correlates would shed greater light on the
cognitive abilities of children at different ages and how it relates to specific
contexts of the level of misconception they have. Future studies could
incorporate this by presenting more material that was constrained to a specific
subject. A study such as this could investigate different strategies and/or
activation levels that are required for older misconceptions (e.g. heavier
objects falling faster) than newer ones and for misconceptions from different
domains (e.g. a movement misconception about gravity compared with a
relational misconception about evolution). This could potentially lead to more
specific teaching methods that are more appropriate for each of the three
sciences typically taught during adolescence.
Finally, this study recruited participants from a selective public secondary
school and represented a cohort with a high socioeconomic status (SES).
Though this was not directly measured, SES is a significant factor on
cognitive skills, academic performance and neural development. The
36
consequences of which could mean significantly different approaches to
teaching and training for conceptual change. A second group of lower SES
adolescents is currently being recruited for this study,
4.4 ConclusionBruer (1997) argued that neuroscience is a bridge too far when using
neuroimaging data to inform teaching. As a teacher of science, it is difficult not
to see the immediate benefits of using neuroimaging to help identify which
misconceptions are prevalent at different ages and how they are overcome. A
great deal of intervention in teaching is through trial and error with some
techniques working for some teachers with specific classes but not with
others. Identifying a neural basis for techniques that activate metacognitive
and inhibitory control skills would be invaluable when confronted with a class
of 20 or more individual students, each at a different stage of conceptual
change. Being able to tailor teaching using robust theory-driven practices is
arguably the way forward (Tommerdahl, 2010).
Understanding how students develop, both in terms of their misconceptions
and cognitive skills, will shed greater light on appropriately matched curricula
(Devonshire & Dommett, 2010). The developmental trajectories of
metacognition and inhibitory control skills may in part determine when such
conceptual change is possible and for which scientific concepts. It may be the
case that conceptual change is expected for certain topics that are beyond the
stage of neural development of some students. Not only will greater
understanding of the neural correlates help develop age-appropriate curricula
but it also opens the possibilities of training methods to help students
accelerate such development. This may be particularly beneficial to students
with learning difficulties and those from lower socioeconomic status.
Socioeconomic status remains one of the most significant factors affecting
cognitive development and academic success. Children living in poverty, in
particular, are at greater risk of failing public examinations, leaving school
early and obtaining lower level jobs than those from higher SES backgrounds.
Arguably these students could benefit the most from the fruits of conceptual
37
change research. Improving skills in inhibitory control and metacognition may
be a way of leveling the playing field. This is particularly important at
adolescence, which marks the start of intense public examinations; the results
of which significantly affect the opportunities available to adolescents and
their subsequent experience in adulthood.
38
5. REFERENCESAdleman, N. E., Menon, V., Blasey, C.M., White, C.D., Warsofsky, I.S.,
Glover, G.H., & Reiss, A.L. (2002). A developmental fMRI study of the
Stroop color-word task. Neuroimage 16, 61–75.
Babai, R., & Amsterdamer, A. (2008). The persistence of solid and liquid
naive conceptions: a reaction time study. Journal of Science
Education & Technology, 17(6), 553–9.
Borst, G., Simon, G., Vidal, J., & Houdé, O. (2013). Inhibitory control and
visuospatial reversibility in Piaget's seminal number conservation
task: a high-density ERP study. Frontiers of Human Neuroscience, 7,
920.
Botvinick, M. (2007). Conflict monitoring and decision making: Reconciling
two perspectives on anterior cingulate function. Cognitive, Affective &
Behavioural Neuroscience, 7(4), 356–66.
Brault Foisy, L.-M., Ahr, E., Masson, S., Borst, G., & Houdé, O. (2015).
Blocking Our Brain: How We Can Avoid Repetitive Mistakes! Frontiers
for Young Minds, 3(1), 616–9.
Brault Foisy, L.-M., Potvin, P., Riopel, M., & Masson, S. (2015). Is inhibition
involved in overcoming a common physics misconception in
mechanics? Trends in Neuroscience and Education, 4(1-2), 26–36.
Brookman, A. (2015). The role of inhibitory control in adolescents’ science
and maths reasoning. London, UK: Birkbeck, University of London.
Bruer, J. T. (1997). Education and the brain: a bridge too far. Educational
Researcher, 26(8), 1–13.
Cacioppo, J. T., Berntson, G. G., & Nusbaum, H. C. (2008). Neuroimaging as
a New Tool in the Toolbox of Psychological Science. Current
Directions in Psychological Science, 17(2), 62-67.
Chi, M. T. H. (2005). Commonsense Conceptions of Emergent Processes:
Why Some Misconceptions Are Robust. Journal of the Learning
Sciences, 14(2), 161-199.
Chi, M. T. H. (2008). Three types of conceptual change: belief revision,
mental model transformation, and categorical shift. In S. Vosniadou
39
(Ed.) International handbook of research on conceptual change (pp.
61–82). New York: Routledge.
Cragg, L., & Nation, K. (2008). Go or no-go? Developmental improvements in
the efficiency of response inhibition in mid-childhood. Developmental
Science, 11(6), 819-827.
Criaud, M., & Boulinguez, P. (2013). Have we been asking the right questions
when assessing response inhibition in go/no-go tasks with fMRI? A
meta-analysis and critical review. Neuroscience and Biobehavioural
Review, 37, 11-23.
Crittenden, B. M., & Duncan, J. (2014). Task Difficulty Manipulation Reveals
Multiple Demand Activity but no Frontal Lobe Hierarchy. Cerebral
Cortex, 24(2), 532-40.
Crone E. A., & Dahl R. E. (2012). Understanding adolescence as a period of
social-affective engagement of goal flexibility. National Review of
Neuroscience, 13, 636–650.
Dawson, C. (2014). Towards a Conceptual Profile: Rethinking Conceptual
Mediation in the Light of Recent Cognitive and Neuroscientific
Findings. Research in Science Education, 44(3), 389–414.
De Neys, W., & Vartanian O, Goel V. (2008). Smarter than we think: when our
brains detect that we are biased. Psychological Sciences, 19(5), 483–
9.
Dempster, F. N., & Brainerd, C. J. (Eds.). (1995). Interference and Inhibition in
Cognition. New York: Academic Press.
Devonshire, I. M., & Dommett, E. J. (2010). Neuroscience: viable applications
in education? The Neuroscientist, 16(4), 349–356.
Diamond, A. (1998). Understanding the A-not-B error: working memory vs.
reinforced response, or active trace vs. latent trace. Developmental
Science, 1, 185-189.
diSessa, A. A. (2006). A history of conceptual change research: threads and
fault lines. In R. K. Sawyer (Ed.), Cambridge Handbook of the
Learning Sciences (pp. 265-281). Cambridge, UK: Cambridge
University Press.
Driver, R., Guesne, E., & Tiberghien, A. (1985). Children's ideas in science.
Milton Keynes: Open University Press.
40
Driver, R., Squires, A., Rushworth, P., & Wood-Robinson, V. (2015). Making
sense of secondary science: Research into children’s ideas (Classic
edition). New York: Routledge.
Duit, R., & Treagust, D. F. (2003). Conceptual change: A powerful framework
for improving science teaching and learning. International Journal of
Science Education, 25(6), 671-688.
Dumontheil, I., Houlton, R., Christoff, K., & Blakemore, S. J. (2010).
Development of relational reasoning during adolescence.
Developmental Science, 13, F15–F24.
Dunbar, K., Fugelsang, J., & Stein, C. (2007). Do naive theories ever go
away? Using brain and behavior to understand changes in concepts.
In M. Lovett, & P. Shah (Eds.), Thinking with data (pp. 193-206). New
York: Lawrence Erlbaum Associates.
Duncan, J. (2010). The multiple-demand (MD) system of the primate brain:
mental programs for intelligent behaviour. Trends in Cognitive
Science, 14(4), 172-9.
Durston, S., Thomas, K. M., Yang, Y., Uluğ, A. M., Zimmerman, R. D., &
Casey, B. J. (2002), A neural basis for the development of inhibitory
control. Developmental Science, 5, F9–F16.
Engle, R. W. (2005). Working memory capacity and inhibition. Paper
presented at the 2005 meeting for the Place of Inhibitory Progress in
Cognition, Arlington, TX.
Flavell, J. (1985). Cognitive development (2nd edition). Englewood Cliffs:
Prentice-Hall.
Friston, K., Ashburner, J., Kiebel, S., Nichols, T., & Penny, W. (2007).
Statistical Parametric Mapping: The Analysis of Functional Brain
Images. London: Academic Press.
Fugelsang, J. A., & Dunbar, K. N. (2005). Brain-based mechanisms
underlying complex causal thinking. Neuropsychologia, 43(8), 1204–
1213.
Gazzaniga, M. S., Ivry, R. B., & Mangun, G. R. (2009). Cognitive
neuroscience. The biology of mind (3rd edition). New York: W.W.
Norton & Company.
41
Goswami, U. (2008). Byron review on the impact of new technologies on
children: a research literature review: child development. Prepared for
the Byron Review, Safer children in a digital world. Annex H.
Annesley: DCSF Publications.
Gropen, J., Clark-Chiarelli, N., Hoisington, C., & Ehrlich, S. B. (2011). The
importance of executive function in early science education. Child
Development Perspectives, 5(4), 298–304.
Hackman, D. A., Gallop, R., Evans, G. W., & Farah, M. J. (2015).
Socioeconomic status and executive function: developmental
trajectories and mediation. Developmental Science, 18(5), 686-702
Harrington, C. (2014). Student success in college. Boston: Wadsworth.
Houdé, O. (2000). Inhibition and cognitive development: object, number,
categorization, and reasoning. Cognitive Development, 15, 63-73.
Huttenlocher, P. R. (2002). Neural plasticity. Cambridge, MA: Harvard
University Press.
Inagaki, K., & Hatano, G. (2002). Young children’s naïve thinking about the
biological world. New York: Psychology Press.
Karmiloff-Smith, A. (1988). The child as a theoretician, not an inductivist. Mind
and Language, 3(3), 183-195.
Koolschijn, P. C., Schel, M. A., de Rooij, M., Rombouts, S. A., & Crone, E. A.
(2011). A three-year longitudinal functional magnetic resonance
imaging study of performance monitoring and test-retest reliability
from childhood to early adulthood. Journal of Neuroscience, 31,
4204–4212.
Leech, R., Mareschal, D., & Cooper, R. P. (2007). Relations as
transformations: Implications for analogical reasoning. The Quarterly
Journal of Experimental Psychology: Human Experimental
Psychology, 60, 897– 908.
Lewandowsky, S., & Li, S. C. (1995). Catastrophic interference in neural
networks. Causes, solutions, and data. In F. N. Dempster & C. J.
Brainerd (Eds.), Interference and Inhibition in Cognition (pp. 329-361).
New York: Academic Press.
Lombrozo T., Kelemen D., & Zaitchik D. (2007). Inferring design.
Psychological Sciences, 18(11), 999–1006.
42
Luna, B., & Sweeney, J. A. (2004). The emergence of collaborative brain
function. fMRI studies of the development of response inhibition.
Annals of the New York Academy of Science, 1021, 296–309.
Marcovitch, S., & Zelazo, P. D. (1999). The A-not-B error: Results from a
logistic meta-analysis. Child Development, 70, 1297-1313.
Mason, R. A., & Just, M. A. (2016). Neural Representations of Physics
Concepts. Psychological Science, 27(6), 904–913.
Masson, S., Potvin, P., Riopel, M., & Brault Foisy, L.-M. (2014), Differences in
Brain Activation Between Novices and Experts in Science During a
Task Involving a Common Misconception in Electricity. Mind, Brain,
and Education, 8, 44–55.
Masson, S., Potvin, P., Riopel, M., Brault Foisy, L.-M., & Lafortune, S. (2012).
Using fMRI to study conceptual change: why and how? International
Journal of Environmental & Science Education, 7(1), 19–35.
Meadows, M. (2016). How to choose your exam board. London: The Office for
Qualifications and Examinations Regulation.
Mefoh, P. C. (2010). Gender differences in proactive, retroactive and no
interference conditions. Gender & Behaviour, 8(2), 3036–3047.
Morooka, T., Ogino, T., Takeuchi, A., Hanafusa, K., Oka, M., & Ohtsuka, Y.
(2014). Relationships between the color-word matching Stroop task
and the Go/No-Go task: Toward a multifaceted assessment of
attention and inhibition abilities of children. Acta Medica Okayama,
66(5), 377-386.
Nelson, J. K., Lizcano, R. A., Atkins, L., & Dunbar, K. (2007). Conceptual
judgments of expert vs. novice chemistry students: an fMRI study. In:
Proceedings of the 48th annual meeting of the Psychonomic Society.
Long Beach: California.
Neville, H. J., Stevens, C., Pakulak, E., Bell, T. A., Fanning, J., Klein, S. &
Isbell, E. (2013). Family-based training program improves brain
function, cognition and behaviour in lower socioeconomic status pre-
schoolers. Proceedings of the National Academy of Sciences,
110(29), 12138-12143.
O’Hare, E. D. & Sowell, E. R. (2008). Imagine human developmental changes
in the grey and white mater of the human brain. In C. A. Nelson & M.
43
Luciana (Eds.), Handbook of developmental cognitive neuroscience
(2nd edition), pp. 23-38). Cambridge, MA: MIT Press.
Ohlsson, S. (2009). Resubsumption: a possible mechanism for conceptual
change and belief revision. Educational Psychologist, 44(1), 20–40.
Piaget, J. (1997). The Child's Conception of Number. London: Routledge.
Posner, G. J., Strike, K. A., Hewson, P. W., & Gertzog, W. A. (1982).
Accommodation of a scientific conception: Toward a theory of
conceptual change. Science Education, 66(2), 211-227.
Rowell, J. A., & Dawson, C. J. (1977). Teaching about floating and sinking: an
attempt to link cognitive psychology with classroom practice. Science
Education, 61(2), 245–253.
Rowell, J. A., & Dawson, C. J. (1985). Equilibration, conflict and instruction: a
new classroom oriented perspective. European Journal of Science
Education, 7, 331–334.
Rubia, K., Smith, A.B., Woolley, J., Nosarti, C., Heyman, I., Taylor, E., &
Brammer, M. (2006). Progressive increase of frontostriatal brain
activation from childhood to adulthood during event-related tasks of
cognitive control. Human Brain Mapping, 27(12), 973-93.
Smith, A. B., Halari, R., Giampetro, V., Brammer, M., & Rubia, K. (2011).
Developmental effects of reward on sustained attention networks.
Neuroimage, 56, 1693–1704.
Solaz-Portolés, J. J., & Sanjosé-López, V. (2009). Working memory in science
problem solving: a review of research. Revista Mexicana de
Psicologia, 26, 79–90.
Tamm, L., Menon, V., & Reiss, A. L. (2002). Maturation of brain function
associated with response inhibition. Journal of the American Academy
of Child and Adolescent Psychiatry, 41, 1231–1238.
Tommerdahl, J. (2010). A model for bridging the gap between neuroscience
and education. Oxford Review of Education, 36(1), 97–109.
Velanova, K., Wheeler, M. E. & Luna, B. (2008). Maturational changes in
anterior cingulate and frontoparietal recruitment support the
development of error processing and inhibitory control. Cerebral
Cortex, 18, 2505–2522.
44
Vosniadou, S. (2002). On the nature of naïve physics. In M. Limon & L.
Mason (Eds.), Reconsidering conceptual change: Issues in theory
and practice (pp. 61-76). Dordrecht: Kluwer Academic Publishers.
Watanabe, J., Sugiura, M., Sato, K., Sato, Y., Maeda, Y., Matsue, Y., … &
Kawashima, R. (2002). The human prefrontal and parietal association
cortices are involved in NO-GO performances: An event-related fMRI
study. NeuroImage, 17(3), 1207–1216.
Wechsler, D. (2011). Wechsler abbreviated scale of intelligence (WASI-II)
(2nd edition). New York, USA: Pearson.
Wiser, M., & Carey, S. (1983). When heat and temperature were one. In D.
Gentner & A. Stevens (Eds.), Mental models (pp. 267-298). Hillsdale,
NJ: Lawrence Erlbaum Associates.
45
6. APPENDIX6.1 MRI data acquisition and analysis (prepared by lead investigator)A 1.5 Tesla Siemens Avanto MRI scanner was used to acquire T1-weighted
structural images and T2-weighted echo-planar volumes with blood-oxygen
level dependent (BOLD) contrast (TR = 1 s, TE = 45 ms) comprising 44 slices,
with a resolution of 3 x 3 x 3 mm. There were six functional runs lasting
between 4.5 and 6.5 min, and one structural scan that lasted 5.5 min, which
typically took place after the first 4 functional runs. The first 4 volumes of each
functional run were discarded to allow for T1 equilibrium effects.
MRI data were preprocessed and analysed using SPM12 (Statistical
Parametric Mapping, Wellcome Trust Centre for Neuroimaging,
www.fil.ion.ucl.ac.uk/spm/ software/spm12/). Functional images were
realigned, spatially normalised and smoothed. Images were realigned using a
second-degree B-spline interpolation for estimation and resliced using a
fourth-degree B-spline interpolation (SMP12 defaults). Realignment estimates
were used to calculate framewise displacement (FD) for each volume which is
a scalar measure of head motion across the six realignment estimates (Siegel
et al., 2014). Volumes with an FD greater than 0.9 mm were censored and
excluded from the general linear model (GLM) estimation, through inclusion of
a regressor of no interest for each censored volume. Scanning runs with more
than 15% of volumes censored were excluded from the analysis (one science
run for one participant). Structural images were coregistered to the mean
realigned functional image, and segmented on the basis of Montreal
Neurological Institute (MNI) registered International Consortium for Brain
Mapping tissue probability maps. Resulting spatial normalisation parameters
were applied to the realigned images to obtain normalised functional images
with a voxel size of 3 x 3 x 3 mm, which were smoothed with an 8-mm full-
width at half maximum Gaussian kernel.
Scanning runs were treated as separate time series and each series was
modelled by a set of regressors in the GLM. Runs of the science and maths
task were each modelled by box-car regressors separately modelling Control
46
Correct, Control Incorrect (if any incorrect responses were given),
Misconception Correct and Misconception Incorrect trials. Durations varied
and corresponded to the RT on each trial. Fixation and the arrows task were
modelled implicitly. All regressors were convolved with a canonical
haemodynamic response function and, together with the separate regressors
representing each censored volume and the mean over scans, comprised the
full model for each session. For the purpose of this research dissertation, two
regions of interest (ROIs) relevant to the science runs of the science and
maths task were identified. A contrast combining the parameter estimates of
all Science trials was run at the first level and entered into a one sample t-test
analysis at the second-level. The resulting SPM map, showing regions of
increased BOLD signal during the correct resolution of science problems, was
family-wise error corrected at p < .05 at the cluster level, with an uncorrected
threshold of p < .001 at the voxel level. The pre-supplementary motor area
(pre-SMA, peak co-ordinates -2, 19, 50, cluster size 212) and the left
dorsolateral prefrontal cortex (DLPFC, peak co-ordinates -45, 13, 32, cluster
size 568) (see Figure 13) were selected as cluster ROIs for further analyses.
Marsbar (http://marsbar.sourceforge.net/) in SPM12 was used to calculate the
mean parameter estimates for the conditions Science Misconception
Incorrect, Science Misconception Correct, and Science Control Correct, in
each ROI, for each participant. These values were copied into SPSS and
analysed as described in the main text.
Figure 13. Clusters identified as ROIs in the science runs of the science and maths task: the left DLPFC (red) and the pre-SMA (blue). ROIs are shown on the average of the 20 participants’ structural scans. Left: x = 30; middle: y = 126; right: z = 119.
47