Eliminating unnecessary workload

The ‘Workload Challenge’ consultation ran between 22 October and 21 November 2014. In February 2015 the analysis of this survey was published. The survey asked three main questions about workload:

  1. Tell us about the unnecessary and unproductive tasks which take up too much of your time. Where do these come from?
  2. Send us your solutions and strategies for tackling workload – what works well in your school?
  3. What do you think should be done to tackle unnecessary workload – by government, by schools or by others?

The consultation received 43,832 responses in total, but only 16,820 respondents answered all three open-ended questions about workload. A systematically selected sample of 10% of the full responses was selected for detailed analysis, equating to 1,685 survey respondents.

Of course, many of the things teachers do outside of teaching classes aren’t entirely ‘unnecessary’ and ‘unproductive’ – the analysis noted that many tasks were identified by teachers as related to essential parts of working within a school, but that the time and volume of the tasks were so great that they were unable to complete them even when working much longer than their contracted hours.

The report identified unwarranted ‘detail’, ‘duplication’ and ‘bureaucracy’ as key elements of excessive workload. These related most often to lesson planning, assessment (including marking) and reporting administration (82%).

Teachers reported that the key drivers of workload were perceived requirements of Ofsted / accountability (53%) and tasks set by school leaders (51%).

In response the DFE set up an Independent Teacher Workload Review Group. This group has recently released three reports looking at Data Management, Planning and Resources, and Marking. There are some interesting discussions about the causes of excessive workload in each of these areas – but I’ll simply list some of the key recommendations first:

Eliminating unnecessary workload associated with data management

The report recommends that only data which is ‘purposeful, valid and reliable’ should be collected. One of the issues it identifies is what they call ‘gold-plating’ – collecting everything ‘just in case’ it is needed for accountability purposes. Where the collection of data becomes an end in itself the report suggests it is not simply unnecessary, it is damaging. It also recommends that summative data should not normally be collected more than three times a year per pupil.

Poor data, for example tracking based on formative assessment, can provide a ‘false comfort’. It gives the false impression that a numerical measure of pupil progress can be tracked and used to draw a wide range of conclusions about pupil and teacher performance when the data are flawed (i.e. neither reliable or valid).

Schools should use data in the format available – Ofsted, for example, does not require data in any particular format – and the report suggests that school leaders actively avoid asking for the duplicate collection of data. In short ‘collect once, use many times.’

The report suggests that schools should analyse the cumulative impact on workload of new initiatives before implementing new data collection systems.  It also suggests that schools should be prepared to stop an activity which is time-consuming and has limited value: i.e. not assume that collection or analysis must continue just because it always has

Eliminating unnecessary workload around planning and teaching resources

The report argues that teachers spend too long planning and resourcing lessons. It makes a key distinction between ‘the daily lesson plan’ and ‘lesson planning’. It suggests that school leaders requiring the production of ‘daily written lesson plans’ are using them as proxy evidence for an accountability ‘paper trail’ rather than an effective process of planning for pupil progress and attainment. The report identifies the reaction of school leaders to the real and perceived demands made by Government and Ofsted as a principal cause of this. This unnecessary accountability paperwork often becomes a fairly pointless ‘box-ticking’ exercise and creates a ‘false comfort’ of purpose (the mere appearance of ‘doing something’ to raise school standards).

Perhaps the main issue with resourcing is teachers having to ‘reinvent the wheel’ in the absence of good quality textbooks and fully-resourced schemes of work. Once collaboratively developed schemes of work are in place, the report suggests that individual teachers should be free to teach in a way informed by their professional judgement and experience.

The report discusses some of the cultural mistrust of textbooks in many English schools. This cultural bias is one factor driving increased workload and the report argues that a cultural shift is required – one where high-quality textbooks are seen as part of a recipe – a useful base but still requiring the flair of the chef.

A great deal the recent escalation in workload here is probably due to the rapid changes in curriculum and specifications over the past few years. The report suggests that the DFE should commit to providing sufficient lead-in times for changes for which the sector will have to undertake significant planning to implement.

Eliminating unnecessary workload around marking

The report argues that providing written feedback on pupils’ work has become disproportionately valued by schools and has become unnecessarily burdensome for teachers. It sums up as ‘deep marking’ some of this unnecessary practice – which covers lots of varieties of this practice like dialogic marking, triple marking and quality marking. It suggests decisions by school leaders regarding marking have been in response to distorted ideas about Assessment for Learning and the presence of Ofsted reports which praised particular methods of marking.

The report suggests that ineffective marking looks like this:

  • It usually involves an excessive reliance on the labour intensive practices under our definition of deep marking, such as extensive written comments in different colour pens, or the indication of when verbal feedback has been given by adding ‘VF’ on a pupil’s work.
  • It can be disjointed from the learning process, failing to help pupils improve their understanding. This can be because work is set and marked to a false timetable, and based on a policy of following a mechanistic timetable, rather than responding to pupils’ needs.
  • It can be dispiriting, for both teacher and pupil, by failing to encourage and engender motivation and resilience.
  • It can be unmanageable for teachers, and teachers forced to mark work late at night and at weekends are unlikely to operate effectively in the classroom.

It also makes the point that there is little robust evidence to support the current widespread practice of extensive written comments (an EEF review is looking at existing evidence on marking and identifying gaps in research – and should be published fairly soon) . They recommend that school leaders should also challenge emerging ‘fads’ that indirectly impose excessive marking practices on schools.

The report suggests that effective marking is ‘meaningful’, ‘manageable’ and ‘motivating’:

  • Meaningful: marking varies by age group, subject, and what works best for the pupil and teacher in relation to any particular piece of work. Teachers are encouraged to adjust their approach as necessary and trusted to incorporate the outcomes into subsequent planning and teaching.
  • Manageable: marking practice is proportionate and considers the frequency and complexity of written feedback, as well as the cost and time-effectiveness of marking in relation to the overall workload of teachers. This is written into any assessment policy.
  • Motivating: Marking should help to motivate pupils to progress. This does not mean always writing in-depth comments or being universally positive: sometimes short, challenging comments or oral feedback are more effective. If the teacher is doing more work than their pupils, this can become a disincentive for pupils to accept challenges and take responsibility for improving their work.

What causes excessive workload for teachers?

The report identifies many plausible reasons why workload has become so unmanageable for school teachers over recent years, for example:

  • Rapid changes to curriculum, exam specification and school structures arising from the DFE
  • Historical demands from Ofsted and the perception of ‘what Ofsted wants’
  • School leaders ‘gold-plating’ the evidence they think they ‘might need’ to justify their decisions

However, in my opinion it perhaps misses a key driver for this workload. In ‘What’s driving workload in schools?’, I suggest that excessive demands made upon teachers arise from the inherent uncertainty for school leaders and teachers created by high-stakes judgements made about their effectiveness or capability arising from measures of school or teacher performance which lack validity and reliability. The education white paper (Educational Excellence Everywhere) makes a proposal which might help: to remove the separate Ofsted judgements for Teaching and Learning from future inspections. This seems a pragmatic move – given T&L grades tend to correlate with student achievement anyway, I’m not sure the separate grade often tells school leaders very much – and this might further undermine the distortion to planning, marking and assessment which are so very often given spurious justification by ‘what Ofsted wants’.

Posted in Education policy | Tagged , , | 3 Comments

Lesson observations: Would picking a top set get you a better grading?

Lesson observations: Approach with caution!

For any measure of teaching effectiveness to be useful, it needs to be valid. To be valid, a measure also needs to be reliable. Reliability represents the consistency of a measure. A measure is said to have a high reliability if it produces similar results each time – for example if two observers independently rate the same lesson, those ratings should agree with one another. Validity represents the extent to which a measurement corresponds to what it aimed to measure. So a valid observation would measure genuine learning gains, rather than be subject to bias.

We’ve known for some time that classroom observations lack the reliability for high-stakes judgements of teacher effectiveness. For example, the MET project – which spent millions of dollars to produce robust observation protocols – found that even for these carefully constructed observation measures, the reported reliabilities of observation instruments used in the MET study range from 0.24 to 0.68. Rob Coe gives a great example of why such low reliability represents a problem for teacher appraisal.

“One way to understand these values is to estimate the percentage of judgements that would agree if two raters watch the same lesson. Using Ofsted’s categories, if a lesson is judged ‘Outstanding’ by one observer, the probability that a second observer would give a different judgement is between 51% and 78%.”

Classroom observation: it’s harder than you think

Evidence of poor validity has also been around for a while. For example, Strong et al (2011) asked participants to watch videos of teaching and asked them to rate whether the teacher was ‘effective’ or ‘ineffective’. These ratings were compared to the value-added scores for the students of those teachers. Where the observers had not received specific training on observations, even experienced teachers and head teachers matched ‘effective’ teaching to high value-added less than 50% of the time. Again, Rob Coe gives an example of what this would mean for classroom teachers being graded in the UK.

“Fewer than 1% of lessons judged inadequate and only 4% of lessons judged outstanding produce matching learning gains. Overall, 63% of judgements will not correspond with value-added.”

Classroom observation: it’s harder than you think

Why is observation such a problematic measure of effective teaching?

There are lots of possible reasons why observations may not be a valid or reliable measure of teaching: Learning is invisible – it takes place in a student’s head and anything that we can see in the classroom is merely a proxy for that learning, is one problem. Another problem is the fact that observers likely have strong ideas about what ‘good practice’ looks like – whether those practices lead to learning gains is another matter. Teaching is also based on a natural ability – something humans have evolved to do – therefore even experienced teachers will find it really difficult to explicitly describe what it is they do.

However, there also appears to be another, quite simple, reason why using observations to make high-stakes judgements about teaching tend to lack validity: It seems the prior attainment of students in a class biases the ratings of an observer.

Steinberg, M. P., & Garrett, R. (2016). Classroom Composition and Measured Teacher Performance What Do Teacher Observation Scores Really Measure?. Educational Evaluation and Policy Analysis, 0162373715616249.

Steinberg and Garrett used data from the MET study to explore the extent to which the class a teacher is timetabled to teach might influence observation measures of that teacher’s performance. They review a number of previous studies in this area, relating other factors which appear to influence the outcome of observation ratings. For example, observation scores tend to be lower for teachers whose students come from more disadvantaged backgrounds. They also note the problem that teachers are not randomly assigned to teaching groups in schools – and that often inexperienced teachers are allocated to more disadvantaged students, while more experienced teachers tend to work with higher achieving students.

To examine whether the prior attainment of students influenced observation ratings, they used the data from the MET study. The MET study was carried out over 2 years and across six districts in the US. One of the advantages of the MET data was the fact that the project randomised the allocation of teachers to classes prior to the second year of the study. They used this random allocation of 834 teachers to classes (Grades 4-9) to generate estimates of the effect of prior achievement on measured teacher performance for that second year. Their conclusion suggests that teachers of lower-ability groups may be unfairly rated as relatively ineffective, even when very the strict observation protocols involving considerable training are used:

“In this article, we find that the incoming achievement of a teacher’s students significantly and substantively influences observation-based measures of teacher performance. Indeed, teachers working with higher achieving students tend to receive higher performance ratings, above and beyond that which might be attributable to aspects of teacher quality that are fixed over time.” Page 20

Interestingly, the study found that the influence of prior attainment was greater for teachers of ELA (English Language Arts) than for maths teachers, and for subject-specialist teachers (common in secondary in the UK) compared to generalist teachers (more like primary). Another interesting finding was that prior attainment appeared to particularly influence measures of teaching related to ‘classroom climate’ – suggesting that observers of teachers of higher-performing students may be judged better at behaviour management than they actually are.

This study has significant implications for schools which use high-stakes (let alone graded) observations as the basis for appraisal. If a teacher’s effectiveness is, in part, determined by which groups they are allocated to teach then withholding a pay rise or placing a teacher on capability based on observations of teaching becomes potentially unmerited and inequitable.

How can teachers know their impact?

Observations of teaching can (and I’d say should) provide teachers with useful feedback they can use to develop their professional practice – but if observations lack validity, then they won’t help provide useful formative feedback (let alone summative judgement). Once again, Rob Coe has some suggestions about how schools might approach observations:

There’s a great video of Rob Coe presenting some of the problems and possible ways forward at Teach First: What is the future of lesson observation in our schools? (Part 1) (Jan 2014)

  • Stop assuming that untrained observers can either make valid judgements or provide feedback that improves anything
  • Apply a critical research standard and the best existing knowledge to the process of developing, implementing and validating observation protocols
  • Ensure that good evidence supports any uses or interpretations we make for observations. It follows that appropriate caveats around the limits of such uses should be clearly stated and the use should not go beyond what is justified
  • Undertake robustly evaluated research to investigate how feedback from lesson observation might be used to improve teaching quality (EEF already has one such study underway).

Other than observations, value-added data and student survey feedback might be used to help provide teachers with more valid feedback on their teaching.

The MET study, for example, found that VA data reasonably correlates with a teacher’s long-term success. However, VA data tends to come too infrequently (and too late) in a school cycle to help identify where things might be going well or need to improve. It also doesn’t provide ‘fine-grain’ detail – i.e. it can tell you that students did well, but can’t really tell you what it was the teacher was doing well, or what they should be doing to improve. There are also some other issues with VA scores – for example, one study tested VA modelling techniques by see what effect teachers had on their students’ heights. In their analysis, they found that teachers appeared to influence the height of their students almost as much as English and maths scores.

MET predictors of success

Testing Teachers: What works best for teacher evaluation and appraisal

Student surveys are another method used by the MET. I’ve used these within a coaching context to help teachers identify areas they might work on – and used follow-up surveys to see whether students felt the changes had any impact. You can read a bit about this here: Investigating teaching using a student survey

At the last though, the problem for teachers is that high-stakes judgements use any form of measure which lacks the reliability and validity to form a reasonable basis for such judgement. I suspect part of the issue has been the impression that school leaders needed to have such observation data to support their judgements of the quality of teaching in their schools to Ofsted. One of the proposals in the recent white paper (Educational Excellence Everywhere was to remove the separate Ofsted judgements for Teaching and Learning from future inspections. On the basis of the evidence this seems like a very good idea indeed!

Perhaps once out of Ofsted’s shadow, schools will be able to think about how to use observations much more constructively – perhaps as a coaching tool to help teachers improve their impact rather than a sword of Damocles to hang over their heads three times a year.

Posted in Education policy | Tagged , , , , | 7 Comments

Attachment Theory: Why teachers shouldn’t get too excited about it.

John Bowlby: Attachment theory

The British psychologist John Bowlby is fairly synonymous with attachment theory. From his clinical work with ‘juvenile delinquents’ over the course of World War II be began formulating ideas about the role of early and prolonged separation from parents and caregivers in the development of problems in those children’s social and emotional development.

The core of his theory is that attachment is an evolutionary adaptation which is characterised by a child seeking proximity to caregiver when that child perceives a threat or suffers discomfort. Given the intense needs of human infants, it is perhaps unsurprising that the formation of a “deep and enduring emotional bond that connects one person to another across time and space” evolved to improve the chances of an infant’s survival.

Over the first year of life, an infant begins to develop attachments with parents or carers. As these attachments form we tend to see characteristic behaviour in infant interactions with their attachment figure:

  • Stranger Anxiety – the infant responds with fear or distress to arrival of a stranger.
  • Separation Anxiety – when separated from parent or carer the infant shows distress and upon that attachment figure’s return a degree of proximity seeking for comfort.
  • Social Referencing – the infant looks at the parent or carer to see how they respond to something novel in the environment. The infant looks at the facial expressions of the parent or carer (e.g. smiling or fearful) which influences how they behave in an uncertain situation.

Attachment figures aren’t simply individuals who spend a lot of time with the infant, or the one who feeds them, but typically the individuals who responds the most sensitively, for example often playing and communicating with the infant. For many infants the principal attachment figure is their mother, but fathers, grandparents or siblings may also fulfil this role. By about 18 months, most infants enjoy multiple attachments though these may be somewhat hierarchical with a primary attachment figure of particular importance. The behaviour relating to attachment develops over early childhood, for example babies tend to cry because of fear or pain, whereas by about two-years-old they may cry to beckon their caregiver (and cry louder or shout if that doesn’t work!).

Bowlby believed these early experiences of attachment formed an internal ‘working model’ which the child used to form relationships with secondary attachment figures, later friendships with peers and eventually romantic and parenting relationships in adult life.

Mary Ainsworth: Types of attachment

There are individual differences in the behaviour related to attachment. Famous observation studies by Mary Ainsworth (who worked with John Bowlby during the 1950s) identified that in normal children there were a range of attachment types:

Secure attachment: The majority of infants, across different cultures, tend to have an attachment style typified by strong stranger and separation anxiety along with enthusiastic proximity seeking with the parent upon reunion.

Insecure –avoidant: Slightly more common in western cultures, an insecure-avoidant attachment tends to be characterised by avoiding or ignoring the caregiver and showing little emotion (whilst experiencing inward anxiety) when the caregiver leaves the room, and displaying little enthusiasm when the caregiver returns.

Insecure-resistant: Perhaps more common in ‘collectivist cultures’, an insecure-resistant (sometimes also called insecure-ambivalent) attachment tends to be characterised as showing intense distress during separation, and being difficult to comfort when the caregiver returns. Infants with this attachment type may also show some rejection or resentment towards the caregiver after a separation.

Disorganised attachment: Added in the 1990s, infants with a disorganised attachment tend to show no consistent pattern in behaviour towards their caregiver. For example, they may show intense proximity seeking behaviour one moment, then avoid or ignore the caregiver the next.

If you you’re interested in some of the history and the origins of attachment theory, the work of John Bowlby and Mary Ainsworth are good places to start. There’s a nice summary here – Bretherton, I. (1992). The origins of attachment theory: John Bowlby and Mary Ainsworth. Developmental psychology, 28(5), 759.

Many children may display behaviour suggesting an ‘insecure’ attachment type which may make it a harder to form peer friendships, and this likely underlies an association between insecure and disorganised attachment and higher levels of behaviour problems. However, it’s not certain that differences in attachment are specifically the cause of behaviour problems.  For example, a meta-analysis by Fearnon, et al (2010) found that socio-economic status accounted for a considerable portion of the variance in behaviour problems in childhood.

Fearon, R. P., Bakermans‐Kranenburg, M. J., Van IJzendoorn, M. H., Lapsley, A. M., & Roisman, G. I. (2010). The significance of insecure attachment and disorganization in the development of children’s externalizing behavior: a meta‐analytic study. Child development, 81(2), 435-456.

So, whilst there’s reasonable evidence to suggest that these individual differences in attachment correlate to differences in behaviour within school, it is very important to note that these differences are not ‘pathological’ in a clinical sense. Given that about 30-35% of representative populations have an ‘insecure’ attachment, NICE suggests that it is unhelpful to view insecure attachment as an ‘attachment problem’.

Reactive Attachment Disorder

A popular misconception about attachment is a conflation between the ‘types of attachment’ that children possess and an ‘attachment disorder’. CoramBAAF, a leading charity working within adoption and fostering, suggests that even when used by those trained to do so, attachment classifications cannot be equated with a clinical diagnosis of disorder. While the insecure patterns may indicate a risk factor in a child’s development, they do not by themselves identify disorders. The term ‘attachment disorder’ refers to a highly atypical set of behaviours indicative of children who experience extreme difficulty in forming close attachments. NICE suggests that the prevalence of attachment disorders in the general population is not well established, but is likely to be low. However there are substantially higher rates among young children raised in institutional care or who have been exposed to abuse or neglect. The Office for National Statistics (2002) report for the Department of Health estimated that somewhere between 2.5% to 20% of looked after children had an attachment disorder (depending on whether a broad or narrow definition was used).

There is a broad distinction between two classifications of RAD:

Inhibited attachment disorders: Characterised by significant difficulties with social interactions such as extremely detached or withdrawn – usually attributed to early and severe abuse from ‘attachment figures’ such as parents.

Disinhibited attachment disorders: Characterised by diffuse attachments, as shown by indiscriminate familiarity and affection without the usual selectivity in choice of attachment figures – often attributed to frequent changes of caregiver in the early years.

Reactive Attachment Disorder is a psychiatric condition and often accompanied by other psychiatric disorders. CoramBAAF argues that the lack of clarity about the use of attachment concepts in describing children’s relationship difficulties can create confusion and advises extreme caution. A diagnosis of an attachment disorder can only be undertaken by a psychiatrist.

Unfortunately, there are also no widely applicable, evidence-based set of therapies for RAD. However, there has been significant concern expressed about some therapies. One example is “Holding therapy” involving holding a child in a position which prevents escape whilst engaging in an intense physical and emotional confrontation. CoramBAAF argues there is nothing in attachment theory to suggest that holding therapy is either justifiable or effective for the treatment of attachment disorders. Less controversial therapies involve counselling to address the issues that are affecting the carer’s relationship with the child and teaching parenting skills to help develop attachment.

What should teachers be doing?

This is why I don’t really understand all the apparent excitement about attachment theory at the moment: there’s nothing a teacher should be doing that they shouldn’t already be doing.

Firstly, given the relationship between attachment disorders and abusive or neglectful relationships, perhaps some teachers are worried that they need to know about attachment disorder in order to fulfil their statutory safeguarding responsibilities. However, it’s important to note that whilst some children with RAD have suffered abuse or neglect, that doesn’t mean that problematic behaviour is evidence of such. The teacher isn’t in a position to make either the clinical judgement or investigate the cause of problematic behaviour they suspect may relate to a safeguarding concern. If a student is behaving in a way which concerns you, then report that concerns to your designated member of SLT (as you would any safeguarding concern). Whether or not you might think a child has an insecure attachment or a disordered attachment isn’t really your professional call.

Secondly, it may be that some teachers feel they need to know more about attachment in order to support students with behaviour problems in school. However, the advice for working with RAD students isn’t really any different to good behaviour management generally. Teachers should not confuse their role in loco parentis with being the primary caregiver for a child. For example, the Center for Family Development is an attachment centre based in New York specializing in the treatment of adopted and foster families with trauma and attachment disorder. In their ‘Overview of Reactive Attachment Disorder for Teachers’ they point out that, as a teacher, you are not the primary caregiver for a child you teach.

“You cannot parent this child. You are the child’s teacher, not therapist, nor parent. Teachers are left behind each year, its normal.  These children need to learn that lesson.”

They recommend approaching behaviour through explicit teaching of consequences: that there’s a consequence associated with good behaviour and there’s a consequence for poor behaviour.

Further suggestions include:

  • Creating a structured environment with extremely consistent rules
  • Being consistent and specific when giving praise or confronting poor behaviour
  • Providing the child with choices, but choices provided by you, the teacher.
  • Maintaining your professional boundaries (avoid attempting to create ‘friendship’ or ‘intimacy’ with the child).
  • Keep your calm and avoid losing your temper; communicate directly, positively, and firmly.
  • When implementing consequences, remain unemotional and assume a tone that says, effectively, “That’s just the way business is done – nothing personal.”

In short, there’s nothing that teachers shouldn’t do when working with any student with challenging behaviour. Whether the challenging behaviour is due to an issue with attachment isn’t really the issue.

In Summary

Whilst there’s a relationship between insecure attachment and behaviour problems in the classroom, teachers are not qualified to ‘diagnose’ a student’s attachment type nor engage in any kind of ‘therapy’ with that student. There is a condition called ‘Reactive Attachment Disorder’ which has a higher incidence within ‘looked after’ students. Again, teachers are not qualified to make this psychiatric diagnosis.

There is an important difference between the professional role of a teacher and the role of a primary caregiver, and it’s vital that recent interest in attachment theory within the profession doesn’t blur that line. Where teachers are concerned that behaviour presented in the classroom might indicate abuse or neglect, then they are already obliged by law to report these concerns (but not investigate them or try to involve themselves in resolving them).

In terms of managing the behaviour of students with attachment problems, so that they can overcome the difficulties of their family background and experience success within school, the guidance suggests things like a structured environment, consistent rules, professional distance and focusing feedback on behaviour not the child: Advice that forms the basis of good behaviour management regardless of the cause of problematic behaviour.

It may be the case that specific children with RAD will have different strategies which will help them achieve in school. However, that’s also the case for any student with SEND. Perhaps what is important for teachers is not specific ‘training’ in attachment theory to help them ‘diagnose’ attachments, but a clear understanding of their school’s SEND system and time to read, implement and work with SEND coordinators to ensure any specific strategies suggested by an educational psychologist or child psychiatrist are employed effectively.

Posted in Psychology for teachers | Tagged , | 21 Comments

Germane load: The right kind of mental effort?

Despite our vast capacity to hold information in long term memory; our working memory is extremely limited and becomes overloaded very easily. Greater insight into these problems and some practical ideas about what to do about them comes from the research of John Sweller. Sweller was interested in how teachers could structure their lessons in order to minimise this problem of overload. From the results of numerous experiments, he developed Cognitive Load Theory (CLT) which explains how teachers might manage the ‘load’ they place on working memory and help students learn more readily. The theory divides up the different kinds of loading on working memory:

Intrinsic load represents the inherent difficulty of the material and is related to their levels of element interactivity. This is limited to between 3-5 items. There’s not much we can do about this as teachers (multiplying 5×8 will always be easier than 5x8x3). However, for some materials, it may be possible break material up into simpler sub-components which can be tackled separately at first and recombined later.

Not all material is equally intrinsically difficult. Where materials are related to what David Geary calls ‘biologically primary knowledge’ the load on working memory appears to be greatly reduced. Our brains are adapted to solve complex problems related to survival and reproduction (e.g. reasoning tasks related to social cheating are much easier than formal syllogistic logic).

Another way we can ‘cheat’ working memory limitations is by exploiting the fact that visual and auditory information can be processed simultaneously without creating additional load. For example, Sweller, Van Merrienboer and Paas (1998) report that where material has high intrinsic load, using visual/audio presentations was far more effective than where text and explanation (which both require verbal processing) was used.

Intrinsic load is also reduced where individuals have a strong background of prior knowledge. Familiar information is said to be organised in our long-term memory as a schema – essentially allowing us to work with a sizeable ‘chunk’ of information as if it were one item. By having automatic access to these schemas, it allows us to overcome something of the limitations of our working memory. This is why, for instance, many people argue for the memorisation of multiplication tables. For example, if the student doesn’t have to mentally calculate 5×8 this will reduce the load on working memory and they will find 5x8x3 easier to ‘hold in mind’.

Extraneous load is the load generated by the way that material is presented to the learner. For example, Kirschner, Sweller and Clark (2006) suggest that where the intrinsic load of material is high, presenting new material through minimally-guided activities like problem-solving creates an additional, unhelpful load on working memory. One of the issues is that when faced with a novel problem, students tend to use a ‘processing intensive’ general strategy called means-end analysis in order to find a solution. Sweller, Van Merrienboer and Paas (1998) suggest that ‘goal-free’ problems can avoid this issue by forcing the student to rely upon strategies other than the load-intensive means-end approach. A second strategy to overcome means-end searches discussed in that paper is the use of worked examples as a substitute for solving problems.

Another further source of extraneous load is attention switching. For example, Mousavi, Low, and Sweller (1995) suggest that rather than having labels alongside a diagram – which requires the student to switch attention between the text and the visual image – placing the labels at appropriate locations on the diagram can dramatically facilitate learning. In essence, we should seek to minimise extraneous cognitive load in order to best facilitate learning.

Just taking these two types of cognitive load, the implication might appear to be that eliminating extraneous load and organising instruction so that sub-components of a complex task are automated would be sufficient for the learning of new material.

However, Sweller, Van Merrienboer and Paas (1998) reported that encouraging learners to engage in conscious cognitive processing that is directly relevant to the construction of schemas benefits learning. For example, varying the conditions of practice appears to have beneficial effects upon learning, despite the fact that the presence of that variety would raise the loading on working memory. They called this germane cognitive load.

Germane cognitive load

Van Merrienboer, Kester and Paas (2006) suggest that whilst load reducing methods, such as low variability and explicit guidance and feedback, are effective in producing high retention of the material – that these techniques hinder the transfer of learning. They argue that there is a need to vary the conditions of practice and only give limited guidance and feedback in order to induce germane cognitive load and improve transfer.

It’s tempting to connect this to Robert Bjork’s ideas about ‘desirable difficulties’. Bjork makes the argument that things that make learning ‘easy’ during instruction do not always lead to long-term learning. He argues that by creating conditions which are difficult and appear to impede immediate performance lead to greater long-term retention and better transfer. David Didau summarises the idea like this:

“I love Bjork’s coining, ‘desirable difficulties’ because it gets to the very heart of the counter intuitive nature of learning. It turns out that making it more difficult for students to learn means that they actually learn more!”

There are lots of examples across in psychology where introducing additional difficulty appears to facilitate learning. For example, it has been shown that making font more difficult for the learner to study improves memory performance. Solving anagrams involves more effort than simply copying words, but this additional effort appears to facilitate recall (for easy anagrams at least).

It may seem that there isn’t a problem. Perhaps, Bjork’s desirable difficulties are merely examples of germane cognitive load. However, it does create an issue for the theory – mainly because there’s no easy way to experimentally measure each type of load and this risks making the theory impossible to falsify. Debue and van de Leemput (2014) explain the problem:

“In the absence of reliable measurements for each load, the CLT cannot ever be refuted because it is always possible to attribute variation in the overall cognitive load to a source that corroborates the initial assumptions. For example, assuming that the overall load is kept constant, a decrease in performance will be attributed to a rise in extraneous load that impairs germane cognitive processes. Conversely, if the performance increases it will be attributed to a germane load enhancement made possible by a drop in extraneous load.”

Perhaps the solution is simply to get rid of the notion of germane load. However, it seems that cognitive load theory needs some sort of component which represents the fact that some kinds of mental effort lead to improved long term memory for material. However, unless there’s a way to measure this (and I suggest self-report measure are unlikely to convince critics of the theory), it risks making the theory effectively unfalsifiable.

What is the right sort of mental effort?

The relationship between some sort of mental effort and learning isn’t terribly controversial. For example, most readers of this blog will recognise the quotes below:

Memory is the residue of thought           Dan Willingham

Learning happens when people have to think hard         Rob Coe

But what does thinking or ‘thinking hard’ mean? Is it just the quantity of thinking or some aspect of the quality of thinking which leads to learning?

Well, one way of thinking about the ‘right kind of thinking’ might be to borrow the concept of ‘depth of processing’ first posited by Craik and Tulving (1972). I describe a bit about their ideas in more detail here. In brief, they suggested that mental effort might comprise of more shallow or deeper processing.

“For example, in shallow processing, the subject answered questions concerning the word’s typeface (for example, is the word “HOUSE” written in capital letters?); in intermediate processing, the subject answered questions about rhyme (for example, does the word “house” rhyme with “pencil”?); and in deep processing, the questions were directed toward the word’s semantic content (for example, does the word “house” fit into this sentence: “The ______ has a beautiful window”?).”

They suggested that retention in long-term memory depends on the depth to which new information is analysed. However, they argued that the system stops processing the information once the analysis relevant to the task has been carried out, so if a task merely requires shallow processing of the material then deeper processing will not occur.

A simple way to illustrate this is consider the difference between two fairly common classroom activities – the word search and the crossword – when familiarising students with new terminology. Word search puzzles are a great example of ‘structural processing’ – they can be completed with no understanding of the key words but simply pattern matching the first few letters. Although such an activity might require mental effort (e.g. some of the words are presented as anagrams or are arranged diagonally or backwards in the grid) it’s not the right sort of mental effort for effective learning. A better ‘quiz’ type activity might be to use a crossword – perhaps with the definitions of words as the clues – as at least any mental effort expended will lead students to attend to the deeper, semantic properties of the key terms.

This may help explain why the ‘testing effect’ is a more effective method of encouraging reliable recall than restudying. Testing encourages ‘semantic searches’ in order to retrieve information from long-term memory and that sort of mental effort facilitates future attempts. It’s interesting to note that the testing effect disappears where there is no mental effort involved in the retrieval. A recent study by Endres and Renkl (2015)examined the testing effect under a range of conditions and concluded:

“Overall, our findings on mental effort and non-tested items support the elaborative retrieval hypothesis, including the interpretation of mental effort as an indicator of semantic elaboration.” …

“Our results suggest that testing tasks should be used that require learners to invest substantial mental effort. A more difficult task leads to more elaboration as long as it can be solved (more or less) successfully.”

We can also relate this to the benefits of spacing – the ‘spacing effect’ – where practice is spread over time rather than condensed over a short period. We’ve known since Ebbinghaus that information is lost fairly rapidly from memory – but that reviewing the material periodically (e.g. through a quiz) leads to better recall over time.  It seems plausible that the period of delay increases the semantic focused mental effort required to retrieve the information, whereas immediate testing when the information is freshly retained is too effortless to promote much learning.

This might also help explain why varying the conditions of practice (arguably the key component of germane load), whilst more difficult in the short-term tends lead to better long-term recall. When talking about the problems associated with assessment rubrics, Greg Ashman makes the point that students focus exclusively on the elements required by the rubric and ignore the deeper structure related to the problem.

Greg Ashman Rubrics

Source of image

Again, might this be argued to be a problem related to shallow processing. By varying the conditions of practice, the student is encouraged to engage in deeper semantic processing rather than rely upon fairly superficial automatic recall.

However, does recasting germane load as mental effort related to semantic processing solve the problem of measurement? Well, not yet – but as brain imaging becomes cheaper and more available to psychologists, it’s a possibility. There are certainly studies looking for these neural correlates – for example, Otten, Henson and Rugg (2001) report the results of an fMRI study examining the neural correlates of memory encoding.

15 volunteers were presented with a series of 280 words and (depending on a pre-stimulus cue) had to make a decision based on either a semantic process (was it alive) or a non-semantic process (position in the alphabet). Afterwards they were presented with a recognition task, where they had to pick out the words they had seen (mixed in with 140 others they had not). They found there was anatomical overlap in the fMRI scans for semantically and non-semantically processed items, but the non-semantic items appeared to activate a sub-set of the semantically processed ones. They conclude:

“The overlap between regions activated by the depth of processing and deep subsequent memory effects implies the existence of cognitive operations that are engaged differentially both by semantic versus non-semantic processing and by effective versus less effective episodic encoding in a semantic task.”

It’s a small scale study – like many involving neuroimaging – but might it provide a possible way to eventually anchor a concept of germane load by relating it to semantic processing?

Posted in Psychology for teachers | Tagged , , , , | 13 Comments

Goodbye Mr Chips: can research tell teachers how to teach?

Back in October, I took part in a debate at the Battle of Ideas.

Hosted by Kevin Rooney and featuring Professor Frank Furedi, Jack Marwood and Munira Mirza, the discussion focused on the relevance of research to classroom practice.

The video of that session is available here: WORLDBytes

Details of the session are here: Battle for the Classroom


Posted in Philosophy of education | Tagged | Leave a comment

Psychology of behaviour management (part 3)

In the last posts, I briefly examined some of the key ideas and limitations of offering rewards and sanctions, and restorative approaches. Both of these tackle the issue of behaviour at an individual level; in this post I want to examine group-level strategies which utilise our propensity conform to social norms.

Social norms

Humans are social animals and benefit enormously from shared resources and protection, and the ability to engage in acts of reciprocal altruism with reduced risks of exploitation within social groups. Conversely, exclusion from a group tends to have a highly detrimental effect on an individual’s capacity to survive and reproduce. Therefore, humans have evolved a complex range of strategies for maintaining our membership and status within social groups.

One approach to behaviour in schools is encouraging adherence to social norms. Social norms are the (often unwritten) rules about how we behave in social context. One of the functions of social norms is to distinguish who is part of our group and who is an outsider. Behaving in accordance to the norms of our group, especially when there is a ‘cost’ attached, signals our membership of that group. Breaking social norms carries with it a risk of exclusion from the group.

It’s hard to see how society could function at all if we didn’t conform to some fairly predictable set of rules about how we behave. Some of these norms become enshrined as formal laws, like driving on the left in the UK. However, many involve unspoken arrangements, merely triggering disapproval from others if we break them, e.g. the rules of queuing, or saying ‘please’ and ‘thank you’. Like all cultural institutions, schools possess social norms regarding the behaviour of students. Some of these are explicitly communicated through ‘school rules’, but many are based on the unspoken expectations of the teachers and students who make up the school.

Normative influence

The ‘power’ of this desire to ‘fit in’ with a group was demonstrated by Solomon Asch in a famous series of experiments conducted in the 1950s. He asked groups of students to make a series of comparative judgement about the length of a line:

Asch lines

Source of image

However, only one member of the group was a genuine participant. What the participant didn’t know is that the other people in the group were actually ‘confederates’ of the experimenter, instructed to deliberately give wrong answers on certain critical trials. What Asch was investigating was the extent to which the participant would conform to the rest of the group by also giving the wrong answer. He found that 25% of participants would disregard the wrong answer given by the rest of the group and give the correct answer every time. However, 75% of the participants gave at least one wrong answer and 5% of the participants followed the group in giving the wrong answer on every occasion. For Asch, this demonstrated a strong human instinct to fit in, even with a group of strangers and when the task involved unambiguously wrong answers. There’s a short clip illustrating the procedure here.

Asch went on to use this experimental technique to examine the key variables which strengthen and weaken normative influence. He found that when participants could give their answers in private (by writing them down) they were less likely to conform to the group. He also found that the strength of normative influence was greatly diminished by a lack of unanimity; the presence of a ‘fellow dissenter’ making it much easier to act against the behaviour of the rest of the group.

Further insight into the factors which appear to underlie normative influence comes from the research of Robert Cialdini.  For example, Cialdini and Goldstein (2004) identify three major components to social influence; Accuracy, Affiliation and Maintaining a positive self-concept.

The Goal of Accuracy represents an individual’s motivation to be right thinking or possess the correct information when making a decision. They make the point that individuals often look to social norms to gain an accurate understanding of and effectively respond to social situations, especially during times of uncertainty. The example I use when teaching is when I went to a posh wedding in my youth and was confronted by more cutlery than I knew what to do with. I found myself immediately looking around at which knife, or fork, or spoon other people were using for each course.

The Goal of Affiliation represents an individual’s motivation to create and maintain good relationships with others. In essence we tend to adopt the behaviour of others so they will be more likely to like us. Quite superficial characteristics tend to trigger this kind of behaviour; for example physical attractiveness, perceived similarity (e.g. a shared birthday or the same name), ingratiation (e.g. remembering a person’s name or mild flattery – though it’s worth noting that whilst the target tends to develop more positive feelings towards the person, on-lookers tend not to), and reciprocation (the obligation to repay others for what we have received from them).

An interesting aside to this influence of bolstering affiliation through reciprocation is the ‘Franklin effect’. The Franklin effect exploits cognitive dissonance by getting someone who doesn’t like you to do a small favour for you. As a result, that person often develops more positive feelings towards us. A ‘top tip’ that exploits this might be to ask a challenging student to carry some books to another classroom for you, for instance.

The Goal of Maintaining a Positive Self-Concept represents our tendency to maintain our concept of self through behaving consistently with past “actions, statements, commitments, beliefs, and self-ascribed traits”. Where we have behaved in a particular way in the past, or expressed strong views about a situation, there is a motivation to behave in a way consistent to that in the future. Again, I suspect cognitive dissonance plays a strong role in what I sometimes describe when teaching as a ‘homeostasis of the self’. If we have done something a certain way for a long time, then we tend to believe that those behaviours were correct.

Applying normative influence

Psychologists have attempted to apply normative influence in order to promote pro-social behaviour. For example, Schultz et al (2008) used normative messages in order to encourage hotel guests to conserve energy. An example of one of these messages:

Schultz towels

Source of image

This study is interesting as it appears to show that merely trying to change attitudes (by providing information about the importance of energy conservation) appeared to have little effect on behaviour. The presence of a ‘normative message’ along with this information appeared to have a much stronger effect on the behaviour of guests.

Normative messages have also been used to try to reduce alcohol consumption amongst US students. For example, Borasi and Carey (2001) reviewed various social influence strategies used to encourage moderation in drinking and reported that in some cases normative messages about drinking led to reduced self-reported alcohol consumption. They suggest there are a range of cognitive factors related to perceived norms which can influence behaviour.

  • Descriptive and injunctive norms: “a student will match the drinking they perceive other students doing (descriptive norm) and approving of (injunctive norm)”
  • Pluralistic ignorance: ‘‘individuals assume that their own private attitudes are more conservative than are those of other students, even though their public behavior is identical’’
  • Attribution theory: “the student observes others drinking heavily, it is assumed that such excessive use is typical, resulting in elevated norms”

A combination of these processes leads to exaggerated norms for drinking, which then perpetuate themselves when new students observe others drinking heavily. This has led to researchers attempting to use messages based on descriptive and/or injunctive norms to try to correct this exaggerated view of acceptable drinking. In the review, Borasi and Carey point to a number of successful attempts to reduce self-reported alcohol consumption using descriptive and injunctive normative messages.

Applying normative influence in schools

To a great extent, schools have always tried to create social norms within their institutions to support a positive classroom climate. Either through explicit messages like ‘school rules’ or through implicit mechanisms like ‘ethos’ or ‘traditions’ – schools attempt to separate their institutions from the ‘mundane world’ outside their gates.

For example, Martin Robinson is one education writer who explicitly makes this observation. In Practise Teaching, Teaching Practice: Ritual for instance, Martin writes:

“Whether the atmosphere you create in your classroom is like that of a church where children worship at the altar of knowledge or nearer to that of a high powered office where children come to work efficiently on administrative tasks, the ritual of the classroom is something that is unique to your teaching and the children’s experience of studying with you.”

Long established schools, whether in the state or independent sector, are often remarkable for their extensive lineage of school traditions and small rites and practices which mark the ‘other worldliness’ of their institutions. Some private schools provide an almost ‘cloistered’ atmosphere (quite literally in some cases given their historical origins) which helps create the impression that you are entering a world that in some ways is very separate from everyday life. Schools use a wide variety of techniques to create a strong sense of social norms specific to their institution: School uniforms are perhaps the most common and most visible strategy.

In social learning theory, Albert Bandura  suggests that whilst we learn through vicarious reinforcement (e.g. observing others being rewarded and imitating that behaviour) we also form a set of ‘mental representations’ of acceptable behaviour specific to a social environment which regulates how we act. It seems likely that these traditions, small rituals and changes in dress all act as cues which facilitate behaving within a set of pro-social normative influences within the environment of the school.

One of the difficulties for many schools is how to create this strong sense of pro-social norms within the institution so that anti-social behaviours (e.g. bullying) are not imitated. Indeed, it’s possible that low-level disruption in lessons are similarly occasions where that set of desirable norms have failed to inhibit unhelpful behaviour. There’s not much empirical evidence looking specifically at this question, but there is some support from a recent study of a successful anti-bullying programme:

Paluck, Shepherd and Aronow (2015) relate a study which attempted to test the idea that children attend to the behaviour of their peers to build a sense of what is socially normative and modify their own behaviour in response. They randomly allocated an anti-conflict intervention across 56 schools with 24,191 students – but what’s really interesting is that they measured every school’s social network, before randomly selected ‘seed groups’ of students and assigning them to an intervention that encouraged a public stance against conflict at school. They found that treatment schools reported fewer disciplinary problems compared to the control group. Furthermore the effect was stronger where these ‘seed groups’ contained more socially connected students.

They concluded that students pay particular attention to the behaviour of certain individuals in their community, as they infer which behaviours are socially normative and adjust their own behaviours accordingly. This offers some interesting ways forward with research examining how behavioural climates are produced and changed.

Classroom routines

Another example, I propose, of where normative influence has been exploited to improve behaviour comes from Doug Lemov’s observations of effective teachers. In ‘Teach like a Champion’, Lemov identifies a set of classroom routines which, he suggests, work together to create a positive classroom culture.

To me, the genius of this is that rather than try to promote a positive culture through psychological or social manipulation of attitudes or beliefs (c.f. Growth Mindset), Lemov focuses on creating a strong set of social norms based on simple, visible behaviour routines. Schools often try to sell education through trying to change attitudes, for example inspirational talks or aspirational values, but whilst these messages may be effective for some students, many will merely ‘talk the talk’ rather than ‘walk the walk’. By encouraging a uniform set of simple behavioural rituals, I suspect cognitive dissonance does the rest – ‘If I SLANT in a lesson, it’s because learning is really important to me’.

The success of Lemov’s system probably stems from its simplicity and uniformity. However, therein also lays the controversy. For some teachers, it denies practitioners the chance to discover effective systems for themselves which reflect their unique personality and approach to practice. For others, the concern is that the uniformity of behaviour threatens to suppress behaviours vital to normal mental and physical development. For example, from Sue Cowley:

“But it is what I can’t see that really worries me, because these are children. Where is the choice, the fun, the flitting, the wriggling, the laughter, the joy, the sensitivity, the nuance, the playful interactions, the movement, the gradually developing self-regulation?”

Personally, I find it difficult to believe that even very uniform behavioural expectations would have a negative impact on children – after all, school forms only part of a child’s life and there are many opportunities in everyday life to wriggle and muck about like children. Proponents of these systems might also reasonably argue that the purpose is not to suppress creative or imaginative teaching – but to allow teachers to focus that creativity on their actual teaching rather than battling for control of the classroom.

It seems quite likely that using uniform behavioural routines will promote a strong normative influence to support a positive classroom culture. However, I do think there are interesting questions arising from this debate – is it using a sledgehammer to crack a nut? Some empirical questions for me are:

  • Are some of these routines doing more ‘work’ than others?
  • Are all of these routines strictly necessary?
  • Is the degree of uniformity, whilst clearly effective, necessary?
  • Are there effective (perhaps even more effective) alternative routines to the ones Lemov suggests?

Teasing out what it is about these routines which make them effective is an important research task, in my opinion.

Posted in Psychology for teachers | Tagged , , , , , , , | 12 Comments

The psychology of behaviour management (part 2)

A frequent observation in schools is that the same children tend to end up in detention over and over again. The belief that ‘punitive’ approaches to school discipline were proving ineffective or even counter-productive has led to an interest in ‘restorative’ practice approaches. These approaches appear strongly influenced by ‘positive psychology’ and frequently also import ideas from a variety of therapeutic disciplines like cognitive behavioural therapy (CBT).

Part 2: Restorative practice approaches

The roots of this behaviour management strategy are ‘restorative justice’ programmes arising from criminology. Difficult to define and frequently implemented under a variety of different names, restorative justice is sometimes typified as a compromise position in the ‘rehabilitation vs retribution’ debate. A meta-analysis by Latimer, Dowden and Muise (2005) offered the following definition:

“Restorative justice is a process whereby all the parties with a stake in a particular offence come together to resolve collectively how to deal with the aftermath of the offence and its implications for the future”

The focus of these approaches is to repair the harm caused by the criminal act, so that the victim and the offender have an opportunity to discuss the event and decide appropriate reparations for the offence. In the meta-analysis, the authors find that victim and offender satisfaction tends to be higher using this approach than when using the traditional justice system, and offenders more likely to complete restitution agreements and less likely to reoffend.

The reported success of these programmes led to similar systems, often influenced by therapeutic models, being imported into schools. Once again, the principles behind ‘restorative practice’ are difficult to define and operate under a wide variety of names, but are often typified as a compromise position between authoritarian and laissez-faire disciplinary systems.

Social discipline window

Source of image: based on McCold and Wachtel (2003)

The International Institute for Restorative Practices offers the following as a ‘unifying hypothesis’ of restorative practices:

“human beings are happier, more cooperative and productive, and more likely to make positive changes in their behavior when those in positions of authority do things with them, rather than to them or for them.”

Positive psychology

Positive psychology arose out of the ‘Humanistic approach’ developed by psychologists like Abraham Maslow and Carl Rogers who developed theories around human happiness and helping people to thrive or reach their potential. Positive psychology was a term probably coined by Maslow, but has become strongly associated with the work of Martin Seligman – its philosophy essentially the same, to understand the nature of human happiness and well-being.

Applied within education, this approach tends to focus upon how schools can promote positive emotions and relationships, engagement and a meaningful sense of purpose, and positive goals leading to accomplishment. Seligman suggests these form five distinct elements – summarised by the acronym PERMA:

  • P Positive Emotion
  • E Engagement
  • R Positive Relationships
  • M Meaning and Purpose
  • A Accomplishment

We see the influence of positive psychology in all sorts of areas of education: For example, the idea of ‘teaching for happiness’ or ‘teaching mindfulness’ and many of the ideas underpinning ‘character education’.  There appears to be a clear influence of positive psychology in opposition to more behaviourist ideas within restorative practices applied within schools. For example, Hendry, Hopkins and Steele summarise the differences in Restorative Approaches in Schools in the UK:

Restorative approach vs authoritarian approach

The goals are identified as developing positive relationships between the teacher and student; encouraging empathy and creating a sense of safety and trust where both parties can express their thoughts, feelings and needs; encouraging self-actualisation and optimistic beliefs about personal development; and supporting individual and shared responsibility. The main empirical claim appears to be:

“Schools that consciously focus the bulk of their effort on building and maintaining relationships will find that fewer things will go wrong and so there will be fewer occasions when relationships need to be repaired.”

However, within academic psychology the ‘positive psychology’ approach has faced significant criticism. For example, the abstract for Alistair Miller (2008) paper “A Critique of Positive Psychology— or ‘The New Science of Happiness’” summarises many of the problems:

“This paper argues that the new science of positive psychology is founded on a whole series of fallacious arguments; these involve circular reasoning, tautology, failure to clearly define or properly apply terms, the identification of causal relations where none exist, and unjustified generalisation. Instead of demonstrating that positive attitudes explain achievement, success, well-being and happiness, positive psychology merely associates mental health with a particular personality type: a cheerful, outgoing, goal-driven, status-seeking extravert.”

Cognitive-behavioural therapy

An alternative psychological foundation for restorative practice has been cognitive behavioural therapy (CBT) often combined with elements of other therapeutic programmes (e.g. Solution-Focused Therapy).

The focus in CBT is to identify and change patterns of thinking or beliefs which underlie behaviours which are unhelpful to the individual. It’s often typified as a problem-solving therapeutic approach – finding ways to better cope with ‘here-and-now’ practical problems (rather than say childhood experiences).

Albert Ellis developed some of the core principles involved in CBT back in the 1950’s and 60’s. Rational Emotive Behavioural Therapy (REBT) emphasises the role of ‘faulty thinking’ (an individual’s interpretation or view of an event or situation) which gives rise to emotional distress and subsequent unhelpful behaviours (e.g. avoidance coping). This makes some intuitive sense to many teachers. For example, a student faced with an impending exam may believe that they will fail regardless of what they do, so they find ways to distract themselves from this anxiety (e.g. procrastination) and fail to revise for the exam.

In restorative practice, these elements of CBT tend to involve encouraging the student to relate their offending behaviours to the thoughts and feelings which caused them. By exploring alternatives to the way the student interpreted an event and emotionally reacted to it, the idea is that the student finds better ways to respond to these events in future. For example, Writing Wrongs is a restorative approach for use in schools which explicitly draws upon ideas based on CBT to encourage students to reflect upon the causes and consequences of their behaviour.

Whilst optimistic claims were initially made for the efficacy of CBT as a treatment for mental illness, much of the empirical evidence supporting these has come into question – not least because, along with other forms of psychotherapy, there’s no easy way to create a ‘double-blind’ arrangement within randomly controlled trials and this means that results may be influenced by bias. A recent meta-analysis suggests that effect sizes for CBT outcomes has been steadily declining since the 1970s, implying that sources of bias may have given a distorted view of its efficacy.

Do restorative approaches work?

It is almost impossible to give an empirical answer to this question. Case studies appear to provide very positive evaluations for programmes. For example, Littlechild and Sender (2010) found evidence from interviews that students and staff at four residential homes for young people with developmental and physical disabilities gave very positive evaluations of restorative justice.  However, data from police call-outs was more mixed. They note that one unit had an increase in call-outs and caution that the decrease in call-outs at the three other units was not necessarily due to the introduction of restorative justice.

One area where more systematic evidence is available is the success of anti-bullying programmes, many of which use restorative justice principles. For example, restorative approaches are commonly used in conjunction with sanctions within secondary schools to tackle bullying. A report for the DFE “The Use and Effectiveness of Anti-Bullying Strategies in Schools” (Thompson and Smith, 2011) examined the range of practices used in schools and attempted some evaluation of their effectiveness. They broadly defined restorative approaches as:

“Restorative approaches work to resolve conflict and repair harm. They encourage those who have caused harm to acknowledge the impact of what they have done and give them an opportunity to make reparation. They offer those who have suffered harm the opportunity to have their harm or loss acknowledged and amends made.”

They found that over two-thirds of schools used some form of restorative practice in tackling bullying and that these approaches were recommended by the majority of local authorities above the use of sanctions. The survey reported that 97% of both primary and secondary schools rated restorative approaches as effective in reducing bullying, with high proportions of both school types rating them as cost effective and easy to implement. Small group discussions (circles) were the most common approach in primary schools (96%) whereas some form of restorative discussion was the most common in secondary schools (90%).

So, these kinds of anti-bullying programmes are popular and perceived to be effective. Beyond case studies, however, is there much evidence to support their adoption in schools? The fact that restorative programmes tend to be mixed in with sanctions makes it difficult to pick apart whether these programmes are effective as practised in schools. Historically, the evidence supporting the general effectiveness of anti-bullying programmes is mixed. For example, a meta-analysis by Ferguson et al (2007) examined the effectiveness of school-based anti-bullying programmes. One issue they report with the available research was publication bias (sometimes called the ‘file draw effect’) where studies which obtain some statistical significance are more likely to be published than studies which are non-significant.  Thus, while the meta-analysis yielded an overall ‘significant effect’, the very small overall effect sizes led them to conclude that “school-based anti-bullying programs are not practically effective in reducing bullying or violent behaviors in the schools”.

More positive outcomes were reported in a meta-analysis by Ttofi and Farrington (2011). They suggested that significant reductions in bullying tended to be associated with more intensive programs, programs including parent meetings, firm disciplinary methods, and improved playground supervision. However, work with peers (including things like peer mediation, peer mentoring, and encouraging bystander intervention) was associated with an increase in victimization. They recommend that work with peers (arguably a central feature of restorative practice models) should not be used.

Despite mixed and sometimes disappointing evidence of effectiveness with regard to bullying, the popular perception of restorative practice has led some schools to implement these sorts of programmes as whole-school behaviour management systems. It’s hard to define this approach, but typically they involve facilitated discussion between the teacher and student about low-level disruption in lessons in place of – though sometimes in addition to – a direct sanction. Again, there are many case studies reporting positive effects for these programmes, but systematic quantitative evidence is thin on the ground. For example:

“”We’ve shown in case study after case study that schools that adopt this approach report significant changes in their cultures,” said Dr. Paul McCold, researcher and founding faculty member of the International Institute for Restorative Practices (IIRP) graduate school. “What’s needed now is solid quantitative research.””

There are evidently many problems when trying to implement restorative practice programmes in schools. David Didau identifies this problem in his list of psychological principles for teachers:

The biggest problem with restorative justice is that it often becomes a blunt and clumsy stick. The culprit’s needs are often placed over those of the victim. A victim may not want a relationship to be restored and this should never be imposed.”

This becomes even more of an issue when such programmes are used for issues of low-level disruption. In my experience, it can sometimes be successful (e.g. where the student genuinely accepts they were in the wrong and is keen to make amends). However, I suspect the same students who ended up in detention all the time simply end up in endless ‘conflict resolution discussions’ instead. I’ve experienced many occasions where the student isn’t prepared to accept any responsibility or – more difficult still – tries to manipulate the discussion to appear the ‘victim’. If a student merely goes through the motions and isn’t really interested in taking responsibility for their actions, there’s the risk that such systems may inadvertently undermine good behaviour.

Lastly, there are some psychologists who are deeply concerned about the therapeutic frameworks being imported from positive psychology and CBT into schools. In ‘The Dangerous Rise of Therapeutic Education’, Katherine Eccelstone warns that these approaches risk developing students into anxious and self-preoccupied individuals, undermine parental and teacher authority, and represent a diminished view of human potential.

Posted in Psychology for teachers | Tagged , , , , | 16 Comments

The psychology of behaviour management (part 1)

The topic of behaviour management and the problems teachers face in dealing with disruption to lessons continues to evoke strong argument within the profession. The extent of the problem was explored in a 2014 paper by Terry Haydn which argued that whilst ‘official’ reports like Ofsted inspections appeared to rate behaviour as at least ‘satisfactory’  the majority of schools, there was evidence that deficits in classroom climate continue to be a serious and widespread problem. Examples of blogs detailing the sorts of issues in school approaches to behaviour are plentiful (an excellent example from Andrew Old can be found here).

Systems of rewards and punishments have long been the norm in schools but perhaps because of a growing feeling that behaviour has become increasingly difficult to manage, behaviour management has become the focus of experimentation. Some schools have started looking for novel solutions to the problem of disruption in lessons (e.g. Kilgarth school in Birkenhead was recently reported to have ‘banned’ punishment altogether). Whereas, others believe that proportionate sanctions need to be available to teachers as a deterrent (e.g. Tom Bennett urging “schools to bring back detention”). In June last year, the government set up a working party, led by Tom Bennett to develop better training for new teachers and showcase effective practices in schools. For an example of Tom’s approach, there’s a nice practical guide to managing difficult behaviour recently published by Unison.

One controversial approach has been to move schools away from systems of reward and punishment towards a ‘Restorative Justice’ approach. Originally developed within the context of police work, the idea of restorative practice involves conversations between ‘offender’ and ‘victim’ or the teacher and student to give an opportunity to discuss how they have been affected by events and to decide what should be done to move forward. There are claims that this approach can improve behaviour and results, but critics argue that such policies are making schools less safe. Whilst not always explicitly linked, many of the processes appear to draw upon techniques used in cognitive behavioural therapy (CBT). For example, ‘Restorative Thinking’ is a team that work with schools to implement school restorative practices that make the link to CBT and other forms of therapy explicit.

Another controversial approach has come from Doug Lemov’s ‘Teach Like a Champion’. Lemov’s approach involves using standardised routines to create a positive classroom climate.  The system has sparked considerable interest in the UK, but also many critics. Perhaps most notable amongst these critics is Sue Cowley, author of ‘Getting the buggers to behave’ who recently condemned* an example of this approach as “a kind of ‘Pavlov’s Dogs’ approach to education”.

(*Edit – However see Sue’s comment below)

Most teachers likely already use some combination of these various approaches, but teachers may not be aware of the psychological theories and practices which they are (implicitly or explicitly) based upon. Over a short series of blogs, I want to briefly explore these psychological underpinnings in the hope they help explain some of the advantages and limitations of each system.

Part 1: Behaviourism

“Behaviourist” is sometimes used in a pejorative way when describing behaviour management systems, but schools using some sort of system for rewarding or sanctioning behaviour are implicitly using a behaviourist approach.

Behaviourism was a term coined by John Watson in an article published in 1913, but its roots go back to the famous studies by Ivan Pavlov (who discovered Classical conditioning as an accidental side-line to his Nobel Prize winning research on digestion). However, the behaviourist most associated with education is B. F. Skinner. Much misunderstood, and often unfairly maligned, his theory of operant conditioning continues to influence schools to this day.

BFSkinner pic

Source of image

Drawing on the earlier work of Edward Thorndike, Skinner developed his theory of operant conditioning by exposing animals like rats and pigeons to carefully controlled stimuli and recording their responses (what’s often referred to as a ‘Skinner box’).  Skinner identified a variety of techniques which could be used to shape animal behaviour and wrote about how these might be applied to human behaviour (and education specifically).

The core idea within operant conditioning is reinforcement and punishment. Very simply, when an animal receives reinforcement after performing a behaviour they are more likely to repeat that behaviour. Conversely, receiving a punishment after performing a behaviour leads the animal to be less likely to repeat that behaviour in future. Skinner further described reinforcements and punishments as being ‘positive’ or ‘negative’ in character.

reinforcement and punishment grid


Skinner’s rather harsh reputation means that many teachers are surprised to discover that he was very much against the use of punishment in schools. Skinner believed that one of the major disadvantages of punishment is that, even where it is consistently applied, it merely temporarily suppresses an undesirable behaviour.

“Severe punishment unquestionably has an immediate effect in reducing a tendency to act in a given way. This result is no doubt responsible for its widespread use. We “instinctively” attack anyone whose behavior displeases us —perhaps not in physical assault, but with criticism, disapproval, blame, or ridicule. Whether or not there is an inherited tendency to do this, the immediate effect of the practice is reinforcing enough to explain its currency. In the long run, however, punishment does not actually eliminate behavior from a repertoire, and its temporary achievement is obtained at tremendous cost in reducing the over-all efficiency and happiness of the group.”

Science and Human Behaviour, p190.

Contrary to his rather cold, clinical popular reputation, Skinner was a compassionate humanitarian (he won The American Humanist Association’s “Humanist of the Year” award in 1972) who wanted science to help shape a better society by utilising rewards rather than punishment in order to promote pro-social behaviour. I suspect he’d have approved of Kilgarth school’s decision to ‘ban’ punishment, for instance.

However, the issue around the effectiveness of punishment is rather more complex than Skinner believed. For example, a fascinating meta-analysis by Balliet and Van Lange (2013) examined whether punishment was more effective at promoting cooperation in high or low trust societies. They reviewed 83 studies involving 7,361 participants across 18 societies and found a rather surprising conclusion: Punishment appears to effectively promote cooperation in societies with high trust. In essence, they argue that where there is a great deal of trust, members of a society adhere to norms that encourage both cooperation and the punishment of those who defy cooperative social norms. Punishment is less effective in societies where there is a lack of trust: They argue that social norms may be less strongly shared and enforced and so punishment may be less effective in these societies.

“A willingness to pay a cost to punish others, especially noncooperative others, is likely to be viewed as a strong concern with collective outcomes. At the same time, such benevolent views of costly punishment may be more likely to occur in societies that contain higher amounts of trust in others, which we conceptualized earlier in terms of beliefs about benevolence toward the self and others.”

An important question for future research is whether ‘benevolent punishment’ is as effective at an organisational level (e.g. a school) as it appears to be at a society level. However, the implication would be that in benevolent, high-trust environments the proportionate use of punishment to support cooperative social norms can be effective.

Another reason why punishment may be effective is a phenomenon called ‘loss aversion’. The work of Tversky and Kahneman suggests that there is an asymmetry between the effects of positive reinforcement and negative punishment – in that where people weigh up similar gains and losses; people tend to prefer avoiding losses to making gains. For example, Hackenberg (2014) Token Reinforcement: A Review And Analysis, reports an experiment where the value of a loss was worth approximately three times more than a gain. It seems highly likely that this effect might also apply to the sorts of token reward systems employed in schools; suggesting that negative punishment (e.g. loss of merits) may be more motivating than opportunities to gain merits.


Skinner believed that rewards were the most effective way of shaping behaviour and focused a great deal of his research attempting to find out the most effective patterns of reinforcement. In his ‘Skinner box’ experiments, he was able to carefully control the ‘schedule of reinforcement’ and measure the concomitant changes in the desired behaviour.

schedules of reinforcement

Intuitively, teachers see the need for consistency where punishments are applied and I’ve sometimes heard teachers argue that rewards should be given with equal consistency. However, Skinner’s work on ‘schedules of reinforcement’ appears to show that such systems tend to be relatively ineffective. The problem with systems seeking high consistency in rewarding students is that whilst the student’s behaviour may be swiftly modified, the desirable behaviour may become highly contingent upon the presence of the reward. The odd thing about rewards is that they appear to work better when they are slightly unpredictable. A simple summary of these differences:

schedules of reinforcement 2

In Skinner’s experiments, the extinction rates (the rate at which the desired behaviour stopped being performed) was quickest where there was continuous reinforcement (i.e. a reward given for every time the behaviour was performed). Where there was variability in the time interval or ratio, then the behaviour persists for longer in the absence of reinforcement. Skinner believed this represents the ‘power’ of the slot machine. The fact that playing it is unpredictably rewarded by a pay-out encourages the person to continue playing – even where they hit a long streak of losing.

In schools, sometimes these reward systems take on the structure of a ‘token economy’ (systems also used in prisons and psychiatric units – where individuals earn tokens for ‘good behaviour’ which can be used to purchase privileges). However, whilst explicit reward schedules have been used with children (e.g. children with ADD or Autism for example), reward systems have a number of problems which often undermines their use in schools.

One issue is ‘satiation’ – particularly older children rapidly lose interest in the tokens (e.g. merit stickers) or even primary reinforcers (e.g. sweets) that teachers hand out for desirable behaviour. I recall a student teacher handing out sweets to reward year 10 students for answering questions in class. Many of the students took part, but I noticed one lad sat there scowling with his arms crossed. Chatting to him, it was clear he knew many of the answers so I asked why he wasn’t putting his hand up – he said, “What’s the point? I can just buy my own sweets if I want them”. This problem often leads into what I call ‘reward inflation’ as teachers either have to constantly find novel rewards or end up handing out more and more tokens to elicit the same desirable behaviour.

Another issue is that reinforcement can have negative effects. It’s devilishly hard in a class of 30 students to accurately assess how much effort students have genuinely put into their class or homework. Giving praise or a merit for work which actually required little effort may inadvertently imply that you have low expectations of that student.

Lastly, children aren’t stupid. They rapidly learn when they are being manipulated by a reward system and sometimes manage to turn the tables on the teacher by learning to manipulate the criteria used to elicit a reward. I knew one teacher who, in an attempt to tame a particularly difficult class, had managed to trap themselves into handing out 4 or 5 merits to a number of the most naughty children every lesson.

Two great articles by Daniel Willingham further explore some of these problems: Should Learning Be Its Own Reward? and How Praise Can Motivate—or Stifle. At the end of this second article, Willingham summarises the way a teacher’s most common form of positive reinforcement – praise – might best be utilised:

“Praise should be sincere, meaning that the child has done something praiseworthy. The content of the praise should express congratulations (rather than express a wish of something else the child should do). The target of the praise should be not an attribute of the child, but rather an attribute of the child’s behavior.”

In summary

Whilst the term ‘behaviourist’ is used in a pejorative way by some teachers, Skinner desired his research to be used to create societies where reinforcement was used to encourage people to do the right thing, rather than punishment. There’s an enormous amount schools could potentially learn from the classic works on operant conditioning and ways to run token economies (which most school reward systems tend to form).

However, there are some interesting reasons why some of Skinner’s ideas may need updating. ‘Benevolent’ punishment and negative punishment (which may tap into our innate loss-aversion bias) may in some cases be equally or more effective than rewards (so long as they are deserved but a little unpredictable). Both potentially can be used to effectively support behaviour in schools.

In the next post in this series, I’m going to take a similar look at the topic of ‘restorative practices’ and some of the ideas from cognitive-behavioural therapy which underlie many of the systems used in schools.


Posted in Psychology for teachers | Tagged , , , , , , | 18 Comments

The ‘artificial science’ of teaching: System vs Individual competence

Over the last two posts, I’ve been exploring the extent to which teaching is a natural ability and whether there is a formal or ‘professional’ body of knowledge or set of skills required for effective teaching. In summary:

The ability to teach arises universally and spontaneously in early childhood, implying that it is a natural ability (e.g. like first language acquisition). I’m not suggesting that young children can teach to the degree that professional teachers can (no solution for the recruitment issues in education here!) because great teaching requires a strong knowledge and understanding of the subject being taught and benefits from practical experience in the classroom to refine that raw ability.

However, there’s a question as to whether there is a set of formal knowledge or skills in order to become a great teacher. This challenge was taken up by @informed_edu who related a range of areas where teachers would benefit from formalised instruction.

For some of the areas he raised, I agreed: How children learn, curriculum and assessment design, some level of statistics and research methods. They are not the sorts of secondary knowledge teachers would necessarily have from their subject degree, therefore they would appear especially fertile ground for developing a formalised body of professional knowledge and skills.

We disagreed on a couple of areas, particularly behaviour management and mentoring/coaching/leadership skills. There was an implicit distinction I was making in the posts which I’d like to make explicit in this one: A distinction, I will argue, that might be useful when exploring the extent to which teaching is an ‘artificial science’.

Individual competence

In this post, I asked some questions about the proposed College of Teaching relating to the idea of ‘aspirational’ standards. I expressed concerns about the way such standards would be drawn up and how reliably those standards would be applied to members of the College. @GalcottGareth gave a tentative and reasoned reply to some of these points, suggesting any new standards would form the basis of some sort of ‘Chartered’ status – though he was understandably unable to detail how these standards would be determined or how they could be applied in a reliable way.

Professions like medicine and law require practitioners to have a specialist body of knowledge. It is not sufficient to merely have initial training in the subject, there is a requirement that members of these professions keep abreast of new development (e.g. the efficacy of new treatments in medicine or the precedent of new cases in law). These are essentially academic disciplines (though they involve a performance element, e.g. inserting a cannula or presenting a case in court). Thus formal professional development in these areas tends to involve academic specialism through attending lectures or conferences, reading articles or journals, etc.

Professional athletes and footballers have a degree of natural talent within their particular field of sport. It is not sufficient merely to have this talent; there is a requirement to refine their ability through training and practice in order to perform at their best during competition. These are essentially performance disciplines – thus the development of a professional athlete tends to involve personalised coaching feedback rather than academic study.

The question, I think, is whether teaching is an academic profession like medicine or law or whether teachers are performance professionals in the same way as footballers and athletes. My hope, to lay bare my ulterior motives, is that we are / can become a combination of both – but merely wishing it so isn’t a very helpful exercise.

As a natural ability, teaching would appear to be mainly a performance discipline. Yes, there is formal subject knowledge required, but a great teacher needs practice to refine their natural ability to teach and perform effectively in the classroom (rather than formalised academic study). If that is true, then we might expect effective professional development should principally involve personalised feedback.

What is the best form for this personalised feedback to take? Should it be through mentoring, coaching or line management? I don’t think it matters. I’ve been involved in a variety of frameworks for school-based teacher training, mentoring and coaching and I don’t think the format is the effective ingredient. What probably underlies effective feedback is not the system used but the qualities of the individual. The ability to offer appropriate challenge to a teacher’s reflection in a manner which does not undermine their self-efficacy is as ‘natural’ as teaching – I think. My ‘Dodo bird’ verdict is that whether that individual is doing this within the framework of a line manager, mentor or coach is irrelevant compared how that feedback is given.

If this is true, then drawing up ‘aspirational standards’ for teachers will be highly problematic. We know there are enormous problems with the reliability of measures of effective teaching. Thus, judging whether a teacher meets an enhanced set of standards for classroom practice in order to obtain chartered status will be equally problematic. Unlike medicine or law, where academic qualifications provide a pointer towards individual competence, judging performance in a fair and reliable way is extremely difficult and economically impractical.

So, the open question is whether there is a formal body of knowledge or set of skills which makes teaching like an academic discipline as well as a performance one.

At the moment, there isn’t. Teachers don’t go to conferences or lectures to learn about teaching, nor is there an expectation that teachers read articles or journals to update their professional knowledge.

Should they though? In my last post, I concurred with David that there are some technical elements related to teaching which might benefit teachers to learn about: How children learn, curriculum and assessment design, some level of statistics and research methods would appear especially fertile ground for developing a formalised body of professional knowledge and skills.

Whether there is genuinely an advantage to teachers learning these things is an urgent hypothesis for anyone seeking to move teaching towards a higher-status profession by combining academic and performance disciplines. If there were a body of knowledge related to effective teaching, then ‘Chartered’ status becomes a much more realistic prospect. Teachers could have access to formal instruction through post-qualification programmes of study (perhaps even assessed by exams – haha!) which would provide a reliable pointer for individual competence. The null hypothesis is that this formal body of knowledge will have no bearing upon the quality of teaching. I’d really like to reject that null hypothesis, by the way – but I don’t think we justifiably can at present.

System competence

A different way of examining the question of whether teaching is an ‘artificial science’ is to consider the question at the level of the system rather than the individual.

In my last post, for instance, I argued that behaviour management mostly involves the exercise of ‘folk psychology’ rather than requiring an explicit ‘body of knowledge’. Much more important than explicit training, I argued, is the identification of effective school-wide systems for supporting teachers in developing the relationships and routines in classrooms.

David replied:

“I’d like to challenge your discussion about behaviour. It feels like you’re saying that if I know how to understand what a child is thinking and I can set expectations then my behaviour management will be fine (if I’m in a school with good systems).”

Placing the responsibility for the behaviour of students in a classroom entirely upon the teacher (i.e. at the level of individual competence) is a very quick way to destroy that teacher – or undermine the teaching in that classroom. There are loads of examples to support this – but to pick one, @LearningSpy wrote this last year:

Undermining teachers is easy via @LearningSpy

The truth is, highly-experienced, effective teachers can struggle with behaviour – especially when they are a new face and/or lack ‘senior status’ within a school. More systematic evidence of this comes from Haydn (2014) which I discussed here:

Talking about the behaviour in our lessons

Haydn found that experienced ASTs reported struggling with the behaviour in some lessons:

“One AST (Advanced Skills Teacher) told me:

Well I’m an AST. . . I’m not saying that that means that I’m superman but it’s reasonable to say that there are some who struggle even more than I do and I go down to about level 4 with some groups.”

Now, I’m not saying that there’s nothing a teacher can improve in their behaviour management techniques. Certainly developing working relationships and a set of effective routines requires practice and (as a performance discipline) likely would benefit from personalised coaching. However, even the most effective practitioner will be undermined in a system which does not support them in this. Thus, I think evidence relating to effective behaviour management needs more focus as a system-level competence rather than an individual one.

A second area of debate was around the systems to support professional development in schools. I’d argued the ‘Dodo bird’ verdict with respect to whether schools adopted a manager, mentoring or coaching framework (which I hope I’ve clarified above) – David replied:

“I think your assertions that nothing is inferior/superior in teacher development is directly in contrast to the research base – e.g. http://TDTrust.org/about/dgt.”

It’s a really useful report – I’d encourage all teachers, especially school leaders, to read it. I’d argue that the TDT report is an excellent example of how research evidence might help improve ‘system competence’.

What the TDT report describes are aspects of the ‘system competence’ needed to effectively provide the personalised feedback to teachers. However, I’d argue that whether this regular, student-outcome focused, personalised feedback takes place within the framework of line-management, mentoring or coaching is probably less important that the skill of the manager, mentor or coach to give that feedback in an effective way. I might also argue that the TDT report into ‘Developing Great Teaching’ actually supports my view that teaching is a natural ability – therefore a performance discipline.

In summary

If teaching is primarily a natural ability, which its spontaneous, rapid and universal development implies it is, then in terms of individual competence it might be classified as a performance discipline. If so, teacher professional development would have more in common with that of a footballer or an athlete rather than a doctor or a lawyer – regular personalised feedback provided to challenge a teacher’s self-reflection whilst not undermining their self-efficacy. Accountability systems tend to get in the way of this form of professional development, thus I think there needs to be more focus on ‘system competence’ rather than ‘individual competence’ in order to make much progress in improving schools, teaching and student outcomes.

On the other hand, plausible examples of domains which might form a professional body of knowledge exist: How children learn, curriculum and assessment design, some level of statistics and research methods. One question is what these domains would look like as a post-qualification programme of study. There is also the urgent question of whether formalised instruction in these areas would improve teaching.


Posted in Education policy | Tagged , , | 7 Comments

The ‘artificiality’ of teaching

In my last post, I argued that the universality and the spontaneous development of teaching leads to the conclusion that teaching is a natural ability. The post generated some really interesting responses, but one from @informed_edu made a direct attempt to answer the question I posed to ITE providers: What is the ‘technical’ or ‘professional’ body of knowledge or set of skills required of an effective teacher, which can actually be taught?

Whilst teaching may have evolved as a natural cognition (based on a functioning theory of mind) there are many aspects of modern teaching which are artificial. I use the term ‘artificial’ not in a pejorative sense but in the same sense as Herbert Simon in ‘The Sciences of the Artificial’.

“Natural science is knowledge about natural objects and phenomena. We ask whether there cannot also be “artificial” science knowledge about artificial objects and phenomena.” page 3

The modern context of teaching and more widely education are cultural phenomena, created by human beings rather than emerging directly from evolution through natural selection. Whilst, at its core, teaching may be a ‘natural ability’, it operates through artificial, culturally derived, systems. David Geary in ‘Educating the Evolved Mind’ suggests that these systems have emerged to meet a specific cultural demand.

He suggests that secondary cultural knowledge (e.g. science, literature, art) emerged from cognitive and motivational systems evolved to support what he calls primary or ‘folk knowledge’: Things like folk psychology (interest in people), folk biology (interest in living things) and folk physics (interest in inanimate objects) which directly aided survival and reproduction in our evolutionary past. As humans developed ways to retain these cultural artefacts across generations, he proposes that there was created an ever-growing gap between ‘folk knowledge’ (which people rapidly and easily acquire) and the theories and knowledge base of secondary knowledge (which people need explicitly to be taught).

Geary argues that where this gap between ‘folk knowledge’ and secondary cultural knowledge becomes large enough, schools emerge as cultural institutions. The function of schools, he suggests, is to close the gap between the biologically primary knowledge children rapidly learn for themselves and the secondary knowledge needed for living in society.

“The need for explicit instruction will be a direct function of the degree to which the secondary competency differs from the supporting primary systems.” Page 35

Teaching, I argued in the last post, largely involves ‘folk psychology’ – a rapidly acquired ability to pass-on cultural knowledge across generations. Beyond knowledge of secondary culture itself (subject knowledge), the question is whether there is a body of secondary knowledge required for teaching. What are the ‘technical’ or ‘professional’ elements of knowledge or sets of skills required of an effective teacher?

David’s response to ‘Is teaching a natural ability?’

You can read @informed_edu ‘s response in the comments to my last post here. He argued that:

“some form of teaching comes naturally to most people, but that doesn’t mean that the version that comes most naturally is always most effective”

To summarise (I hope fairly), David suggested a range of secondary knowledge required of a teacher which benefit from formalised instruction:

  1. Planning lessons
  2. Teaching strategies
  3. Curriculum design
  4. Assessment design
  5. How children learn
  6. Differences between students: e.g. special educational needs
  7. Behaviour management
  8. Drawing upon and evaluating research evidence
  9. Mentoring, coaching and leading teachers

Lastly he says:

“In most professions, the acquisition of this is significantly more formalised and then you achieve recognition for having learned it. You also get the options to delve into more specialist areas and receive well-planned learning and the opportunity to be recognised for that. This helps professions both build, recognise and then use the knowledge – easier to identify who to turn to for advice if there is a better system for recognising expertise.”

A body of knowledge for teaching

I agree with some of David’s points, others I think are a function of subject knowledge (which I readily conceded is learnt), and others I don’t think can be argued to form part of a professional body of knowledge or a technical competence.


How useful is lesson planning? It seems to me an empirical question – what sort of planning actually improves student outcomes? Over my career, I’ve been asked to use a wide variety of formats to record what I intended to teach. I must confess I’ve often found it easier to write the lesson plan after I’d taught the lesson!

That’s somewhat flippant, but there are a couple of reasons I’m skeptical about the value of lesson planning. Firstly, the ‘impossible task of mind reading’ means I cannot always anticipate where students will have difficulty or achieve understanding easily. I concede that that very early on in my PGCE year I needed to sketch thoughts on paper before trying things out in the classroom. However, great teaching, in my opinion, requires responsive flexibility rather than explicit planning. Such flexibility undoubtedly comes from practice rather than any formal instruction – thus I reject the notion that planning forms a technical competence. I strongly suspect that teachers are using their subject knowledge and theory of mind to actually teach and that a great deal of ‘planning’ is being done merely for the appearance or accountability

Secondly, I’m far from certain that planning a lesson is even the right level of focus. Two great blogs exploring this:

A lesson is the wrong unit of time via @BodilUK

The problem with lesson planning via @LearningSpy

Teaching strategies

This, for me, is the most problematic area in CPD. Throughout my career, I’ve been told that one strategy or another was effective or necessary: I’ve been told I need to differentiate for kinaesthetic learners; told to limit the amount of talking I’m allowed to use; told to use lollipop sticks as a way of randomly sampling the class for questioning; told to make children write lesson objectives; told to divide lessons into starters, mains and plenaries, etc. The problem is that, beyond what evolved natural ability a teacher possesses, many teaching strategies passed on through CPD are little more than gimmicks.

Even much more plausible, research-based strategies – e.g. Assessment for Learning – tend to have devolved to the level of bureaucratic box-ticking rather than any useful strategy. For example, requiring teachers to report ‘progress’ through (non-existent) sub-levels and generating targets by which a student will reach the next (non-existent) sub-level. Beyond the observation that great teachers formatively assess learning as they teach (which I argue is natural ability) how useful were any of the ‘strategies’ that arose from AfL? This argument is explored in more depth by David Didau:

AfL: Cargo cult teaching? via @LearningSpy

5-year olds ask questions to check understanding whilst teaching, and great teachers do this too. I have no doubt that AfL provides a description of great teaching – my question is, does explicit teaching of AfL strategies (or any other teaching strategy) actually improve teaching?

Curriculum design

I don’t doubt that curriculum design involves a strong understanding of a subject – thus, I concede this involves secondary knowledge, but I’d argue that teachers do not receive any explicit training in curriculum design and therefore it’s hard to argue that this forms a pillar of professional status.

I think there are some useful things teachers can learn about curriculum design. Two that were most useful for me were these:

Trivium 21st Century via @Trivium21c

One scientific insight for curriculum design via @joe_kirby

The first of these provided an interesting model for thinking about curriculum design. Martin’s many examples of grammar, dialectic and rhetoric across different subject areas was based firmly within tradition, but I think it works because it reflects the inheritance, selection and variation which drives cultural evolution (but this is opinion). Joe’s ideas, based upon our profession’s nascent understanding of how children learn, are an excellent example of perhaps where a body of professional knowledge might exist to be exploited.

Assessment design

There’s certainly a body of technical knowledge required for effective assessment design. I guess my major issue is that teachers aren’t taught it! A great starting point, in my opinion, is the work of Daisy Christodoulou.

Guide to my posts about assessment via @daisychristo

How children learn

This is another area where I’m in complete agreement with David. Education appears to have recently discovered that scientific ideas about how humans learn didn’t stop in the 1920-30s with Jean Piaget and Lev Vygotsky.

There really is a body of knowledge here that teachers might benefit from. For an accessible starting point, teachers could do far worse than the Deans for Impact – The Science of Learning:

The Science of Learning

Obviously, I’m greatly biased in this regard – having a background in psychology and also writing a blog principally about trying to apply psychological research to the classroom! However, the sheer number of myths circulating in education regarding how children learn makes it something of a justifiable priority in my opinion.

Differences between students

There’s certainly a body of technical knowledge related to SEND. Merely understanding the overwhelming number of labels applied to children requires some explicit explanation.

Personally, I’m cautious of many of the labels that underlie differentiation strategies in lessons. There’s some evidence that such labels may not always benefit the children involved – for example:

Does the dyslexia label disable teachers?

There are certainly children who struggle within the large classes and the (inevitably) limited personal attention in mainstream education, but differentiation strategies have suffered from the same myths and unevaluated claims as other teaching strategies. There are some areas of SEND where some technical knowledge about how children learn might be very applicable, for example from Susan Gathercole and Tracey Alloway:

Understanding Working Memory: A Classroom Guide

However, the best starting point for understanding individual differences in learning is probably understanding how children learn in the first place (see above). Otherwise, I’d argue the majority of day-to-day classroom differentiation is running off the same ‘subject-knowledge-mediated-through-theory-of-mind’ as the rest of teaching.

Behaviour management

This is an interesting area of current debate. Many schools run behaviour management systems entirely upon operant conditioning lines (a branch of behaviourism – which includes using rewards and/or punishments). These behaviourist approaches have some fundamental limitations however, for example older children typically see through attempts to manipulate their behaviour through rewards and praise can undermine effort if used carelessly:

Praise and rewards – use thoughtfully!

For the most part, dealing with children is employing a teacher’s theory of mind more than applying an explicit body of technical knowledge, in my opinion. There are certainly helpful starting points for new teachers – mainly that children respond to clear and consistent expectations:

Tom Bennett’s top ten tips for maintaining classroom discipline via @tombennett71

Some of the behaviour challenges faced by teachers are often due to merely being a new face. Contrary to the saying ‘familiarity breeds contempt’ – there’s evidence that repeated interaction with the same person tends to bring about more positive attributions about that person (in psychology this is called the ‘mere-exposure’ effect). I’ve often wondered how much of behaviour issues in schools stem from staff turnover and timetable instability.

On balance, most of this involves ‘folk psychology’ and I remain to be convinced that there is an explicit ‘body of knowledge’ which underlies a positive classroom climate which teachers need to learn in order to be effective. Much more important, in my view, is the identification of effective school-wide systems for supporting teachers in developing the relationships and routines in classrooms.

Drawing upon and evaluating research evidence

There’s certainly a great deal of secondary cultural knowledge within research methods (including things David mentioned like statistics and how to assess things like validity and reliability).

Whilst organisations like the EEF were founded to provide teachers with better information about effective intervention and teaching strategies, it’s not clear how effectively this information is being used in schools. Pilot projects like ‘Evidence for the Frontline’ seek to overcome the gap between research and teaching through brokering partnerships – and perhaps this will help teachers access and implement more effective interventions. Finally, a fantastic grassroots organisation is achieving this dialogue between researchers and teachers at an international level, and I’d encourage any teacher to get involved with researchED.

At the very least, we might hope that greater professional understanding of research would help teaching avoid the gimmicks and myths which bedevil education. Therefore, perhaps some level of understanding of research methods – and most certainly statistics – would be useful for teachers. A big question is the degree to which we might expect all teachers to be ‘research literate’ and whether/what sort of teacher ‘research’ has demonstrable practical value in developing effective teaching.

Mentoring, coaching and leading teachers

David mentioned a range of things related to developing teaching – from ITT to school leadership. I’m going to be very sceptical and suggest a null hypothesis: None of the systems for developing teachers is more or less effective, it is merely that some teacher trainers, coaches and school leaders have a well-functioning theory of mind which makes them effective (regardless of the system they use). In essence, like an argument put forward regarding counselling, I think the ‘Dodo bird verdict’ applies to different models of mentoring, coaching and leadership.

In conclusion

I’ve argued that beyond knowledge and understanding of the subject to be taught – and some experience at teaching it – that teaching is essentially a natural ability arising from a well-functioning Theory of Mind. David mounted an interesting challenge to this, relating a list of essential knowledge and skills required for effective teaching which required explicit teaching and practice.

I disagree with some of his suggestions. As a profession I think we have a whole swathe of questionable teaching strategies and interventions, debateable behaviour management guidance and uncertain differentiation advice – much of which probably involves a natural ability to teach (plus a bit of practice) rather than the effectiveness of the strategies. I remain to be convinced that these require or benefit from formalised, explicit instruction.

However, for some of the areas he raised, I agree: How children learn, curriculum and assessment design, some level of statistics and research methods would appear especially fertile ground for developing a formalised body of professional knowledge and skills. What’s remarkable perhaps is the relative absence of these features from teacher training and professional development.

Posted in Education policy | Tagged , , , , , , , , , , | 4 Comments