The Hidden Bias in GCSE Listening Exams That Disadvantages Many Learners and What You Can Do About it

Introduction

There is a persistent tendency in language teaching to explain listening failure almost exclusively in linguistic terms: students struggle because they do not know enough vocabulary, because their grammar is weak, because they have not had enough exposure, because they are not applying the “right” listening strategies, or because they supposedly lack resilience, concentration or motivation.

Often, of course, this diagnosis is correct. Vocabulary knowledge, grammatical competence and phonological familiarity remain the strongest predictors of listening success (Nation, 2013; Vandergrift & Goh, 2012), and without them there is no listening competence to speak of.

However, as I have repeatedly argued on this blog over the years, especially in my work on listening pedagogy, decoding and micro-listening, this explanation is incomplete, because listening in a second language is not merely a linguistic act. It is also a highly demanding cognitive activity which places considerable strain on working memory, attentional control, processing speed, inhibitory control and executive functioning, particularly when learners are required not only to process language accurately in real time, but to sustain that processing repeatedly over extended periods whilst simultaneously managing uncertainty, stress, partial comprehension and continual decision-making.

In other words, what listening exams frequently measure is not simply whether students know the language, but whether they can continue accessing and deploying that knowledge efficiently once cognitive fatigue begins to accumulate.

What Listening Actually Involves

I know I have repeated this ad nauseam, but let me say it once again: listening in a second language is arguably one of the most cognitively expensive things we ask novice and intermediate learners to do. Unlike reading, where learners can pause, re-read and control the pace of processing, listening unfolds in real time and disappears almost instantly, forcing the learner to process, interpret and integrate incoming information under severe temporal pressure.

The learner must:

  • decode the acoustic signal,
  • segment the speech stream into meaningful units,
  • retrieve lexical meaning,
  • interpret grammatical relationships,
  • suppress competing interpretations,
  • hold partial meaning in working memory,
  • integrate new information with previous clauses,
  • and construct a coherent mental representation of meaning,

all within fractions of seconds!

Research over the last two decades has shown repeatedly that these processes rely heavily on:

  • working memory (Baddeley, 2003; Andringa et al., 2012),
  • attentional control (Mackey et al., 2010),
  • processing speed (Segalowitz, 2010),
  • executive functioning (Miyake & Friedman, 2012),
  • and automaticity (DeKeyser, 2007).

Neurolinguistic research using fMRI and EEG methodologies has further demonstrated that L2 listening recruits substantially more frontal-lobe activity than L1 listening, especially in lower-proficiency learners, because processing remains effortful rather than automatised (Abutalebi, 2008; Perani & Abutalebi, 2005). In practical terms, this means that novice listeners must consciously allocate attentional resources to processes which, in expert listeners, occur automatically and below conscious awareness.

As I have often pointed out in earlier posts, what students know certainly matters, but how efficiently they can process what they know often determines whether performance remains stable under pressure. A learner with reasonably secure vocabulary but weak decoding or slow retrieval may perform far worse than expected simply because each listening decision costs too much cognitive effort.

The Nature of GCSE and A-Level Listening Exams

As we all know, GCSE and A-Level listening papers are composed of:

  • multiple short extracts,
  • repeated cycles of listening and responding,
  • continual task-switching,
  • and sustained cognitive engagement across thirty to forty-five minutes.

This entails:

  • repeated high-load processing,
  • cumulative cognitive effort,
  • and fatigue accumulation over time.

Students are required to:

  • listen,
  • decode,
  • interpret,
  • infer,
  • decide,
  • and respond,

again and again, often with relatively little cognitive recovery time in between.

A typical GCSE listening paper may involve well over one hundred separate micro-decisions, each requiring rapid allocation of attentional and working-memory resources. Whilst any individual extract may appear manageable in isolation, the cumulative effect of repeated processing under pressure is far more substantial than many teachers realise.

Hence, the bottleneck is very often not the difficulty of the text itself, but rather the learner’s ability to sustain accurate processing across repeated listening events.

Cognitive Fatigue: What It Is and Why It Matters

Cognitive fatigue refers to the gradual decline in mental efficiency following sustained effortful processing (Borghini et al., 2018; Hockey, 2013). In listening contexts, this fatigue manifests itself through:

  • reduced attentional control,
  • slower processing speed,
  • weaker error monitoring,
  • diminished working-memory efficiency,
  • reduced inhibitory control,
  • and increasing susceptibility to distraction.

Crucially, learners do not suddenly “switch off”. Rather, because attentional resources become progressively depleted as processing demands accumulate, they gradually lose the ability to process accurately and efficiently, even though outwardly they may still appear focused and compliant.

Across the school day, this process is cumulative. A learner sitting a listening paper at 3pm is not cognitively equivalent to that same learner at 9am. Prior lessons, emotional stress, sustained concentration, social interaction, screen exposure and decision fatigue all consume cognitive resources, particularly in adolescents whose executive-control systems are still developing neurologically (Blakemore & Choudhury, 2006).

Within a listening paper itself, fatigue often evolves in a recognisable sequence:

  • early phase → relatively efficient processing,
  • middle phase → attentional drift and increasing effort,
  • later phase → greater reliance on guessing, partial cues and heuristics.

This aligns closely with research on attentional depletion and executive control (Kahneman, 2011; Miyake & Friedman, 2012), as well as with studies showing that sustained cognitive effort progressively reduces accuracy in tasks requiring rapid auditory discrimination and working-memory updating (Warm et al., 2008).

As I have often reiterated in previous discussions of listening instruction, the issue is rarely that learners stop trying. Rather, the cognitive system itself becomes less efficient over time.

Who Is Most Likely to Be Affected?

Not all learners are equally vulnerable to cumulative listening fatigue. Those most affected are usually the learners for whom each listening event already carries a high cognitive cost.

Learner profileWhy they are vulnerableWhat it looks like in class/exams
Students with poor working memoryCannot hold enough information online long enough to integrate meaning“I understood the beginning but lost the rest”
Slow lexical retrieversLexical access is too slow for real-time processing“I knew the word — but only afterwards”
Weak phonological decodersCannot reliably map sounds onto known wordsConfusions such as fui/fue or je vais/j’ai
Poor segmentersStruggle to identify word boundariesHear an undifferentiated stream
Fragile grammar processorsMiss small but crucial grammatical cuesTense and negation errors
Low-automaticity learnersProcessing remains effortful and consciousRapid fatigue accumulation
Anxiety-prone learnersAnxiety consumes working-memory resourcesPerformance collapses after mistakes
Learners with weak attentional controlCannot efficiently reset after failureOne error destabilises several subsequent items
Students with limited vocabulary depthToo much input remains uncertainHeavy reliance on guessing
Students with auditory-processing difficultiesSpeech processing itself is effortfulNeed excessive repetition
Fatigued or overloaded learnersReduced cognitive reservesStronger late-day decline

As I have often stressed on this blog, many of these learners are not weak in any simplistic sense. Rather, because the cognitive cost of each listening event is substantially higher for them than for more automatised listeners, fatigue accumulates earlier and more aggressively.

Listening Twice (or Three Times): Help or Illusion?

Most exam boards replay listening extracts twice, whilst Pearson uses three repetitions.

On the surface, this appears entirely beneficial, and indeed there are important advantages:

  • reduced immediate memory load,
  • opportunities to confirm interpretations,
  • greater support for lower-proficiency learners,
  • improved local accuracy (Vandergrift & Goh, 2012).

However, the situation is more nuanced than it first appears.

Repeated listens do not eliminate global cognitive fatigue. In fact, they may contribute to it! Why?

Firstly, some learners reduce attentional effort during the first hearing because they know another opportunity is coming, a phenomenon linked to effort-regulation theory (Hockey, 2013). Secondly, incorrect initial interpretations can become reinforced rather than corrected, especially when learners engage in confirmation bias rather than fresh processing during subsequent listens. Moreover, whilst replay may reduce local task difficulty, it simultaneously extends the cumulative duration of high-load processing across the paper.

In other words: repetition helps locally, but it does not fully solve the broader problem of cumulative fatigue. In fact, it may exacerbate it!

Implications for Teaching

If listening performance reflects not only linguistic knowledge but also processing efficiency, cognitive stamina, decoding automaticity and attentional resilience, then classroom practice must address all of these dimensions explicitly and systematically.

As I have argued repeatedly in my CPD and writing, merely “doing more listening” is often insufficient if the underlying sub-processes remain weak, fragile and effortful.

1. Build Stronger Linguistic Foundations

Vocabulary breadth and depth remain the strongest predictors of listening success (Nation, 2013). However, vocabulary knowledge must become rapidly accessible, not merely recognisable, because listening does not allow learners the luxury of extended retrieval time. This means:

  • cumulative retrieval,
  • narrow recycling,
  • delayed retrieval,
  • frequent low-stakes reactivation,
  • and rapid retrieval practice.

For example:

  • “Write five verbs you can use with je vais
  • “Past, present or future?”
  • “Translate rapidly: I used to eat / I ate / I am going to eat”

The goal is not simply memory, but speed and automaticity of access.

2. Develop Aural Competence Explicitly

As I have often emphasised in earlier posts, many listening failures originate not in comprehension but in perception. Learners therefore need systematic work on:

  • phonological discrimination,
  • segmentation,
  • reduced forms,
  • chunk recognition,
  • syllabic awareness,
  • and stress/rhythm perception.

Practical classroom examples include:

  • pero/perro discrimination,
  • je vais/j’ai contrast tasks,
  • chunk-marking exercises,
  • rapid dictation races,
  • syllable grouping activities,
  • shadow reading,
  • and selective dictation.

Even five minutes daily can significantly reduce future cognitive load because learners gradually begin to process these forms more automatically.

3. Increase Processing Efficiency and Automaticity

Automaticity matters enormously because the less effort each micro-decision requires, the more cognitive resources remain available for comprehension and inferencing. As I have repeatedly argued on this blog, speed of processing is not an optional extra. It is central to successful listening. Useful classroom routines include:

  • three-second sentence exposure,
  • rapid chunk reconstruction,
  • constrained listening retrieval,
  • timed sentence rebuilding,
  • rapid-fire tense identification,
  • disappearing transcripts,
  • and rapid-response mini-whiteboard routines.

For example:

  1. Show a sentence for three seconds.
  2. Remove it.
  3. Play it aloud.
  4. Students reconstruct it from memory.

Such activities force faster parsing, retrieval and integration.

4. Train Attentional Recovery

One of the biggest problems weaker listeners face is that one missed word destabilises subsequent processing, creating a cascade effect across multiple items. Students therefore need explicit practice in:

  • recovering after failure,
  • resetting attention,
  • tolerating ambiguity,
  • and continuing to process despite partial loss.

Teachers can train this by:

  • deliberately inserting unknown words,
  • interrupting extracts,
  • masking small sections,
  • or using “recover and continue” listening routines where students must keep processing despite uncertainty.

5. Reduce Cognitive Load Strategically

Research in Cognitive Load Theory (Sweller et al., 2011) strongly suggests that instructional design matters enormously.Teachers should therefore:

  • pre-teach essential lexical items,
  • reduce unnecessary task complexity,
  • sequence input carefully,
  • avoid overloading students with simultaneous demands,
  • and progressively increase difficulty.

Importantly, simplifying cognitive demands does not mean “dumbing down”. It means allocating attentional resources more intelligently.

6. Build Listening Stamina Indirectly

Importantly, stamina is not built simply by making students endure increasingly long listening texts.As I have consistently argued in my work on listening pedagogy, stamina improves primarily when processing becomes less effortful.This means:

  • stronger decoding,
  • faster retrieval,
  • better segmentation,
  • more automatised grammar recognition,
  • and greater attentional efficiency

In other words:students become more resistant to fatigue not because they “try harder”, but because each listening event gradually costs less cognitive effort.

The Role of Micro-Listening (MCL) Tasks

This is where micro-listening tasks — or MCL tasks, as developed extensively in my book Breaking the Sound Barrier — become particularly powerful. Those familiar with my work will now that MCL tasks consist of:

  • short,
  • highly interactive,
  • cognitively manageable listening cycles,

usually lasting between ten and twenty seconds. Each cycle typically involves:

  1. listening,
  2. making a rapid judgement,
  3. responding immediately,
  4. receiving corrective feedback.

As I have repeatedly maintained in my writing on listening pedagogy, these short cycles are far more aligned with the actual cognitive architecture of listening than the traditional “play a long text then answer comprehension questions” approach.

Those of you who are familiar with my book ‘Breaking the sound barrier: teaching learners how to listen’ (Conti & Smith, 2019) will know that MCL tasks are not random quick listening games. Each one targets a specific listening sub-process.

Listening sub-processWhat the learner is trainingExample MCL taskStudent response
Phoneme discriminationDistinguishing similar soundspero/perroChoose correct word
SegmentationIdentifying word boundariesjevaisaucinémaMark chunk boundaries
Morphological parsingDetecting grammatical morphologyje mange/j’ai mangéIdentify tense
Negation processingDetecting negation cuesje n’aime pasPositive or negative?
Person identificationRecognising subject/person markershablo/habla/hablanIdentify speaker
Lexical retrievalRapid meaning accessHear-and-matchGive rapid meaning
Syntax parsingUnderstanding grammatical relations“Who did what?”Assign roles
Distractor resistanceIgnoring misleading lexical cuesKeyword trap taskReject distractor
Working-memory updatingHolding and integrating meaningListen-hold-selectRetain then decide
Error detectionMonitoring linguistic accuracyCorrect vs incorrect versionIdentify the change
Discourse integrationLinking clauses coherentlyCause-result matchingIdentify logical relationship
Attentional resetRecovering after missed inputInterrupted listeningContinue processing

Because the tasks are:

  • short,
  • interactive,
  • repetitive,
  • feedback-rich,
  • and cognitively manageable,

they reduce overload whilst simultaneously increasing engagement and attentional reset. Moreover, because processing becomes faster and more automatised over time, fatigue accumulates much more slowly.

This is one of the most important points to understand: students do not become resistant to listening fatigue because they “try harder”; they become resistant because processing gradually becomes less effortful.

Conclusion

Ultimately, listening performance reflects the interaction of:

  • linguistic knowledge,
  • aural competence,
  • processing efficiency,
  • attentional control,
  • working-memory capacity,
  • executive functioning,
  • and cognitive stamina.

Overemphasising any one of these variables inevitably produces a distorted understanding of why learners succeed or fail.

The danger in many classrooms is that we continue to interpret listening breakdown purely as a knowledge deficit, when in reality it is often the product of cognitive overload interacting with fragile processing systems, inefficient decoding, weak attentional recovery and insufficient automaticity.

If we genuinely want students to perform better in GCSE and A-Level listening exams, then we need to move beyond the simplistic notion that more listening practice alone is the solution. What matters is not simply how much learners listen, but how efficiently, accurately and sustainably they are able to process what they hear under pressure and across time.

And that, in my view, changes the pedagogical conversation quite profoundly.

If you want to find out more on this topic, read my book ‘Breaking the sound barrier: teaching learners how to listen’ (Conti and Smith, 2019)

Selected References.

  • Andringa, S., Olsthoorn, N., van Beuningen, C., Schoonen, R., & Hulstijn, J. (2012). Determinants of success in native and non-native listening comprehension: An individual differences approach. Language Learning, 62(1), 49–78. Wiley.
  • Baddeley, A. (2003). Working memory and language: An overview. Journal of Communication Disorders, 36(3), 189–208. Elsevier.
  • Blakemore, S.-J., & Choudhury, S. (2006). Development of the adolescent brain: Implications for executive function and social cognition. Journal of Child Psychology and Psychiatry, 47(3–4), 296–312. Wiley.
  • Conti, G., & Smith, S. (2019). Breaking the Sound Barrier: Teaching Language Learners How to Listen. Piefke Trading Singapore.
  • DeKeyser, R. (2007). Practice in a second language: Perspectives from applied linguistics and cognitive psychology. Cambridge University Press.
  • Field, J. (2008). Listening in the Language Classroom. Cambridge University Press.
  • Hockey, G. R. J. (2013). The Psychology of Fatigue: Work, Effort and Control. Cambridge University Press.
  • Kahneman, D. (2011). Thinking, Fast and Slow. Penguin Books.
  • Mackey, A. (2010). Input, interaction and corrective feedback in L2 learning. Oxford University Press.
  • Nation, I. S. P. (2013). Learning Vocabulary in Another Language (2nd ed.). Cambridge University Press.
  • Roussel, S., Joulia, D., Tricot, A., & Sweller, J. (2021). Learning subject content through a foreign language should not ignore human cognitive architecture: A cognitive load theory approach. Educational Psychology Review, 33, 1225–1249. Springer.
  • Segalowitz, N. (2010). Cognitive Bases of Second Language Fluency. Routledge.
  • Sweller, J., Ayres, P., & Kalyuga, S. (2011). Cognitive Load Theory. Springer.
  • Vandergrift, L., & Goh, C. C. M. (2012). Teaching and Learning Second Language Listening: Metacognition in Action. Routledge.
  • Wallace, M. P. (2020). Individual differences in second language listening: Examining the role of vocabulary knowledge, working memory, and personality. Language Teaching Research, 24(6), 707–727. SAGE Publications.

EPI vs Rosenshine: Commonalities and Differences

Introduction

In recent years, many language teachers have noticed a striking overlap between classrooms informed by Rosenshine’s principles and those shaped by Extensive Processing Instruction (EPI). Both frameworks look structured, purposeful, and highly interactive. Students are engaged, teachers model clearly, and practice is carefully scaffolded. It is therefore tempting to conclude that the two are essentially the same thing. However, they are not.

What they share is a set of high-quality teaching behaviours. Where they diverge, on the other hand, is in the underlying theory of how learning—specifically language learning—actually happens over time. Barak Rosenshine distilled what effective teachers do across subjects; EPI, in addition, operationalises insights from second language acquisition and cognitive science to control how learners process, store, and retrieve language.

Where EPI and Rosenshine Align

Below, each shared principle is briefly unpacked—with concrete examples rooted in a typical English secondary school MFL curriculum (e.g. French/Spanish at KS3–KS4)—so that it maps directly onto the table that follows.

  • Structured teaching → Both rely on carefully sequenced instruction rather than ad hoc activity selection.
    Example: A Year 8 French lesson on future plans follows a clear arc: recap prior tense (je joue), introduce future meaning through input (je vais jouer), then structured micro-listening tasks followed by macro-listening followed by chunking aloud tasks followed by chunk-level written and oral retrieval practice followed by mini role plays, etc.
  • Small steps → New learning is broken down to manage cognitive load (John Sweller).
    Example: Instead of teaching the full future tense paradigm, the teacher starts with “je vais, tu vas, il/elle/on va + infinitive” only; whilst the other persons are postponed to the end of the instructional sequence.
  • Modelling → Both require clear models; however, in EPI this is primarily realised through Sentence Builders, not teacher exposition. In addition, these make language visible, selectable and processable before independent use.
    Example: A Sentence Builder such as:
    Le week-end, je vais + jouer au foot / regarder un film / sortir avec mes amis + parce que c’est…
    is used for listening, reading aloud, decoding, translation and choral response before students produce language independently.
  • Guided practice → Learners rehearse new material with support before moving towards independence.
    Example: Students manipulate the Sentence Builder to create sentences with controlled variation; moreover, the range of options is gradually expanded.
  • Checking understanding → Frequent formative checks prevent misconceptions from embedding.
    Example: Mini-whiteboards: teacher says “Je vais manger” → students write “future” or “present”; in addition, misconceptions are immediately addressed.
  • High success rate → Tasks are calibrated to ensure learners experience frequent success.
    Example: Listening tasks where only one variable changes (e.g. tense marker), thus ensuring high accuracy and confidence.
  • Active participation → Students are constantly engaged cognitively, not passively listening.
    Example: Whole-class choral translation or rapid-response tasks using Sentence Builders; furthermore, no student remains inactive.
  • Retrieval practice → Recall strengthens memory; however, in EPI it is most powerfully realised during Structured Production, where learners retrieve and recombine language under controlled conditions.
    Example: Students are prompted to produce sentences from partial cues:
    “tomorrow / play football / because fun” → Je vais jouer au foot demain parce que c’est amusant.
    This, in turn, forces retrieval from memory rather than reliance on full models.
  • Review and consolidation → Learning is revisited over time to secure retention.
    Example: Je vais + infinitive reappears weeks later in a holidays topic; moreover, it is embedded within new contexts.
  • Scaffolding → independence → Support is gradually withdrawn as competence increases.
    Example: Sentence Builder → reduced prompts → free GCSE-style writing; in addition, accuracy expectations increase.
  • Clarity of goals → Clear objectives and success criteria guide learning.
    Example: “Today you will understand and say what you are going to do this weekend”; furthermore, success is explicitly defined.
  • Teacher-led guidance → Novices benefit from explicit instruction rather than discovery (Paul Kirschner). It should be noted, however, that whilst explicit instruction is leveraged, it is used in synergy with implicit instruction, which has an equally if not important role in the EPI pedagogical cycle (MARSEARS)
    Example: Teacher provides structured input and modelling rather than asking students to infer rules independently; on the other hand, independence is gradually built later.

Commonalities Table

DimensionShared PrincipleWhat it looks like in practiceResearch anchor
Structured teachingLearning benefits from clear, well-sequenced instructionLessons follow a deliberate progression, not ad hoc activitiesBarak Rosenshine
Small stepsNew learning should be broken down to reduce overloadLimited new material introduced at a timeJohn Sweller
ModellingLearners need clear models of target performanceIn EPI: Sentence Builders; in Rosenshine: teacher demonstration/worked examplesRosenshine; Conti & Smith
Guided practiceLearners need supported rehearsal before independenceScaffolded tasks before freer applicationRosenshine
Checking understandingMisconceptions must be identified earlyFrequent questioning, mini-whiteboardsRosenshine
High success rateFrequent success enhances learningTasks calibrated to avoid repeated failureRosenshine
Active participationStudents must be cognitively engagedConstant response, no passive listeningRosenshine
Retrieval practiceRecall strengthens memoryIn EPI: intensified in Structured Production; in Rosenshine: review routinesRobert Bjork
Review and consolidationLearning must be revisited over timeSpaced review, cumulative practiceRosenshine
Scaffolding → independenceGradual release is essentialSupport fades as competence growsRosenshine
Clarity of goalsClear aims support learningExplicit success criteriaInstructional research
Teacher-led guidanceNovices need explicit instructionLimited unguided discoveryPaul Kirschner

Where They Diverge Fundamentally

Again, each difference below is aligned directly with the table that follows, with examples grounded in an English MFL classroom.

  • Theoretical base → EPI draws on ISLA and psycholinguistics; Rosenshine, on the other hand, draws on general cognitive psychology.
    Example: EPI designs listening tasks to develop parsing (e.g. distinguishing hablo vs habló), not just comprehension.
  • Primary goal → EPI targets long-term language competence; Rosenshine, however, focuses on successful lesson outcomes.
    Example: EPI may delay speaking to ensure retention, even if it feels less “productive” in the lesson.
  • Unit of design → EPI sequences learning processes over time; Rosenshine structures individual lessons.
    Example: A Year 9 scheme ensures past tense reappears across multiple units, not confined to one topic; furthermore, recycling is systematic.
  • Core driver of learning → EPI prioritises input processing; Rosenshine prioritises explanation and practice.
    Example: Students spend extended time interpreting Sentence Builder content before producing; in contrast, Rosenshine would move more quickly to practice.
  • Role of input → Central and sustained in EPI; however, often brief in Rosenshine-informed teaching.
    Example: Multiple listening/reading exposures before any speaking task.
  • Nature of practice → EPI emphasises processing and retrieval; Rosenshine emphasises correct rehearsal.
    Example: Students decide meaning (now vs future), not just repeat sentences; in addition, they recombine language.
  • Timing of output → Delayed in EPI; early in Rosenshine.
    Example: No speaking until sufficient processing has occurred; on the other hand, Rosenshine encourages early guided responses.
  • View of early production → Risky in EPI; acceptable in Rosenshine.
    Example: Avoiding premature pairwork speaking to prevent incorrect automatisation.
  • Role of explanation → Minimal and delayed in EPI; central and early in Rosenshine.
    Example: Grammar explanation follows exposure, not precedes it; furthermore, it is brief.
  • Cognitive load management → Domain-specific and staged in EPI; more general in Rosenshine.
    Example: Limiting simultaneous novelty in vocabulary and structure.
  • View of grammar → Emerges from processing, then systematised in EPI; explained then practised in Rosenshine.
    Example: Students recognise patterns before seeing full conjugations.
  • Retrieval → Core engine in EPI; routine strategy in Rosenshine.
    Example: Frequent timed retrieval tasks driving fluency; in addition, embedded in structured production.
  • Fluency development → Explicitly engineered in EPI; less theorised in Rosenshine.
    Example: 4-3-2 speaking to build automaticity.
  • Curriculum design → Cumulative and interleaved in EPI; largely assumed in Rosenshine.
    Example: Structures recycled across topics over months; furthermore, deliberately interleaved.
  • Attention to form → Engineered through tasks in EPI; less explicit in Rosenshine.
    Example: Tasks where meaning depends on noticing verb forms.
  • Transfer → Explicitly built in EPI (Transfer Appropriate Processing); less foregrounded in Rosenshine.
    Example: Practising exam-style listening mirroring GCSE conditions.
  • Typical classroom → Input-rich and recycling-heavy in EPI; fast-paced and response-heavy in Rosenshine.
    Example: Longer processing phases vs rapid-fire questioning; however, both remain structured.

Differences Table

DimensionEPI (Extensive Processing Instruction)Rosenshine (Principles of Instruction)
Theoretical baseISLA + psycholinguisticsGeneral cognitive psychology
Primary goalDevelop implicit and procedural language competenceEnsure successful lesson learning
Unit of designLearning processes over timeLesson structure
Core driverInput processing and form–meaning mappingExplanation and guided practice
Role of inputCentral and sustainedOften brief and front-loaded
Nature of practiceProcessing + retrieval + recombinationRepetition of correct responses
Timing of outputDelayedEarly
View of early productionRisky (fossilisation, shallow encoding)Necessary and beneficial
Role of explanationMinimal and delayedCentral and early
Cognitive load managementStrongly staged and domain-specificManaged via small steps
View of grammarEmerges from processing, then systematisedExplained then practised
RetrievalCore to automatisationRoutine review strategy
Fluency developmentExplicitly engineeredNot explicitly theorised
Curriculum designCumulative, interleaved, recycling-drivenLargely assumed
Attention to formExplicitly engineered in tasksNot systematically targeted
TransferBuilt via Transfer Appropriate ProcessingLess explicitly addressed
Typical classroomInput-rich, delayed output, heavy recyclingFast-paced, interactive, frequent responses

Conclusion: Same Tools, Different Logic

In conclusion there is a significant overlap between the two frameworks as they are both based on sound educational and theories and research. The overlap explains why both approaches often produce classrooms that look effective. However, the divergence, on the other hand, explains why outcomes—especially in language learning—can differ significantly over time.

Rosenshine gives us a powerful framework for instructional clarity and effective lesson delivery. EPI, in contrast, provides a model for how language learning unfolds cognitively over time, ensuring that teaching aligns with how memory and processing actually work.

Whilst Rosenshine help teachers structure how you teach EPI enables them to control how learning happens over time. If one gets that balance right, one will move from lessons that work… to learning that lasts.

The Vocab Matrix – One of my favourite do-now retrieval tasks

Introduction

When I work with schools to improve their students’ vocabulary retention rates, one thing I consistently notice when observing Do-Now tasks is that they often feel like isolated islands, narrowly focused on the unit at hand.

A Do-Now task should instead recycle new/recent items alongside previously learnt language. For instance, if in Year 8 Term 2 you are teaching Unit 4, you should be deliberately bringing back items from Units 1 and 3 in Term 1—and ideally even from Year 7. This approach is vital not only for combatting the rate of forgetting but, more crucially, for fostering the lexical automaticity required for true spontaneity. By reducing the cognitive effort of retrieval, we free up the working memory for higher-order communication.”

If this is your modus operandi, you will find the Vocabulary Matrix extremely useful. This is a low-tech, high-impact retrieval strategy that forces students out of their comfort zone and into combinatorial thinking. Instead of simply “knowing” words, they begin to build language.

How the Vocab Matrix works: The 4×4 Mix

Draw a 16-square grid on the board (or handout). The secret sauce is interleaving.

  • Top 2 rows → “Old” Year 7 sets (high-frequency anchors)
  • Bottom 2 rows → “New” Year 8 sets (new grammar, concepts, opinions)

If you are as busy as I am, AI can help massively. Just give Chat Gpt or Gemini the right prompts and the grid will be ready in no time.

What students actually do

Students must build sentences by combining items from different rows.

For example:

  • Mis padres reciclan porque es importante
    → My parents recycle because it is important
  • Mi hermano malgasta agua y me molesta
    → My brother wastes water and it annoys me
Columna AColumna BColumna CColumna D
Fila 1: Y7 (Familia)Mi hermanoMis abuelosMi hermana mayorMis padres
Fila 2: Y7 (Ciudad)El parqueMi institutoEl centro comercialLas tiendas
Fila 3: Y8 (Medio Amb.)ReciclarAhorrar energíaMalgastar aguaUsar menos plástico
Fila 4: Y8 (Opiniones)Es preocupanteMe molestaMe parece esencialEs una lástim

The benefits

The rationale for this activity revolves around three key principles:

1. Strategic interleaving

It systematically forces the retrieval of Year 7 vocabulary alongside Year 8 structures. So, you are not just “mentioning” old words—you are making them the core building blocks of new language.

2. Collocation awareness

Students don’t just learn reciclarto recycle. They also learn who or what can realistically recycle:

  • Mis padres reciclan → My parents recycle ✔
  • El parque recicla → The park recycles ✖

They begin to internalise the lexical behaviour of words, not just their meaning.

3. Deep processing

This is not passive work, since, in order to build a sentence, the learner must:

  • Retrieve meaning (Year 7 knowledge)
  • Select the correct form (Year 8 grammar)
  • Combine both into a meaningful message

This kind of effortful thinking—what Robert Bjork calls desirable difficulty—is precisely what makes learning stick.

Conclusions

The Vocab Matrix is an activity which tackles a problem which language teachers often have to contend with: the fact that students’ learning often develops in “Topic Silos.” They can talk about “The Environment” in Year 8, but they’ve “forgotten” how to talk about “Family” from Year 7.

The Vocabulary Matrix just like the Language Gym’s Rock Climbing game, Cumulative Writing and Spiral Sentence builders forces Synthesis. When a student has to use a Year 7 noun as the subject for a Year 8 verb, they are performing “Desirable Difficulty” (Bjork, 1994). This cognitive effort is what moves the language from short-term “recognition” to long-term “proceduralization.” In other words through activity like the Vocab Matrix you are engineering (in under five minutes, with zero technology):

  • retrieval
  • interleaving
  • controlled production and
  • automaticity (if a time limit is imposed)

The 14 Staples of the Extensive Processing Instruction (EPI) Approach

Summary: This article outlines the 14 core staples of the Extensive Processing Instruction (EPI) approach, bridging cognitive science with what I call ’empathetic pedagogy’. It provides a research-validated roadmap for language teachers to move students from initial exposure to spontaneous fluency through a success-first model. The research basis for each ‘staple’ is provided alongside the full list of references.

0. The 14 Staples: A Universal Blueprint for the Heart and Brain

In the modern MFL landscape, I firmly believe that effective teaching must be a meticulous balance of cognitive engineering and human empathy. Hence, while the EPI approach is a research-informed instructional system designed to align classroom practice with the brain’s natural cognitive architecture, it is fundamentally driven by a commitment to the student’s emotional well-being and, in particular, to the development of self-efficacy as conceptualised by Bandura’s (1986)  SCLT.

Fostering self-efficacy (a ‘Can-Do’ attitude) is in my view the primary safeguard against the ‘Cognitive Triad’—the psychological framework Aaron Beck identified as the catalyst for chronic disengagement and learned helplessness:

  1. Negative Views of Self: The learner perceives themselves as linguistically ‘inadequate,’ often attributing temporary struggle to permanent personal deficits.
  2. Negative Views of the Experience: Lessons, homework, and assessments are perceived as an unfair gauntlet of obstacles, leading to cognitive overwhelm.
  3. Negative Views of the Future: The learner anticipates perpetual failure, viewing the path to fluency as hopeless—a mindset that inevitably triggers anxiety and withdrawal.

By prioritizing ‘successful processing’ and the ‘science of success,’ we do more than build linguistic competence; we dismantle the systemic anxiety that has plagued language classrooms for decades. We move students from initial exposure to spontaneous fluency not just through logic, but through the joy of achievement.

The principles below represent a universal blueprint for language acquisition because the biological constraints of human memory remain constant across all learners. Applying these staples globally ensures an inclusive pedagogy where fluency becomes a predictable outcome of a scientifically grounded process, rather than a lucky accident for a select few.”

Figure 1 – Beck’s (1960) Cognitive triad

1. The MARS EARS Cycle

This foundational instructional sequence respects the learner’s cognitive architecture by moving from MARS (Modeling, Awareness-Raising, Receptive Processing, Structured Production) to EARS (Expansion, Autonomy, Routinization, Spontaneity).

  • Research Basis: Rooted in Skill Acquisition Theory (DeKeyser, 2007) and Cognitive Load Theory (Sweller, 2011), which emphasizes managing working memory during the fragile initial stages of proceduralization.

2. Extensive Processing of highly comprehensible input before Controlled Practice

EPI prioritizes “depth over breadth,” favouring “input flooding” where students process a limited set of high-frequency structures hundreds of times across diverse, multi-sensory tasks. Crucially, this occurs within the context of highly comprehensible (>98%) and patterned input. Input flooding ensures repeated processing of target linguistic features; when engineered intelligently—integrating input-enhancement, opportunities for Language-Related Episodes (LREs), and process-based instruction—it facilitates both conscious and subconscious learning.

  • Research Basis: Grounded in Laufer and Hulstijn’s (2001) Involvement Load Hypothesis and Nation’s (2006) research on lexical coverage, alongside Swain and Lapkin’s (1998) work on the efficacy of Language-Related Episodes in consolidating formal accuracy.

3. Sentence Builders

Sentence Builders serve as the primary cognitive scaffold, presenting language in logical, horizontal “chunks.” This allows students to create complex, grammatically correct sentences immediately, bypassing the “grammatical bottleneck.” Consistently with Sweller’s Cognitive Load Theory, they constitute worked examples which provide a clear, step-by-step path to the final linguistic “product.” By explicitly laying out the combinatorial possibilities of the language, they significantly reduce the extraneous cognitive load that occurs when students are forced to search for words and rules simultaneously.

Instead of struggling with the mechanics of sentence construction (the “problem-solving” phase), learners can focus their finite cognitive resources on the mapping of form to meaning. This allows for the successful automation of these structures into long-term memory, effectively transforming a high-effort task into a low-effort retrieval process over time.

  • Research Basis: Supported by Usage-Based Linguistics (Tomasello, 2003) and modern interpretations of Scaffolding (Puntambekar & Hubscher, 2005), providing the frameworks necessary for learners to perform beyond their independent capacity. Also supported by Sweller (2006) on the “worked-example effect,” which demonstrates that learners perform better when they are shown a solution rather than being forced to discover it.

4. Lexical Chunks: The Unit of Acquisition

EPI treats the formulaic sequence or “chunk” as the fundamental building block of acquisition, rather than the individual word. By focusing on these pre-fabricated units, students bypass the high-effort process of assembly-on-the-fly, which is often the primary cause of cognitive overload when learners, who lack a robust mental lexicon, attempt to apply grammatical rules in real-time. This allows for “holistic retrieval,” where phrases are stored and accessed as single cognitive units, drastically reducing the burden on working memory and creating the “lexical agility” required for real-world interaction.

  • Research Basis: Supported by Wray (2002) and Schmitt (2010), who demonstrate that native-like fluency is the mastery of a massive repertoire of pre-fabricated chunks that allow the brain to focus on the message rather than the mechanics.

5. Narrow Reading & Listening

This technique involves exposing learners to multiple, highly similar texts that repeat the same target structures in slightly different contexts. Rather than overwhelming students with one taxing “authentic” document, narrow processing ensures a “flooding” effect that builds deep familiarity, even though the content remains diverse enough to sustain engagement, because the repetition occurs within varied but predictable contexts. This facilitates the incidental acquisition of syntax and vocabulary through high-frequency, comprehensible repetition.

  • Research Basis: Grounded in Lightbown (2014) and Nation and Webb (2011), which confirms that repeated exposure across predictable contexts is the most effective driver of structural “anchor points.”

6. Focus on Receptive Skills First

In EPI, “Input is King.” We prioritize the construction of a deep “receptive base” before demanding output to allow students to develop a robust mental representation of the language. While traditional methods often force premature production, which frequently results in fossilized errors and high anxiety, the EPI model delays output until the learner has internalized the phonological and syntactic code. This avoids performance anxiety and ensures the “depth of intake” is maximized.

  • Research Basis: Anchored in Marsden and Shintani’s (2022) meta-analysis on the superiority of Input-Based Practice (IBP) and Leow’s (2015) cognitive-pedagogical theory.

7. Listening as Modelling (Input Enhancement)

Listening is redefined as the primary vehicle for modeling, employing Input Flood, Input Enhancement, and Process-Based Instruction to explicitly train “listening micro-skills.” This ensures that students move from hearing a stream of noise to segmenting the sound into meaningful chunks, a process that is notoriously difficult for beginners unless the teacher provides structured decoding practice, which focuses specifically on bottom-up processing.

  • Research Basis: Based on Field’s (2008) work on decoding skills and Wong’s (2005) research on making linguistic features salient without interrupting communicative flow.

8. Structured Production: The Scaffolding Bridge

Oral activities are meticulously engineered to move from low-stakes chunking aloud to chunk retrieval and eventually to semi-structured tasks. By scaffolding challenging oral tasks with written priming and pre-planning, we ensure that students are never “thrown into the deep end,” since providing these cognitive supports allows them to focus on pronunciation and fluency, which would otherwise be compromised if they were simultaneously struggling to retrieve vocabulary.

  • Research Basis: Supported by Skehan’s (2009) Cognitive Approach to task design, which emphasizes planning time and task complexity as the dual levers for output quality.

9. Focus on Sound-to-Spelling Correspondence (SSC)

We place a primary emphasis on SSC through micro-listening and chunking aloud. By explicitly teaching the relationship between sound and script from the outset, we provide students with the “keys to the code,” which significantly reduces the likelihood of students mispronouncing written words, as they have been trained to recognize the phonological patterns that underlie the orthography. This builds the awareness necessary for fluent reading and confident speaking.

  • Research Basis: Draws on Ehri’s (2005) phases of sight-word learning and Woore’s (2009) research on the benefits of explicit phonics instruction for beginners.

10. The Cumulative Curriculum: Recycling & Interleaving

The EPI curriculum is a “spiral” where we use cumulative texts that intentionally embed material from all previous units. By building in “curriculum holes” and interleaving different topics, we force the brain to constantly retrieve language, which is essential for creating permanent neural pathways, because language that is not regularly re-activated during the learning cycle will inevitably be lost to the forgetting curve.

  • Research Basis: Supported by the Spacing Effect (Cepeda et al., 2006) and Interleaving Theory (Rohrer, 2012), proving distributed practice is superior for long-term memory.

Figure 2 – Cumulative texts

11. Routinized High-Efficiency Classroom Habits

A high-performance classroom requires high-efficiency habits, utilizing snappy, low-stakes assessments every 6–7 lessons. Consistent classroom routines and Do-Now tasks maximize “time on task,” ensuring that every second of the lesson is dedicated to linguistic processing, although the atmosphere remains supportive because the students find the predictable structure comforting rather than stifling.

  • Research Basis: Aligns with Rosenshine’s (2012) Principles of Instruction and Hattie’s (2009) meta-analysis on high-impact formative feedback.

12. Fluency Training (The Nation Influence)

Inspired by Paul Nation’s (2007) “Four Strands,” we incorporate timed activities to increase retrieval speed. By practicing known language under time pressure, we move to the “automaticity” phase, training the brain to access the lexicon with the rapid-fire speed required for conversation, even if the student is initially nervous about the time constraint.

  • Research Basis: Grounded in Nation (2007) and Segalowitz’s (2010) work, emphasizing that fluency must be practiced using familiar language.

13. Integrated Grammar: Proactive vs. Reactive

Grammar is Proactively housed within Sentence Builders and clarified through Reactive “Pop-up Grammar” moments. This approach ensures that the explanation addresses the “how” and “why” only once the students have developed a mental representation of the structure, a strategy that is far more effective than traditional deductive methods, where students are often taught rules for language they have not yet encountered in a meaningful context.

  • Research Basis: Supported by Macaro’s (2003) research, suggesting grammar is most effective when contextualized and tied to an immediate communicative need.

14. Low-Stakes Retrieval Practice

We use frequent retrieval tasks to pull language from long-term memory, strengthening the neural pathways. This creates a positive feedback loop where successful retrieval causes student anxiety to drop, especially when the tasks are gamified so that students forget they are being tested, which ultimately leads to a surge in self-efficacy and motivation.

  • Research Basis: Rooted in the Testing Effect (Roediger & Karpicke, 2006) and the “Desirable Difficulties” framework of Bjork and Bjork (2011).

Conclusion

To wrap this up, it is essential to remember that while the cognitive engineering of the EPI approach provides the framework, the teacher provides the lifeblood. We aren’t just teaching a language; we are rebuilding the learner’s belief in their own ability to succeed.

By grounding our practice in these 14 staples, we move away from the “hit-or-miss” nature of traditional instruction and toward a universal standard of excellence that respects the limits of the human brain while celebrating its potential. When we harmonize the science of the synapse with the empathy of the classroom, we don’t just create speakers—we create confident, global citizens who no longer view a second language as an impossible barrier, but as a bridge they are fully equipped to cross.


*any moment during a task when learners stop to think about language itself — its form, meaning, or use.

Selected References

  • Bjork, E. L., & Bjork, R. A. (2011). Making things hard on yourself, but in a good way: Creating desirable difficulties to enhance learning. Psychology and the Real World.
  • Cepeda, N. J., et al. (2006). Distributed practice in verbal recall tasks: A review and quantitative synthesis. Psychological Bulletin.
  • DeKeyser, R. M. (2007). Practice in a Second Language: Perspectives from Applied Linguistics and Cognitive Psychology. Cambridge University Press.
  • Field, J. (2008). Listening in the Language Classroom. Cambridge University Press.
  • Hattie, J. (2009). Visible Learning: A Synthesis of Over 800 Meta-Analyses Relating to Achievement. Routledge.
  • Laufer, B., & Hulstijn, J. (2001). Incidental vocabulary acquisition in a second language: The construct of task-induced involvement. Applied Linguistics.
  • Leow, R. P. (2015). Explicit Learning in the L2 Classroom: A Cognitive-pedagogical Theory. Routledge.
  • Lightbown, P. M. (2014). Focus on Content-Based Language Teaching. Oxford University Press.
  • Macaro, E. (2003). Teaching and Learning a Second Language. Continuum.
  • Marsden, E., & Shintani, N. (2022). Input-based practice, production-based practice, and the development of L2 proficiency: A meta-analysis. Applied Linguistics.
  • Nation, I. S. P. (2007). The four strands. Innovation in Language Learning and Teaching.
  • Nation, I. S. P. (2013). Learning Vocabulary in Another Language. Cambridge University Press.
  • Nation, I. S. P., & Webb, S. (2011). Researching Speaking. Heinle Cengage Learning.
  • Puntambekar, S., & Hubscher, R. (2005). Tools for Scaffolding Students in a Complex Learning Environment: What Have We Gained and What Have We Missed? Educational Psychologist.
  • Roediger, H. L., & Karpicke, J. D. (2006). The Power of Testing Memory: Basic Research and Implications for Educational Practice. Perspectives on Psychological Science.
  • Rosenshine, B. (2012). Principles of Instruction: Research-Based Strategies That All Teachers Should Know. American Educator.
  • Schmitt, N. (2010). Researching Vocabulary: A Vocabulary Research Manual. Palgrave Macmillan.
  • Segalowitz, N. (2010). Cognitive Bases of Second Language Fluency. Routledge.
  • Skehan, P. (2009). Modelling speaker performance in second language acquisition. Applied Linguistics.
  • Sweller, J. (2011). Cognitive Load Theory. Psychology of Learning and Motivation.
  • Tomasello, M. (2003). Constructing a Language: A Usage-Based Theory of Language Acquisition. Harvard University Press.
  • Wong, W. (2005). Input Enhancement: From Theory and Research to the Classroom. McGraw-Hill.
  • Woore, R. (2009). Investigating the development of beginner learners’ French pronunciation and sound-spelling knowledge. University of Oxford.
  • Wray, A. (2002). Formulaic Language and the Lexicon. Cambridge University Press.

My upcoming online and face-to-face EPI and non-EPI workshops for Network For Learning

Here is the complete list of sessions I’ll be leading for the University of Bath’s Network For Learning this term.

Note: This doesn’t include my (pretty hectic) international schedule for the Australia tour in May or the European tours running through June to September!

Many of you have reached out recently asking for strategies that “cut across” different teaching styles. While I’m usually deep in the world of EPI, we’ve listened to you and have included a new series of methodology-neutral workshops that provide universal instructional strategies—perfect for any teacher looking to sharpen their toolkit, regardless of their specific pedagogical preference.

For those interested in the EPI accreditation: go straight to the bottom of the page. Why should you be working towards the EPI accreditation? Here are some of the key reasons:

Why Join the EPI Accreditation Journey?

1. Join the Most Rapidly Growing MFL Community in the UK

The data speaks for itself. According to the recent British Council Language Trends survey, the EPI (Extensive Processing Instruction) framework is now the most widely used research-informed methodology in English secondary schools. By becoming accredited, you aren’t just learning a method; you are joining a professional movement that has redefined the national landscape of language teaching.

2. Master the “Science of Teaching” and Learning

EPI is more than just a collection of “fun activities.” The accreditation process dives deep into the cognitive science of how the brain actually acquires language. You will master the MARS EARS cycle, learning how to move students from initial awareness to spontaneous fluency by respecting the limitations of working memory and the power of “chunking” for long-term retention.

3. The “Direct from Source” Advantage

It is a poorly kept secret in the MFL world: many of today’s most prominent MFL bloggers and CPD providers have attended several EPI courses and/or have drawn heavily on its methodology and techniques. While they may have rebranded the strategies or called them by different names, their most successful techniques—from Sentence Builders to Narrow Listening—draw heavily and directly on the EPI framework. Accreditation allows you to bypass the “second-hand” versions and learn the original, high-impact principles directly from the creator.

4. A Blueprint for True Adaptive Teaching

With the removal of the EBacc and the rising diversity of learner needs, “one size fits all” no longer works. EPI accreditation provides a robust toolkit for adaptive teaching. You will learn how to scaffold complex linguistic tasks so that the weakest learners feel successful, while simultaneously providing the high-frequency “lexical agility” that pushes your most able students toward the highest grades.

5. Professional Authority in the New GCSE Era

As the new GCSE specifications move toward a focus on defined vocabulary and high-frequency structures, EPI-trained teachers are already ahead of the curve. Your accreditation proves that you possess the pedagogical “heavy lifting” skills required to deliver results in the current climate, making you a highly sought-after leader in the MFL community.

Here’s the full list with links you can click on for more information and to enroll.

Strategies for Effective Grammar Instruction
                      29 AprilFind out more
                       29 JulyFind out more
__________________________________
From Rehearsed Answers to Exam-Ready Performance – GCSE Speaking Exam
                       8 June Find out more
__________________________________
Writing that Scores – Preparing Students for
 new MFL GCSE Writing

                     11 JuneFind out more
__________________________________
Becoming an EPI Teacher: A Deep Dive into Dr Conti’s Extensive Processing Instruction 
17 June LondonFind out more
22 & 23 JuneFind out more
21 & 22 JulyFind out more
___________________________________
Implementing EPI at KS4 for the new MFL GCSEs
        18 JuneFind out more
__________________________________
Curriculum Design & Lesson Planning in MFL
   29 & 30 JuneFind out more
 25 & 26 AugustFind out more
_________________________________
Becoming an Accredited EPI Teacher – Final Assessment Tutorial
           8 July            Find out more
16 September            Find out more
__________________________________
Applying Memory Research to L2 Vocab Instruction – Classroom & Curriculum Design Strategies for Long Term Retention
              9 JulyFind out more
__________________________________
Phonics in MFL
                28 JulyFind out more
____________________________________
Implementing EPI for Learners in High School
      15 SeptemberFind out more
__________________________________
Becoming an Accredited EPI Teacher Program
Gianfranco Conti | variousFind out more
_______________________________________

From Routine to Ruin? A Critical Analysis of Do-Now Tasks in the Light of Memory Research

Introduction

Let’s start with a bit of honesty, because otherwise this becomes another polite CPD piece that changes nothing… do-now tasks are everywhere, they look good, they tick boxes but in many MFL classrooms they don’t actually move learning forward in any meaningful way… If that sounds a bit blunt it’s because, in my experience observing hundreds of lessons a year across WL, MFL and EFL learning contexts, what we often see is activity masquerading as learning

What are do-now tasks and where do they come from?

A do-now task is simply a short activity students complete as soon as they walk into the room, usually independently and without explanation, typically lasting three to five minutes; the idea was popularised by Doug Lemov in Teach Like a Champion, and it was originally about behaviour and efficiency, not learning in the deep cognitive sense. This is important because a lot of what we now attribute to do-now tasks was never part of the original design.

Only later did people start saying, “Ah, this is retrieval practice”, linking it to the work of Henry Roediger and Jeffrey Karpicke, or to activation of prior knowledge (Ausubel, 1968), and yes, in theory, it can do all of that, but – and this is th key point—only if it is designed with those principles in mind, which, in many cases, it simply isn’t! and this gap between theory and practice is where most of the damage happens…

What do-now tasks can do when they actually work

When they are done properly—and I stress properly, because this is where things usually go wrong—do-now tasks can be extremely powerful, precisely because they hit that moment at the start of the lesson when attention is relatively high and cognitive load is relatively low (Sweller, 2010), which makes them ideal for reactivating prior learning.

They can support retrieval practice, which we know strengthens memory (Roediger & Karpicke, 2006; Karpicke & Blunt, 2011), but only when retrieval is successful and followed by feedback (Butler & Roediger, 2008; Agarwal & Bain, 2019), they can enable spacing, which is critical for retention (Cepeda et al., 2006; Kang, 2016), and they can help students see learning as cumulative rather than a series of disconnected units, which is absolutely crucial in language learning where forgetting is the default state unless we intervene (Nation, 2013).

But there are a few other advantages that are often overlooked, and in my experience these are the ones that make a disproportionate difference when you get them right.

They create desirable difficulty at a low cost, in the sense that a well-designed do-now forces students to think just enough to engage memory without overwhelming working memory, which aligns with Bjork’s notion of “desirable difficulties” (Bjork & Bjork, 2011), and this matters because effortful recall, when it is successful, strengthens learning more than passive review.

They also provide a form of diagnostic assessment in real time, because within two or three minutes you can see very clearly what students can and cannot do, and this can inform the rest of your lesson in a way that end-of-unit tests simply cannot; in other words, they give you actionable data early, which is pedagogically far more useful than summative data late (Black & Wiliam, 1998).

Another benefit, which is rarely discussed but quite powerful, is that they help build fluency under low-stakes conditions, because students are asked to retrieve and produce language regularly without the pressure of formal assessment, and over time this repeated, low-stakes retrieval contributes to automatisation (DeKeyser, 2007), even if each individual task feels quite small.

They can also strengthen metacognitive awareness, because when students repeatedly encounter language they thought they knew but cannot fully retrieve, they begin—slowly, sometimes reluctantly—to recalibrate their own sense of what it means to “know” something, which is a key aspect of self-regulated learning (Dunlosky & Metcalfe, 2009).

And finally, they create lesson continuity, not just in the sense of revisiting content, but in signalling to students that what they learned yesterday, last week, or last term still matters today, and that learning is something that accumulates rather than resets, which, although it sounds obvious, is something many learners do not naturally perceive unless it is made explicit.

Let me give you a simple example of a do-now task that actually works.

Bad version (what I often see):
“Translate: I play football / I watch TV / I go to the park”
→ all from yesterday’s lesson
→ easy, quick, forgotten in 10 minutes

Better version:
“Translate: When I was ten I used to play football / Yesterday I went to the park / Tomorrow I will watch TV”

Now what’s happening?

  • multiple tenses
  • retrieval from different points in the curriculum
  • actual thinking

Even better:
“Write one sentence about the past, one about the present, one about the future using these verbs: giocare, andare, guardare”

Now you’re getting:

  • retrieval
  • manipulation
  • transfer

Same time, completely different impact.

Why they go wrong in MFL (and they really do)

Now here’s where it gets slightly uncomfortable, because the issue is not the idea of do-now tasks, it’s how we use them… and, frankly, how casually we sometimes design them

1. They are too narrow

Most do-now tasks focus on the current unit, which makes sense superficially but is disastrous for retention; and this is why you get the classic situation where students can perform reasonably well in a unit test and then, three months later, that language has vanished as if it had never been taught, which is not a failure of teaching but a failure of retrieval over time (Cepeda et al., 2006; Kang, 2016).

Example of poor practice:
“Translate: I eat pizza / I drink water / I like ice cream”

Better:
“Translate: I ate pizza yesterday / I used to drink water every day / I will eat ice cream tomorrow”

Now you’re forcing:

  • interleaving
  • retrieval across time

2. Retrieval is introduced too early

This is a big one, and it links to recent discussions around the limits of retrieval practice (Carpenter, 2022; Nakatsukasa, 2023; Yang et al., 2021).

Teachers often ask students to retrieve language that has not been sufficiently processed, which leads to:

  • guessing
  • copying
  • fossilising errors

I’ve seen this countless times. Students are introduced to a structure, then imediately asked to produce it from memory, and the teacher thinks, “Great, they’re retrieving”, but what is actually happening is approximation under pressure, which is not the same thing (Sweller, 2010; DeKeyser, 2007).

If they can’t do it with support, they won’t do it without support…

3. They are disconnected from the lesson

This one drives me slightly mad.

The do-now happens, students finish it, it is checked quickly, and then we move on as if it never happened; and at that point you’ve wasted one of the most cognitively valuable moments in the lesson (Rosenshine, 2012).

If you retrieve something, you must use it again.

Otherwise:

  • it goes back into long-term memory
  • and stays there… unused

4. They are too easy (or too mindless)

A lot of do-now tasks are designed to be “quick wins”, but in doing so they become mechanical, and here is the uncomfortable truth:

Effort matters (Bjork & Bjork, 2011)

If students are not thinking, they are not learning.

How to fix them (without reinventing the wheel)

You don’t need to scrap do-now tasks. You need to upgrade them… and the good news is that the fixes are not complicated, they just require intention.

1.Interleave previously learned (secure) items

The aim is to mix language that has already been stabilised in long-term memory, not to force complexity for the sake of it; interleaving strengthens retrieval when items are secure (Cepeda et al., 2006), but if you mix unstable material you simply overload working memory and trigger guessing (Sweller, 2010)… in other words, don’t interleave fragility.

Think:
“What is ready to be retrieved and recombined?”, not “How many tenses can I cram in?”

Year 7 (early KS3 – no interleaving yet, just stabilisation):

  • I play football
  • I watch TV
  • I go to the park

Same structure, high repetition → building form–meaning mapping

Year 8 (emerging interleaving – safe and controlled):

  • I play football every day
  • Yesterday I played football
  • I don’t play football

Mix:

  • present + past
  • affirmation + negation
    BUT only with high-frequency, well-rehearsed verbs

Year 9–11 (full interleaving — now it pays off):

  • When I was ten I used to play football
  • Yesterday I played football with my friends
  • Now I play basketball instead
  • Next week I will play in a match

Now you get:

  • multiple time frames
  • contrast
  • transfer
    because the language is secure enough to handle it

2. Control difficulty

Aim for:

  • ~70% success (Rowland, 2014)
  • not guessing, not trivial

3. Recycle in the lesson

If the do-now includes:
Quando avevo dieci anni giocavo a calcio

Then later:
Ora invece gioco a tennis

That’s transfer.

4. Use partial support

Don’t go from full support → nothing.

Use:

  • sentence builders
  • prompts
  • partial translations

Where do they fit in MARS EARS?

Do-now tasks sit after modelling and processing, not before.

What they are:
retrieval + structured production

What they are NOT:
– initial exposure
– blind production

In other words:

  • language must be seen, heard, processed first
  • THEN retrieved (VanPatten, 2015)

If you reverse that sequence, you are not doing retrieval practice, you are doing guessing practice…which will likely lead to fragile learning

Conclusion

Do-now tasks are not the problem. Poorly designed do-now tasks are. And if students forget what you taught them in Term 1 by Term 3, the issue is not that they are lazy or unmotivated, it is that the system has not forced that knowledge back into working memory often enough, and no amount of engagement or “fun” activities will fix that (Nation, 2013; Cepeda et al., 2006).

So next time you design a do-now, ask yourself: Am I just starting the lesson… or am I making memory work?

Because only one of those leads to learning.

Have We Overdone Retrieval Practice? A Timely Recalibration for Language Teachers

For the past decade or so, retrieval practice has enjoyed near-mythical status in education circles, often presented, somewhat uncritically, as a universal good : test more, retrieve more, and learning will automatically follow. And while the underlying principle is undoubtedly sound, recent research and, in my experience, classroom reality suggest that the picture is rather more nuanced than the slogan would have us believe.

Let us be clear from the outset: retrieval practice DOES work. A robust body of research shows that retrieving information strengthens memory more effectively than re-exposure alone (Roediger & Karpicke, 2006; Karpicke & Blunt, 2011), and in the context of vocabulary learning this has been repeatedly confirmed, with retrieval supporting long-term retention when used appropriately (Nation, 2013; Kang, 2016). However, what more recent syntheses in cognitive psychology and its application to SLA are beginning to emphasise is that retrieval is highly condition-dependent, and that when those conditions are not met, its benefits may be attenuated, or, in some cases, even reversed.

The following three variables, in particular, appear to be critical:

First, success rate: retrieval needs to be largely successful—typically in the region of 60–80%—for it to strengthen memory traces effectively, because repeated failure leads not to learning but to guesswork or disengagement (Karpicke, 2017; Rowland, 2014).

Second, prior encoding: the material must have been sufficiently processed before retrieval is attempted, since asking learners to retrieve poorly encoded information places excessive demands on working memory and often results in the reinforcement of incorrect hypotheses (Sweller, 2010; Baddeley, 2000). Hence the importance of input processing and effective elaborative rehearsal prior to retrieval, a tenet of the EPI methodology, which occurs in the MAR segment of the MARSEARS sequence.

Third, feedback quality: errors must be corrected promptly and clearly, otherwise retrieval risks consolidating inaccuracies rather than strengthening correct representations (Butler & Roediger, 2008).

And here, if one may pause for a moment, lies the pedagogical crux: in beginner and mixed-ability classroomwheres, learners’ lexical and grammatical representations are still fragile, and where processing capacity is easily overwhelmed, these conditions are not always present, and in my observation this is precisely when retrieval tasks become less a tool for consolidation and more an exercise in frustration; and is it not reasonable, therefore, to question whether what we are witnessing in such cases is retrieval at all, or merely the appearance of it?

From a cognitive load perspective, the issue becomes even clearer. Working memory is limited (Baddeley, 2000), and when learners are asked to retrieve language that has not yet been automatised, they must simultaneously search for forms, map them to meaning, and assemble them into coherent output. This operation, for novices, is often simply too demanding. Under such conditions, retrieval may not only fail to support learning but may actually impair encoding, because attentional resources are diverted away from processing input and towards struggling with output (Sweller, 2010; DeKeyser, 2007).

So what does this mean for classroom practice? Not, certainly, that retrieval should be abandoned—far from it—but rather that it should be repositioned within a carefully sequenced instructional framework, one in which learners are first provided with rich, comprehensible input, then guided through structured processing activities that stabilise form–meaning connections, and only then asked to retrieve and produce the language in question. In other words:

input → processing → rehearsal → retrieval

not the other way round, as often happens..

This is not a trivial adjustment. It represents a shift from viewing retrieval as the engine of learning to understanding it as a powerful consolidator of learning, effective precisely because it strengthens representations that are already in place, rather than creating them ex nihilo—an assumption which, though rarely stated explicitly, often underpins premature retrieval tasks.

In conclusion, retrieval practice remains one of the most valuable tools at our disposal, but like all powerful tools it must be used with precision; and if there is one lesson emerging from the most recent research—and, indeed, from careful classroom observation—it is that timing matters as much as technique, because learning is not simply a matter of doing the right thing, but of doing it at the right moment.

References

Baddeley, A. (2000). The episodic buffer: A new component of working memory? Trends in Cognitive Sciences.
Butler, A. C., & Roediger, H. L. (2008). Feedback enhances the positive effects and reduces the negative effects of multiple-choice testing. Memory & Cognition.
Carpenter, S. K. (2022).Retrieval practice. In J. Dunlosky & K. A. Rawson (Eds.), The Cambridge handbook of cognition and education (2nd ed., pp. 347–369). Cambridge University Press. DeKeyser, R. (2007). Practice in a second language: Perspectives from applied linguistics and cognitive psychology.
Kang, S. H. K. (2016). Spaced repetition promotes efficient and effective learning. Policy Insights from the Behavioral and Brain Sciences.
Karpicke, J. D. (2017). Retrieval-based learning: A decade of progress. Nakatsukasa, K. (2023).Retrieval practice in second language vocabulary learning: A research synthesis. Language Teaching Research, 27(4), 987–1008.
Nation, I. S. P. (2013). Learning vocabulary in another language.
Roediger, H. L., & Karpicke, J. D. (2006). Test-enhanced learning: Taking memory tests improves long-term retention. Psychological Science.
Rowland, C. A. (2014). The effect of testing versus restudy on retention: A meta-analytic review. Psychological Bulletin.
Sweller, J. (2010). Cognitive load theory: Recent theoretical advances.

Sentence Builders in Input-to-Output Sequences: Cognitive and Affective Benefits

Introduction

Sentence builders—when used as carefully engineered, input-rich, reusable frames rather than mere substitution tables—offer a powerful convergence of cognitive and affective benefits; and, in my experience, when they are embedded within a coherent input→processing→production sequence rather than deployed as isolated artefacts, they become not just helpful supports but central drivers of acquisition, even if this runs counter to the somewhat fashionable preference for less structured approaches.

Before proceeding, however, I strongly believe it is important—especially for those familiar with my work—to dispel a common misconception: in EPI, sentence builders are not the method. They are, rather, a tool amongst several used to model language, to encourage noticing of key language features, to scaffold elaborative rehearsal, and to assist initial production; and while they play a crucial role within the broader architecture, they must be understood as one cog in the MARSEARS sequence, not the engine itself.

Indeed, and this is something I have observed repeatedly in classrooms where EPI is implemented with fidelity, the language contained within a sentence builder does not constitute the totality of what learners encounter, because during the receptive phases learners are exposed to aural and written input through narrow listening and narrow reading, which extends, enriches, and consolidates the core language, thereby ensuring that the builder functions as a springboard rather than a boundary—and is it not precisely this interplay between structured input and guided output that drives acquisition forward?

Cognitive benefits

1. Reduces cognitive load, frees up processing capacity

Sentence builders externalise part of the linguistic load (lexis, morphology, word order), which—given the limits of working memory (Baddeley, 2000; Sweller, 2010)—allows learners to focus on meaning-making and message construction rather than retrieval from scratch, and in my observation this is precisely what enables lower-attaining learners to remain cognitively engaged rather than overwhelmed, especially when the linguistic demands would otherwise exceed their processing capacity within seconds.

Example
Instead of generating: “I went to the cinema with my friends because…” from nothing, learners select and combine pre-structured chunks → less overload, more successful processing.

Research: Sweller (2010); Baddeley (2000); Paas & van Merriënboer (1994)

2. Supports form–meaning mapping (Input Processing)

By aligning chunks clearly (e.g. je vais + infinitive), sentence builders help learners establish accurate form–meaning connections, reducing reliance on faulty heuristics (VanPatten, 2015), and—this is key, and often underestimated—preventing the kind of mislearning that occurs when learners construct meaning from partial or misleading cues.

Example
je vais regarder consistently mapped to “I am going to watch” across multiple sentences → stable mapping.

Research: VanPatten (2015); Wong (2004)

3. Promotes chunking and automatisation

Frequent exposure to patterned language encourages chunking (Ellis, 2002), and with repeated retrieval, these chunks become automatised (DeKeyser, 2007), which in turn reduces processing time and increases fluency; and one might say, perhaps a little unfashionably, that learners begin to operate through formulaic sequences which, far from being a limitation, constitute the very substrate of fluent language use.

Example
I would like to + infinitive + non or prepositional phrase becomes a single retrievable unit rather than assembled word by word.

Research: Ellis (2002); DeKeyser (2007); Wray (2002)

4. Enables high-frequency recycling (spacing + retrieval)

Sentence builders lend themselves naturally to recycling across lessons, supporting spacing effects (Cepeda et al., 2006) and retrieval practice (Roediger & Karpicke, 2006), both of which are strongly linked to durable learning; and in my experience, when the same builder is revisited across modalities—listening, reading, speaking, writing—the gains are not merely additive but multiplicative.

Example
Monday: comprehension
Wednesday: structured speaking
Friday: written adaptation

Research: Cepeda et al. (2006); Roediger & Karpicke (2006); Kang (2016)

5. Facilitates structured output (scaffolded production)

They provide a bridge from receptive processing to controlled production, ensuring that output is not random but principled and success-oriented, which aligns with skill acquisition theory (DeKeyser, 2007), where guided practice precedes freer use; and in my observation, skipping this stage is one of the most common causes of fragile, error-prone production. They are valuable tools in the elaborative rehearsal phase which should always precede retrieval, i.e. the phase in which students carry out activities such as sentence puzzles, gap-fill, odd one out, word subsitution and other deep processing tasks, etc. with the assistance of sentence builders.

Example
Learners produce accurate sentences before being asked to improvise.

Research: DeKeyser (2007); Anderson (1982)

6. Enhances noticing of patterns (lexicogrammar)

By visually organising language, sentence builders make patterns salient, supporting noticing (Schmidt, 1990) and helping learners internalise colligations and collocations—what Halliday would call lexicogrammar—even if, to the learner, this remains largely implicit and only gradually becomes explicit through repeated exposure.

Example
Seeing play + sport (no article) vs play the + instrument repeatedly.

Research: Schmidt (1990); Halliday (1994); Nation (2013)

Affective benefits

1. Reduces anxiety and increases confidence

Sentence builders lower the affective filter (Krashen, 1985) by giving learners a clear pathway to success; and in my experience, even students who would normally remain silent are willing to attempt production when they feel they are not operating in a vacuum but within a supportive structure.

Research: Krashen (1985); Horwitz et al. (1986)

2. Increases willingness to communicate

Because learners are not starting from zero, they are more likely to take risks and speak, which is essential for developing fluency (MacIntyre et al., 1998), even if the language is initially scaffolded rather than spontaneous in the purist sense.

Research: MacIntyre et al. (1998); Dörnyei (2005)

3. Creates a sense of control and clarity

Ambiguity is reduced: learners know what is expected, what is possible, and how to succeed, which—though it may sound somewhat antiquated—restores a degree of didactic transparency often absent in less structured approaches. A recent research by Kate Trafford (here) found that the use of sentence builders was highly instrumental in tripling her GCSE uptake. The main reason for the students liking sentence builders was that their use make them more confident for the control and clarity they provide.

Research: Hattie (2012); Rosenshine (2012) , Trafford (2023)

4. Promotes inclusivity in mixed-ability classes

Stronger learners can extend and manipulate, weaker learners can follow and succeed, and in my observation this dual accessibility is one of the few ways to maintain both challenge and inclusivity without fragmenting the class into separate tracks.

Research: Tomlinson (2014); Vygotsky (1978)

5. Builds momentum and motivation through success

Success breeds motivation (Dörnyei, 2001), and sentence builders provide frequent, visible success moments which gradually reshape learner self-perception—I can’t do languages becomes I can actually say things, and that shift, once achieved, is remarkably powerful.

Research: Dörnyei (2001); Bandura (1997)

The perils of using sentence builders badly

Let us not pretend that sentence builders are a panacea, because in my experience they can be misused in ways that are not merely ineffective but actively counterproductive, particularly when teachers—often with the best of intentions—shortcut the processing phase and move too rapidly into retrieval or production, thereby forcing learners to produce language they have not yet sufficiently internalised.

Common pitfalls include:

  • Going to retrieval too soon → learners guess rather than retrieve
  • Overcrowded sentence builders → excessive options increase cognitive load rather than reduce it
  • Lack of recycling → structures remain inert and unautomatized
  • Overreliance → learners become dependent if scaffolds are not gradually withdrawn
  • Poor sequencing → jumping from exposure to free production without intermediate stages

And here one might reasonably ask: if a tool designed to reduce cognitive load ends up increasing it, have we not misunderstood its very purpose?

Research: Sweller (2010); DeKeyser (2007); Bjork & Bjork (2011)

Addressing the criticisms

Sentence builders are often criticised on several grounds, and it is worth addressing these explicitly rather than dismissing them, because some contain a kernel of truth even if the conclusion drawn is, in my view, misguided.

“They limit creativity”

This criticism assumes that creativity precedes control, whereas the evidence suggests the opposite: learners require a threshold level of automatised language before meaningful creativity is possible (DeKeyser, 2007; Skehan, 1998); in my experience, asking beginners to be creative without sufficient linguistic resources results not in creativity but in silence or error. As mentioned above, the language contained within a sentence builder does not constitute the totality of what learners encounter, because during the receptive phases learners are exposed to aural and written input through narrow listening and narrow reading and any other texts the teachers deem useful, which extends, enriches, and consolidates the core language, thereby ensuring that the builder functions as a springboard rather than a boundary—and is it not precisely this interplay between structured input and guided output that drives acquisition forward?

“They encourage rote learning”

All learning involves some degree of memorisation, and the question is not whether learners memorise, but what they memorise and how it is processed; sentence builders, when used properly, promote meaningful repetition, not mindless parroting.

Research: Nation (2013); Baddeley (2000)

“They are not communicative”

On the contrary, they enable communication earlier, because they provide the linguistic means to express meaning, even if within constraints; and is it not preferable for learners to communicate accurately within a frame than inaccurately without one?

Research: Long (1996); Ellis (2003)

“Students become dependent on them”

Only if scaffolding is not withdrawn; properly used, sentence builders are gradually faded as learners internalise patterns, much like training wheels on a bicycle—useful initially, unnecessary later.

Research: Vygotsky (1978); Wood, Bruner & Ross (1976)

“They make learning boring”

Only if the teacher doesn’t know how to use them in an engaging way or always uses them the same way.

Conclusions

Sentence builders work because they sit at the intersection of cognitive efficiency and affective support, reducing overload while increasing confidence, stabilising form–meaning mappings while enabling structured production; and in my experience, when they are embedded within a principled instructional sequence—rich input, guided processing, scaffolded output, and gradual release—they do not constrain learners but rather enable them to say more, sooner, and with greater accuracy.

The real question, therefore, is not whether sentence builders should be used, but whether they are being used well, because the difference between effective scaffolding and counterproductive crutching lies not in the tool itself but in its implementation, and that, ultimately, is where professional judgement must prevail. This shines through very evidently in the above mentioned research by Kate Trafford which details a flawless and engaging implementation of EPI, in which the sentence builders are used with great ingenuity and creativity. The result, as she write in the conclusion of her MA dissertation:

“Pupils expressed with enthusiasm their opinions on the use of SBs, particularly in relation to visual organisation, accessibility and clarity of sentence structure. Their opinions directly reflect literature surrounding grammar being taught in context, enjoying being able to use the language to communicate as posited by CLT and the importance of lowering stress on the working and intermediate memory as per Krashen’s comprehensible input theory.”

References

Anderson, J. R. (1982). Acquisition of cognitive skill.
Baddeley, A. (2000). The episodic buffer.
Bandura, A. (1997). Self-efficacy.
Bjork, R., & Bjork, E. (2011). Making things hard on yourself, but in a good way.
Cepeda, N. et al. (2006). Distributed practice in verbal recall tasks.
DeKeyser, R. (2007). Practice in a second language.
Dörnyei, Z. (2001). Motivational strategies in the language classroom.
Ellis, N. (2002). Frequency effects in language processing.
Ellis, R. (2003). Task-based language learning.
Halliday, M. (1994). An introduction to functional grammar.
Kang, S. (2016). Spaced repetition promotes efficient learning.
Krashen, S. (1985). The input hypothesis.
Long, M. (1996). The role of the linguistic environment.
MacIntyre, P. et al. (1998). Willingness to communicate.
Nation, I. S. P. (2013). Learning vocabulary in another language.
Paas, F., & van Merriënboer, J. (1994). Instructional control of cognitive load.
Roediger, H., & Karpicke, J. (2006). Test-enhanced learning.
Rosenshine, B. (2012). Principles of instruction.
Schmidt, R. (1990). The role of consciousness in SLA.
Skehan, P. (1998). A cognitive approach to language learning.
Sweller, J. (2010). Cognitive load theory.
Tomlinson, C. (2014). The differentiated classroom.
VanPatten, B. (2015). Input processing.
Vygotsky, L. (1978). Mind in society.
Wray, A. (2002). Formulaic language and the lexicon.

Target Language First… But Not at All Costs: A Research-Informed Case for Strategic L1 Use in Beginner Mixed-Ability Classrooms

Introduction

If I had a pound for every time I’ve heard “you must stay in the target language 100% of the time”, I’d probably be writing this from a beach somewhere rather than my desk; and yet, in my experience, as appealing as such slogans may appear—succinct, emphatic, and seemingly grounded in common sense—they tend, upon closer inspection, to unravel under the weight of empirical scrutiny.

Let me be clear from the outset: maximising target language (TL) use is pedagogically sound, and in my observation this is borne out both in the literature and in the classroom. A substantial body of research shows that frequent, meaningful exposure to the TL is essential for acquisition (Krashen, 1985; Nation, 2013; Ellis, 2015), since learners require rich input, repeated encounters, and opportunities to process language in context, all of which, when orchestrated judiciously, contribute to the gradual entrenchment of form–meaning mappings. In my own work with the EPI approach, I would argue that input is the engine of acquisition—and therefore the TL must dominate classroom discourse.

However—and this is the crucial point—“maximising TL” is not the same as “TL-only at all costs.”

In beginner, mixed-ability classrooms where all learners share the same L1, rigid TL-only policies can, in my experience, overwhelm working memory, reduce comprehension, increase anxiety, and ultimately disengage a significant proportion of learners, particularly when the linguistic demands exceed their current processing capacity and when no mediating support is available to stabilise meaning; and if that is the case, one might reasonably ask, what exactly are we maximising—exposure or confusion? In such contexts, the L1 is not the enemy of acquisition; it is a cognitive tool—a scaffold, a shortcut, and, dare one say (to use a slightly antiquated turn of phrase), a propedeutic aid.

Let’s unpack this properly, with the research in hand.

1. Comprehensibility: The Non-Negotiable Condition for Learning

One of the most robust findings in SLA is that input must be understood to be useful, a principle so well-established that it borders on the axiomatic. Nation (2013) and Hu & Nation (2000) suggest that learners need to know around 95–98% of the words in a text to process it effectively, which, in my experience, is rarely the case in beginner mixed-ability classrooms unless the input has been very carefully engineered.

In beginner classrooms, TL-only instruction often pushes learners well below this threshold, with the predictable consequence that comprehension falters and processing becomes superficial, fragmented, and at times illusory, as learners cling to isolated lexical items while failing to construct a coherent representation of meaning. The result?

  • Guessing
  • Shallow processing
  • Cognitive overload
  • Disengagement

A brief L1 clarification—ten seconds, no more—can restore comprehensibility and allow learners to actually process the TL input that follows, thereby ensuring that subsequent exposure is not merely auditory noise but cognitively tractable material.

As Nation (2013) reminds us, time spent not understanding is time not learning—and, if one may be permitted a rhetorical flourish, what conceivable virtue is there in incomprehension?

2. Cognitive Load

According to Cognitive Load Theory (Sweller, 2010), working memory is severely limited—typically handling around 4–6 elements at a time (Baddeley, 2000)—a constraint which, though often acknowledged in theory, is in my observation routinely underestimated in practice, particularly when teachers attempt to sustain extended TL-only discourse with novice learners.

Beginner learners are already juggling:

  • New phonology
  • New vocabulary
  • New syntax

When we insist on delivering instructions and explanations exclusively in the TL, we often add extraneous cognitive load, that is to say, processing which does not directly contribute to schema construction but instead taxes limited attentional resources, thereby impeding rather than facilitating learning; and is it not somewhat paradoxical that, in striving to immerse learners in the TL, we may in fact be diminishing their capacity to process it?

The irony is stark: in trying to increase exposure, we may actually reduce effective processing.

A well-timed L1 explanation reduces this extraneous load, allowing learners to allocate their limited cognitive resources to what actually matters: processing the TL input—a point which, though seemingly self-evident, bears reiteration in the current climate of methodological absolutism.

3. Preventing Mislearning

VanPatten’s Input Processing theory (2015) shows that learners rely heavily on default strategies, such as prioritising content words and neglecting morphological markers, particularly when processing capacity is strained; and in my experience, this tendency is exacerbated in TL-only environments where meaning is insufficiently anchored.

And here’s the problem:
👉 Incorrectly learned language is harder to unlearn than correctly learned language is to acquire.

When comprehension is weak, learners generate incorrect form–meaning mappings, which, once established, may fossilise and resist subsequent correction, thereby creating a pedagogical impediment that is far more onerous to address than the initial misunderstanding.

Strategic L1 use can:

  • Clarify meaning precisely
  • Prevent faulty hypotheses
  • Stabilise early representations

In other words, it acts as a quality control mechanism for input processing—an intervention not of indulgence but of prudence.

4. Efficiency and Time-on-Task

Macaro (2001) and Turnbull & Dailey-O’Cain (2009) both highlight that judicious L1 use increases efficiency, a finding which aligns closely with my own observation that time squandered on protracted TL explanations often yields diminishing returns, particularly in heterogeneous classrooms.

A concept that takes two minutes to explain in the TL may take ten seconds in the L1, and while some may argue that those two minutes constitute valuable exposure, one must ask whether such exposure, if only partially understood, is in fact pedagogically efficacious.

This is not about laziness—it is about optimising instructional time, or, to employ a somewhat recherché expression, ensuring economy of cognitive effort.

Every minute saved on clarification is a minute gained for:

  • Input flood
  • Processing tasks
  • Retrieval practice

Ironically, less rigid TL use can result in more total TL exposure—a conclusion which, though counterintuitive to some, is well supported by both research and classroom praxis.

5. Affective Factors: Keeping Students in the Game

Krashen’s Affective Filter Hypothesis (1985) reminds us that anxiety, confusion, and low self-efficacy can block acquisition, and in my experience this is particularly salient in mixed-ability classrooms where weaker learners, unable to access meaning, may withdraw cognitively even if they remain physically present.

TL-only approaches often:

  • Benefit the top 20%
  • Marginalise the rest

Lower-attaining learners, unable to follow, disengage—not because they are unwilling, but because they are locked out of meaning, and once this disengagement sets in, it can be remarkably difficult to reverse, especially if learners begin to perceive the subject as impenetrable.

Research by Macaro (2005) and Levine (2011) shows that appropriate L1 use can reduce anxiety and increase participation, particularly in lower-proficiency learners, thereby lowering the affective filter and facilitating engagement with the TL input.

6. The Power of Cross-Linguistic Comparison

Ellis (2006) and Hall & Cook (2012) highlight the value of explicit comparison between L1 and L2, a process which, in my observation, not only enhances noticing but also accelerates the consolidation of form–meaning relationships, particularly when learners are guided to attend to subtle differences that might otherwise escape their attention.

The L1 provides a powerful reference point for:

  • Noticing differences
  • Avoiding negative transfer
  • Deepening understanding

Example:

  • Mi piace il calcio → “to me pleases football”

Without L1 mediation, this structure can remain opaque for a considerable period, whereas a brief contrastive explanation can render it immediately intelligible—mutatis mutandis, the same principle applies across numerous grammatical structures.

Used well, the L1 is not a crutch—it is a cognitive lever.

7. Code-Switching and the Bilingual Brain

From a psycholinguistic perspective, bilinguals do not “turn off” one language to use another; rather, research shows that both languages are co-activated in the brain (Kroll, Bobb & Wodniecka, 2006; Green & Abutalebi, 2013), a phenomenon which has significant implications for classroom practice, even if it is sometimes overlooked in methodological debates.

This means:

  • The L1 is always present
  • Learners naturally draw on it
  • Suppressing it entirely is neither realistic nor cognitively optimal

Strategic code-switching:

  • Reduces processing effort
  • Facilitates meaning-making
  • Supports lexical access

In other words, using the L1 aligns with how the brain actually works, and to ignore this is, arguably, to disregard a fundamental aspect of human cognition.

8. The Mixed-Ability Reality

Let us not indulge in pedagogical idealisation: in real classrooms, ability is not evenly distributed, and any approach that fails to account for this heterogeneity risks privileging a minority at the expense of the majority.

TL-only instruction often results in:

  • Stronger learners inferring meaning
  • Weaker learners copying or withdrawing

In my experience, strategic L1 use allows us to maintain inclusivity while preserving cognitive challenge, ensuring that all learners can access meaning without diluting the intellectual demands of the task, which, after all, remains firmly rooted in the TL.

So Where Does This Leave Us?

The research does not support:

  • ❌ TL-only dogma
  • ❌ Excessive reliance on L1

Instead, it supports a principled middle ground, one which, though less rhetorically appealing than absolutist positions, is far more consistent with both theory and practice:

  • Maximise meaningful TL input
  • Use L1 strategically and sparingly for:
    • Instructions
    • Clarification
    • Contrastive explanation
    • Repair

Conclusion

The real question is not:

“Should I use the TL or the L1?”

But rather—if I may phrase it thus—

“At this precise moment, what will maximise comprehension, minimise cognitive overload, and lead to better processing of the TL?”

If the answer is ten seconds of L1, then refusing to use it is not principled—it is ideological; and ideology, as we know, is a very poor substitute for learning. Whenever a staunch advocate of TL-only starts giving me their spiel as to why it is so key that the teacher speaks solely or 99% in the TL and that teaching resources are solely or 99% in the TL, I always ask the very same question: is there a chance that one student in the class may be left behind? If so, then it is unethical. Period.

The L1 should be used as pedagogical capital, at the right time, in the right way, in order to scaffold TL use. In EPI, for instance, this is achieved through gradually phasing out the L1 after a substantive input-to-output sequence where English is not spoken, but helps maintaining input-processing comprensibile and retrieval feasible.

Why “Getting It Wrong” Might Be the Best Thing Your Students Do (And Why Most Teachers Still Get This Completely Wrong)

Introduction

Let me start with a slightly uncomfortable truth: in most classrooms, error is treated as something to minimise, avoid, or at best tolerate briefly before rushing to the “correct answer”. And yet, as I purport to show below, a growing body of research suggests something far more provocative: students dont need to get the answer right for retrieval practice to work. In fact, getting it wrong—under the right conditions—can make learning stronger (Kornell, Hays & Bjork, 2009; Grimaldi & Karpicke, 2012; Pan, Hutter & Rickard, 2020).

Whilst this is true, and this is crucial, please note:  it’s not the mistake itself that does the learning; it’s what the mistake triggers in the brain.

The Illusion That’s Holding Us Back

Many of us implicitly believe that Retrieval practice only works if students successfully recall the answer. It sounds logical, doesn’t it? If nothing is retrieved, nothing is strengthened… right? However, research does suggest that this may be wrong. What appears to matter is not success, but rather the effortful retrieval followed by feedback (Roediger & Karpicke, 2006; Metcalfe, 2017).

What’s Actually Going On Under the Hood

When a learner attempts to retrieve something—even unsuccessfully—three powerful processes kick in.

1. The Brain Starts Searching in the Right Place

Even a failed attempt activates related knowledge.

For example:

  • You ask: “How do you say ‘I used to go’ in Spanish?”
  • A student says: “Yo fui… no… iba…?”

They haven’t got it right, but they’ve activated:

  • Past tense forms
  • First-person verb morphology
  • The semantic field of habitual action

When you now reveal “solía ir”, it doesn’t land in a vacuum—it attaches to an already activated network (Kornell et al., 2009).

2. Prediction Error Supercharges Learning

The moment the correct answer appears, the brain detects a mismatch:

“What I thought ≠ what is correct”

That gap—what cognitive scientists call prediction error—is incredibly powerful for memory encoding (Metcalfe, 2017; Pan et al., 2020) by virtue of the cognitive comparison it elicits.

Example:

  • Student guesses: “Je suis allé souvent” for “I used to go”
  • You reveal: “J’allais”

That discrepancy creates a kind of cognitive jolt:

  • “Ah! So it’s imperfect, not passé composé.”

That moment seems to stick far more according to a growing body of research than passive exposure ever could

3. Attention Is Sharpened (Learned Attention in Action)

After a failed attempt, students pay closer attention to the correct form.

Compare:

Scenario A (no retrieval):

  • Teacher: “Today we learn ‘j’allais’.”
  • Student: passively nods

Scenario B (failed retrieval first):

  • Student struggles → guesses wrong
  • Teacher reveals answer

Now the student is thinking:

  • “Why was I wrong?”
  • “What did I miss?”

That’s deep processing, not surface exposure (Schmidt, 1990; Metcalfe, 2017).

But Here’s Where Most Classrooms Get It Wrong

This doesn’t mean:

“Let students guess wildly and hope for the best.”

Because unsuccessful retrieval only works under specific conditions. What you do with the mismatch between the correct and the learner’s wrong answer, is of the essence for the prediction error to lead to meaningful learning.

Example of Poor Practice (Too Much Guessing)

Teacher:


“Guess what ‘der Schmetterling’ means.”

Students:

  • “Dog?”
  • “Car?”
  • “Tree?”

This is not retrieval, this is random guessing. Learning is minimal because there is

  • No prior knowledge activated
  • No meaningful network search
  • No learning advantage (Grimaldi & Karpicke, 2012)

Example of Effective Unsuccessful Retrieval

Students have already:

  • Heard “der Schmetterling fliegt” several times
  • Seen it in a sentence builder
  • Processed it in listening

Now you ask:

“What was ‘butterfly’ again?”

Student:

  • “Der… something… flug… no…”

Then you reveal:

  • der Schmetterling

Now the conditions are right:

  • Prior exposure
  • Effortful search
  • Immediate feedback

That’s where the magic happens (Kornell et al., 2009; Pan et al., 2020).

Classroom Examples You Can Use Tomorrow

1. Pre-question before input (the “productive failure” move)

Before a listening:

  • “How do you think French expresses ‘I used to play’?”

Students attempt:

  • “Je jouais?” / “J’ai joué?”

Then they hear it multiple times in context.

Result: deeper encoding than if you had just told them first (Kornell et al., 2009).

2. Sentence Builder Recall with Deliberate Struggle

After modelling:

  • Cover part of the sentence builder

Prompt:
“Say: ‘At the weekend I used to go to the cinema’”

Student:

  • “Le week-end… je… vais… non… j’allais au cinéma?”

Even if imperfect, the attempt activates structure → correction sticks (DeKeyser, 2007).

3. Narrow Listening with Retrieval Pauses

Play:
“Quand j’étais petit, j’allais au parc…”

Pause:
“What verb did you just hear for ‘I used to go’?”

Student:

  • “Je… allé…?”

Reveal:

  • “j’allais”

Immediate correction after effort = strong encoding (Field, 2008; Vandergrift & Goh, 2012).

4. Retrieval + Feedback Mini-Loops

Instead of:

  • Explain → practise

Do:

  • Attempt → fail → correct → repeat

Example:

  1. “Translate: I used to eat a lot”
  2. Student: “Je mangeais beaucoup?”
  3. Confirm or correct
  4. Immediate second attempt

That loop is where learning consolidates (Roediger & Karpicke, 2006; Metcalfe, 2017).

Conclusion

Retrieval practice does not require success. It requires effort, prior exposure, and feedback. Or, more provocatively: It’s not getting it right that builds memory—it’s trying, failing, and then being corrected that does the real work (Kornell et al., 2009; Pan et al., 2020). If your students are always getting answers right immediately, you may not be strengthening memory at all. You may just be making performance look good in the moment. And as we both know, in language learning:

Performance is not learning (Bjork & Bjork, 2011).