IELTS Speaking Part 2, commonly referred to as the Long Turn, requires candidates to speak continuously for between one and two minutes on a given topic. The candidate receives a cue card with prompting points and is afforded exactly 60 seconds of preparation time before beginning the response. The quality of that preparation window, and the structural framework applied during the speaking itself, directly influences the Fluency and Coherence band score—the first and most heavily weighted criterion in the assessment. This article examines how effective planning during that critical minute, combined with a task-appropriate response structure, distinguishes candidates who achieve Band 7 and above from those who plateau at Band 6.
Understanding the IELTS Speaking Part 2 task and its four assessment criteria
The IELTS Speaking test is administered by a certified examiner and consists of three parts. Part 2 sits between the introductory interview in Part 1 and the abstract discussion in Part 3. The candidate receives a cue card—typically a thick cardstock rectangle—and is instructed to speak for one to two minutes on the subject described. Four prompting bullet points are provided, and the examiner will stop the candidate when two minutes have elapsed or when the response has reached its natural conclusion.
The response is assessed across four equally weighted criteria:
- Fluency and Coherence: the ability to speak at length without unnecessary hesitation, repetition, or self-correction, and to organise ideas in a logical sequence using appropriate cohesive devices.
- Lexical Resource: the range and accuracy of vocabulary deployed, including awareness of register, collocation, and idiom.
- Grammatical Range and Accuracy: the variety and precision of sentence structures, ranging from simple to complex forms.
- Pronunciation: the clarity, intonation patterns, and comprehensibility of spoken English.
The planning techniques and structural frameworks discussed in this article primarily address Fluency and Coherence, which carries the highest raw weighting in examiners' holistic assessment. However, the structures described also create natural conditions for deploying complex grammatical structures and topic-relevant vocabulary, thereby benefiting the other criteria simultaneously.
The 60-second preparation window: high-efficiency planning techniques
The preparation minute is perhaps the most underutilised resource available to IELTS Speaking Part 2 candidates. Most candidates spend this time either writing extensive notes—leaving insufficient cognitive bandwidth for delivery—or doing nothing productive at all. Band 8 and Band 9 candidates consistently approach the preparation minute with a disciplined, focused strategy.
The first priority is to identify the cue card's structural requirement. The four prompting bullet points on any cue card are not random; they follow one of four recurring families: describing a person, describing a place, describing an object, or describing an experience or event. Recognising the family immediately signals the most effective structure to deploy.
The second priority is to allocate the preparation minute across three tasks: identifying the key content points, selecting a clear sequence, and noting only essential keywords. The keyword approach is critical. Writing full sentences during the preparation minute is counterproductive; the candidate must then read those sentences aloud rather than speaking naturally. Instead, a short list of six to eight content words—proper nouns, specific adjectives, action verbs—provides sufficient scaffolding without constraining the response.
A practical exercise for candidates is to practise the preparation minute in isolation. Set a timer for 60 seconds. Read the cue card. Write only keywords. Review the keyword list. The entire sequence should take no more than 45 seconds, leaving a 15-second buffer for composure before speaking begins. With consistent practice over several weeks, this routine becomes automatic.
Structure 1: The four-point linear framework for person, place, and object cue cards
The most versatile and examiner-friendly structure for IELTS Speaking Part 2 is the four-point linear framework, which maps directly onto the four bullet points of the cue card. This structure works reliably across person, place, and object cue cards because those categories naturally accommodate a description in sequential layers.
The opening sentence establishes the subject in broad terms: "I would like to describe my grandmother, who has been the most influential figure in my life." This is followed by four development paragraphs, each addressing one cue card bullet point:
- Background and context: who the person is, where the place is located, or what the object is and when it came into the candidate's possession.
- Appearance or physical characteristics: physical description (for a person), sensory details (for a place), or visual and functional description (for an object).
- Significance or personal connection: why this subject matters, what shared experiences exist, or how the object has been used.
- Feelings, conclusion, or future relevance: the emotional resonance of the subject and, where appropriate, its projected future role.
This structure provides the candidate with a clear mental map. Each paragraph addresses a discrete point, reducing the cognitive load during delivery and ensuring that all four cue card prompts are covered. The logical progression from identification through to emotional reflection mirrors the natural arc of human narrative, which examiners recognise as a hallmark of coherent, well-organised speech.
Structure 2: The chronological narrative for experience and event cue cards
When the cue card falls into the experience or event family—describe a memorable journey, describe a festival you participated in, or describe a time you helped someone—the chronological narrative structure is more effective than the linear descriptive framework. This structure unfolds in time order, using sequencing connectors to move the listener through the experience step by step.
The opening still provides context, but the body follows the temporal sequence: setting the scene, the first phase of the experience, a pivotal moment or turning point, and the resolution or outcome. The closing reflects on the significance of the experience and, ideally, connects to a broader insight or personal value.
Chronological narrative is particularly powerful for achieving high Fluency and Coherence scores because temporal connectors—first, then, after that, subsequently, finally—are among the most natural and varied cohesive devices in spoken English. Candidates who deploy a range of time markers demonstrate coherence without appearing to rely on memorised phrases, which is a common concern among examiners assessing Band 6 responses.
Structure 3: The advantages and comparison framework for abstract and activity cue cards
A smaller but significant subset of cue cards asks candidates to describe activities, hobbies, plans, or abstract concepts. These cards often invite a candidate to describe something you do to relax, a skill you want to learn, or a programme you enjoy watching. For these prompts, the advantages and comparison framework offers a compelling alternative to pure description.
In this structure, the candidate introduces the subject and then develops it through evaluation: what are its distinctive features, what benefits does it offer, and how does it compare to alternatives? For example, a candidate describing a leisure activity might contrast it with other pastimes, explaining why this particular activity resonates more strongly. This evaluative layer adds complexity to the lexical resource and grammatical range, pushing the response beyond the descriptive mechanics of the four-point linear framework.
Common pitfalls under this structure include neglecting the core description in favour of excessive comparison, and failing to personalise the evaluation. Candidates should ensure that the advantages discussed are genuinely their own, not generic observations applicable to any activity of that type.
Discourse markers and cohesive devices: organising ideas without sounding mechanical
The effective use of discourse markers is central to the Fluency and Coherence assessment. However, candidates frequently overuse a small set of connectors—and, but, so—or deploy connectors mechanically, inserting firstly, secondly, finally in a way that sounds scripted rather than natural.
A strong Part 2 response deploys a range of cohesive devices drawn from three functional categories: sequencing devices (initially, following that, as a next step), elaboration markers (in particular, specifically, more importantly), and contrast or concession devices (although, in contrast, unlike). The key principle is variety without affectation. Candidates should consciously build a personal inventory of at least twelve to fifteen cohesive markers and rotate them across practice sessions until deployment becomes instinctive.
Examiners are trained to recognise recycled phrases and formulaic openings. A response that begins every paragraph with regarding the first point signals rehearsal rather than genuine organisation. Spontaneous, contextually appropriate use of connectors—what really struck me was, that particular moment, looking back on it now—demonstrates the natural discourse control that characterises Band 8 performance.
Managing the two-minute delivery: pacing, hesitation, and self-correction
With the preparation minute optimised and the structural framework selected, the candidate must manage the delivery itself. The two-minute window is sufficient to address all four cue card prompts and deliver a concluding statement, provided pacing is carefully controlled.
One common error among Band 6 candidates is rushing through the first two bullet points in exhaustive detail, only to discover that one minute has elapsed with two points remaining. The solution is disciplined time allocation: approximately 25 to 30 seconds per main point, with a slightly longer opening (15 to 20 seconds) to establish context and a brief closing (10 to 15 seconds) for reflection. This distribution ensures that no point is underdeveloped while none dominates the response to the exclusion of others.
Hesitation is a natural feature of spoken language, and the IELTS scoring criteria allow for it. A momentary pause to think—signalled by let me think about this for a moment or a brief silence—does not automatically reduce the Fluency score. What examiners penalise is unnecessary hesitation: long pauses mid-sentence, false starts followed by restarts, and repetition of the same phrase as a buying-time strategy. Candidates should practise tolerating brief pauses without over-apologising or self-interrupting.
Self-correction, similarly, is assessed on its impact on communication rather than its mere occurrence. A candidate who says "I went to—visited—Paris last summer" has self-corrected efficiently, and the examiner records no fluency penalty. A candidate who restarts the same sentence three times, by contrast, demonstrates difficulty maintaining the coherence of the spoken output.
Common structural mistakes and how to avoid them
Several recurring structural errors consistently limit candidates to Band 6 in the Fluency and Coherence criterion. Identifying and actively correcting these patterns is an efficient preparation strategy.
The first mistake is providing equal depth across all four bullet points without establishing thematic priority. Not all cue card points are equally productive. A point such as and explain why this person is important to you naturally invites more developed, emotionally resonant speech than describe their appearance. Candidates who allocate time proportionally across all four points—rather than following the natural richness of the content—produce responses that are structurally complete but intellectually thin.
The second mistake is introducing irrelevant digressions. Candidates who are uncertain how to develop a particular point may pivot to an unrelated anecdote or tangential observation. While a brief illustrative example enriches a response, a two-minute tangent away from the cue card topic signals poor coherence. Every sentence in the Part 2 response should either address a prompting point directly or serve as a natural bridge between two points.
The third mistake is concluding abruptly. Many candidates, uncertain whether they have spoken for long enough, stop mid-thought after covering the four points. A well-structured response includes a closing sentence that draws the content together: a reflection, a forward-looking statement, or a simple restatement of the subject's significance. This closing segment, even if brief, signals completeness and control to the examiner.
Structure comparison across cue card families
Different cue card families benefit from different structural approaches. The table below summarises the recommended primary structure and the most effective cohesive markers for each family.
| Cue Card Family | Recommended Structure | Key Cohesive Devices | Typical Opening Pattern |
|---|---|---|---|
| Describe a person | Four-point linear (background, appearance, significance, feelings) | who, whose, particularly, in terms of | Introduce identity and relationship |
| Describe a place | Four-point linear with sensory focus (location, features, atmosphere, personal connection) | located in, situated, as you enter, what strikes you | Name the place and state its relevance |
| Describe an object | Four-point linear (acquisition, appearance, function, significance) | bought from, initially, over time, what I appreciate most | Identify the object and its origin |
| Describe an experience or event | Chronological narrative (setting, sequence, turning point, outcome) | first, then, after that, eventually, looking back | Set the scene and the time frame |
| Describe an activity or plan | Advantages and comparison (introduction, features, benefits, comparison) | unlike, compared to, what appeals to me, in contrast | Name the activity and state your involvement |
These structures are not rigid templates to be followed mechanically. They are frameworks that provide mental scaffolding, allowing the candidate to navigate the response with confidence. Flexibility within the framework—such as combining chronological narrative with brief evaluative commentary on a person cue card—is not only permitted but actively rewarded when it demonstrates sophisticated discourse management.
Preparing with model responses and progressive complexity
The most effective preparation strategy combines structural drilling with progressive complexity. In the early stages of preparation, candidates should practise responding to cue cards using the appropriate structure in a low-pressure environment, prioritising completeness and logical sequencing over lexical sophistication. Recording these responses and reviewing them against the assessment criteria is more valuable than any passive study of sample answers.
As confidence develops, candidates should introduce lexical and grammatical complexity into the established structural framework. This means replacing general vocabulary with more precise, contextually appropriate terms, and incorporating complex sentence structures such as relative clauses, participial constructions, and conditional forms within the same coherent framework. The structure remains the same; the language quality elevates.
One useful exercise is to take the same cue card and deliver three responses: one targeting Band 6 language quality, one targeting Band 7, and one targeting Band 8. Comparing the three recordings reveals how structural integrity and language quality interact, and where the threshold between bands lies in practice rather than in theory.
Conclusion and next steps
The IELTS Speaking Part 2 Long Turn rewards disciplined preparation and flexible structural thinking. The 60-second preparation window should be treated as a strategic asset, not a period of anxious note-writing. By categorising the cue card family, selecting the appropriate structural framework, and allocating time carefully across the four prompting points, candidates can approach the two-minute delivery with confidence and coherence. The cohesive markers, pacing strategies, and closing conventions discussed here, when practised consistently, form the behavioural foundation of Band 7 and above performance in Fluency and Coherence. TestPrep's complimentary diagnostic assessment offers a natural starting point for candidates seeking a sharper preparation plan tailored to their current band level and specific development areas in Part 2 delivery.
Effective preparation for IELTS Speaking Part 2 is not about memorising model answers. It is about building a flexible, task-responsive structural toolkit and developing the spoken habits that signal coherence, fluency, and linguistic maturity to a trained examiner.