TOEFL Speaking Task 3 requires candidates to synthesise information from two distinct sources: an academic reading passage and a lecture that either supports, contradicts, or complicates the written text. Unlike independent speaking tasks, this integrated format evaluates not only language proficiency but also the ability to identify relationships between sources and construct a coherent verbal response within 60 seconds. Understanding the response architecture that examiners reward — and the structural failures that cap scores at Band 3 — is essential for systematic preparation.
Understanding TOEFL Speaking Task 3: the integrated format in context
Speaking Task 3 belongs to the integrated section of the TOEFL iBT Speaking module, where candidates encounter four tasks that combine two or more input modes. Tasks 3 and 4 present reading material followed by a listening passage, and candidates must reference both sources in their response. Task 3 specifically uses an academic context — typically a excerpt from a university-level textbook or article — and a professor's lecture that illustrates, debates, or applies the concept introduced in the reading.
The task prompt instructs candidates to explain the speaker's opinion about the reading passage and to identify the specific points the speaker uses to support that opinion. This dual requirement — restating the reading concept and then tracking the speaker's position relative to it — defines the structural demands of the response.
- Candidates receive 45 seconds to read a short academic passage (typically 75–100 words).
- A 60–80-second lecture follows, during which the candidate must take notes.
- The preparation time before speaking is 30 seconds; speaking time is 60 seconds.
- Responses are scored on delivery, language use, and topic development on a 0–4 scale.
The reading passage: identifying signal phrases and key claims
Most TOEFL Speaking Task 3 reading passages follow a predictable academic structure. A concept or theory is introduced, a definition or explanation follows, and the passage closes with either an application or a theoretical implication. Understanding this structure allows candidates to identify the two or three core claims they will need to reference in their response.
Signal phrases that frequently appear include terms such as "the theory suggests," "research indicates," "according to the passage," and "the concept of." These phrases flag the main claim. Candidates should train themselves to distinguish between the central hypothesis — what the passage is arguing — and supporting evidence, which explains or justifies that hypothesis. The lecture will typically complicate, exemplify, or challenge one or more of these supporting points, so capturing both the main claim and its supporting structure during reading is critical.
A common error is spending too much time on minor details — specific statistics, dates, or examples — at the expense of missing the overarching argument. While noting these details is useful, the response must anchor to the central claim and then show how the speaker reacts to it. Preparation strategies should include timed reading drills that force candidates to identify the main claim within the first 15 seconds of reading.
The lecture: tracking the speaker's position relative to the reading
The professor's lecture in Task 3 does not merely repeat the reading content; it creates a rhetorical relationship with the written text. Understanding that relationship is the decisive factor in response quality. The lecturer may support the reading with additional examples, challenge the reading's assumptions with counterevidence, apply the reading's theory to a specific case study, or present a nuance that the reading omitted.
Active listening strategies during the lecture should focus on three questions: What is the speaker's attitude toward the reading? Which specific point from the reading does the speaker address? What evidence or reasoning does the speaker provide? Candidates should use a two-column note-taking format — one column for reading claims, one for lecture responses — to map the relationship in real time.
Transitional phrases that indicate the speaker's stance include "however," "on the other hand," "in contrast," "for example," "building on this idea," and "this theory fails to account for." Noting these markers helps candidates structure the response logically: first establish what the reading said, then explain what the speaker does with that information, and finally state the speaker's position.
Response architecture: building a coherent 60-second answer
The 60-second speaking window demands a tight, disciplined structure. Examiners evaluate whether the response addresses all elements of the prompt — restating the reading's concept, identifying the speaker's opinion, and citing the speaker's supporting points. A well-structured response follows a three-part architecture that leaves no ambiguity about the speaker's position.
The opening sentence should state the speaker's opinion clearly and directly, using the phrase "The speaker [agrees/disagrees/challenges/expands on] the reading's claim that [restate claim in candidate's own words]." This eliminates any possibility of the examiner wondering where the response is heading.
The body of the response should then present the speaker's supporting evidence in the order it appears in the lecture. Each point should be introduced with a clear transition — "First, the speaker argues that..." and "Additionally, the speaker points out that..." — and should include at least one specific detail from the lecture. The closing sentence should restate the overall relationship between the reading and the lecture in a single sentence, confirming the speaker's position.
| Response component | Function | Approximate time allocation |
|---|---|---|
| Opening claim statement | Establishes speaker's position relative to reading | 8–10 seconds |
| First supporting point | Introduces first lecture example/reasoning | 12–15 seconds |
| Second supporting point | Introduces second lecture example/reasoning | 12–15 seconds |
| Closing synthesis | Confirms overall relationship and position | 8–10 seconds |
Timing is a common pitfall. Candidates who spend the first 20 seconds re-explaining the reading in excessive detail leave insufficient time for the lecture content, which is the more heavily weighted component. The reading should be restated concisely — one sentence is sufficient — and the majority of the response should focus on the speaker's analysis and evidence.
Scoring criteria: what Band 4 looks like versus Band 3
The TOEFL Speaking scoring rubric for integrated tasks evaluates three dimensions: delivery, language use, and topic development. Each dimension is assessed holistically, meaning no single error automatically reduces a score; rather, the overall impression of competence determines the band.
A response that earns a Band 4 demonstrates consistent fluency and clarity. Pronunciation is clear with appropriate intonation patterns, and hesitations are minimal and natural rather than structural. The language use shows sophisticated control of complex grammatical structures — conditional sentences, relative clauses, and passive voice are deployed accurately. Topic development is thorough: the reading concept is accurately summarised, the speaker's position is stated without ambiguity, and both supporting points from the lecture are developed with relevant detail.
A Band 3 response typically shows adequate fluency but with more frequent mispronunciations or unnatural pauses. Grammatical control is solid for simple sentences but becomes less accurate when attempting complex structures. Topic development covers all required elements but may restate the reading in language very close to the source text — examiners penalise over-reliance on direct quotation — or may omit a minor detail from one of the lecture points.
The key distinguishing factor is specificity. Band 4 responses use precise language and include concrete details from both sources; Band 3 responses tend toward vague generalisations. For example, a Band 4 response would state "The speaker argues that the theory's prediction failed in the experiment, citing the results from the second trial," while a Band 3 response would say "The speaker talked about the theory and said it didn't work well." Precision signals comprehension.
Common pitfalls and how to avoid them
Several recurring errors consistently limit Task 3 scores below Band 4. Identifying these patterns and drilling against them is one of the most efficient preparation strategies available.
The first major pitfall is failing to state the speaker's position explicitly. Candidates who begin with "The reading talks about..." and never directly say whether the speaker agrees or disagrees leave the examiner guessing. Even when the speaker's position is obvious, stating it clearly is a prerequisite for a high score. The opening sentence must contain the speaker's stance.
The second pitfall is imbalance between reading and lecture content. Over-summarising the reading — spending 25–30 seconds on background before reaching the lecture material — is one of the most common structural errors. A reasonable guideline is to dedicate no more than 15 seconds to restating the reading concept, leaving 40–45 seconds for the speaker's position and supporting evidence.
The third pitfall is paraphrase failure. Candidates who quote the reading or lecture directly — repeating phrases such as "according to the passage" or "the speaker said" without rephrasing — demonstrate limited language use. The response must restate information in the candidate's own words while maintaining accuracy. Paraphrasing ability is explicitly assessed in topic development.
The fourth pitfall is neglecting transitions. Responses that list lecture points without linking them to the reading claim — or without indicating which point comes first and second — read as disorganised. Clear transition phrases signal coherence and are a low-effort, high-reward addition to any response.
- State the speaker's position in the first sentence.
- Restate the reading concept in one sentence maximum.
- Introduce the first lecture point with a transition phrase.
- Develop the first point with a specific detail from the lecture.
- Introduce the second lecture point with a transition phrase.
- Develop the second point with a specific detail from the lecture.
- Close with a single sentence confirming the overall relationship.
Preparation strategies: building Task 3 competence systematically
Improving Task 3 performance requires both language development and specific test strategy. A layered preparation approach — building foundational skills before adding test-specific techniques — produces more durable results than intensive drilling alone.
The first layer is academic note-taking practice. Candidates should regularly engage with university-level audio content — documentary lectures, educational podcasts, academic conference talks — and practice extracting main claims and supporting details under timed conditions. The goal is to develop the automatic skill of identifying structure in real time, without needing to replay or停顿.
The second layer is integrated practice using official TOEFL materials. The speaking practice sets in TOEFL Practice Online (TPO) include authentic Task 3 prompts. Each practice session should simulate test conditions exactly: read the passage, listen to the lecture, take notes, and respond within the 30-second preparation window. Recording and self-scoring against the rubric builds awareness of personal patterns.
The third layer is targeted error correction. After each practice response, candidates should identify whether errors were in delivery, language use, or topic development. Delivery errors — mispronunciations, unnatural intonation — require phonetic drills. Language use errors — tense inconsistency, article misuse — require targeted grammar practice. Topic development errors — missing a lecture point, being too vague — require structural revision drills.
Working with a qualified instructor for periodic scoring feedback is highly recommended for candidates targeting Band 25 or above. Self-assessment of speaking responses is inherently limited; an experienced scorer can identify patterns invisible to the candidate and prescribe specific corrective exercises.
Distinguishing Task 3 from the other integrated speaking tasks
The TOEFL Speaking section contains four tasks, and understanding the distinctions between them prevents candidates from applying the wrong strategy to Task 3. Each task has a different input combination, prompt structure, and evaluative emphasis.
Task 1 (Independent) requires candidates to draw solely on personal experience to respond to a question such as "Choose a place you enjoy studying and explain why it is conducive to concentration." No reading or listening is involved.
Task 2 (Integrated — campus situation) presents a reading passage about a campus policy or proposal, followed by a conversation in which two speakers discuss their reactions. Candidates must synthesise both sources. The topic is everyday campus life rather than academic content.
Task 3 (Integrated — academic) presents an academic reading passage followed by a lecture. The academic context means vocabulary may be more specialised and concepts more abstract than in Task 2.
Task 4 (Integrated — academic lecture) has no reading passage; candidates listen only to an academic lecture and must summarise the main points and supporting examples. The absence of a reading passage means candidates must capture more detail from the listening alone.
Task 3's unique demand is the ability to manage two sources simultaneously and demonstrate comprehension of the relationship between them. This is why the response architecture — restating the reading, then tracking the lecture's response — is the critical skill to develop.
Conclusion and next steps
TOEFL Speaking Task 3 rewards candidates who can accurately extract information from an academic reading passage, actively track a lecturer's position and evidence during the listening phase, and synthesise both sources into a well-organised 60-second response. The architecture is learnable: stating the speaker's position first, restating the reading concisely, developing two lecture points with specific detail, and closing with a synthesising sentence creates a response that meets all Band 4 criteria. Practice under timed conditions, with systematic error correction, is the most effective path to consistent performance on this task.
TestPrep's complimentary 30-minute speaking diagnostic identifies Task 3-specific weaknesses and maps them to a targeted study plan. Candidates who understand what distinguishes a Band 4 response from a Band 3 response — and who practice deliberately against those specific criteria — are best positioned to achieve their target scores.