IB Economics HL Paper 2 is a data response examination worth 50% of the overall HL mark. The paper presents candidates with unseen statistical evidence, news excerpts, or economic data, followed by four questions of escalating demand. While many candidates prepare thoroughly for content recall and diagram application, the evaluation criterion consistently proves to be the differentiating factor between a 5 and a 7. Understanding precisely what examiners reward under the evaluation strand—and having deployable structures to generate quality evaluation on demand—transforms Paper 2 performance more reliably than any last-minute content review.

What the evaluation criterion actually measures in Paper 2

The assessment objectives for IB Economics HL Paper 2 allocate approximately 40% of marks to evaluation, analysis, and critical thinking. When examiners apply the markbands for evaluation, they are looking for evidence that a candidate can move beyond description and application toward judgement. This is not simply a matter of appending a concluding sentence that says 'in conclusion' or 'therefore.' True evaluation in the economics sense means demonstrating an understanding that economic propositions are conditional, contested, and context-dependent.

The markband descriptors use phrases such as ' reasoned evaluation of proposals,' 'well-substantiated conclusions,' and 'recognition of limitations and alternative viewpoints.' This language signals that evaluative writing must be substantive. A single sentence appended to an otherwise descriptive answer does not satisfy this criterion, regardless of how insightful that sentence appears to the candidate.

Candidates frequently ask whether evaluation matters more at higher mark questions. The answer is nuanced. Evaluation is assessed across all questions on Paper 2, but its relative weight increases as question marks rise. An evaluation-quality paragraph in a 4-mark question may earn partial credit, while the same quality of evaluation in a 10-mark question can be the difference between level 4 and level 5 achievement.

The four core evaluation structures for Paper 2

Experienced examiners observe that high-scoring candidates tend to deploy recognisable evaluative frameworks consistently. These are not rigid templates, but rather intellectual approaches that generate genuine economic judgement. Four structures prove particularly effective for Paper 2 data response contexts.

Structure 1: Strengths and limitations of the economic concept or policy

This is the most fundamental evaluation structure and the one examiners encounter most frequently. For any economic theory, model, or policy measure presented in the data response, candidates should be prepared to articulate both its explanatory power and its shortcomings. For instance, when evaluating a fiscal policy measure described in the stimulus material, a candidate might assess its multiplier effect alongside the risks of crowding out or inflationary pressure.

The key is specificity. General statements such as 'there are advantages and disadvantages' earn no additional credit beyond what description would earn. The candidate must identify particular mechanisms through which the concept succeeds or fails. Why does the fiscal stimulus work less effectively in this context? Which limitation of the AD-AS model makes it less applicable here? What assumption underlying the policy is being violated?

Structure 2: Short-run versus long-run considerations

Many economic phenomena operate differently across time horizons, and Paper 2 stimulus materials frequently contain data that span multiple periods. Candidates who explicitly invoke a short-run/long-run distinction demonstrate sophisticated economic thinking. For example, a trade liberalisation measure might generate immediate consumer surplus gains while producing structural adjustment costs over a longer period.

This structure is particularly powerful because it forces the candidate to engage with dynamics rather than static snapshots. It also naturally leads to conditional conclusions: the evaluation becomes 'this policy is effective in the short run but faces increasing difficulties over time' or 'the short-run costs may be justified by long-run welfare gains.' Such conditionality is precisely what the evaluation criterion rewards.

Structure 3: Stakeholder impact analysis

Economic policies and market outcomes affect different groups differently. A stakeholder evaluation structure requires the candidate to identify relevant groups—consumers, producers, workers, government, specific industries, developing versus developed nations—and assess the differential impacts. This approach is especially valuable when the stimulus material presents aggregate statistics that may mask distributional effects.

When applying this structure, candidates should avoid listing stakeholders without analysis. The evaluative insight comes from comparing outcomes across groups and acknowledging that a policy that improves aggregate welfare may simultaneously harm vulnerable populations. This kind of nuance distinguishes a level 5 response from a level 4 response.

Structure 4: Alternative perspectives and competing economic theories

At the higher end of the markband, evaluation demonstrates awareness that economics is not a monolithic discipline. Different schools of thought offer competing interpretations of the same phenomenon. A candidate might evaluate a monetary policy response by contrasting Keynesian and monetarist perspectives, or assess a market intervention by comparing welfare outcomes under different assumption sets.

This structure requires some theoretical breadth, but Paper 2 candidates who have studied the IB economics syllabus possess sufficient background. HL students encounter demand-side and supply-side analysis, short-run and long-run aggregate supply, and various perspectives on inflation and unemployment. Deploying this knowledge in evaluation signals mastery rather than mere familiarity.

Locating evaluation opportunities in the stimulus material

A common weakness is evaluating the wrong things or evaluating at the wrong level of abstraction. The stimulus material in Paper 2 is the candidate's primary source, and evaluation should emerge from engagement with that material, not from generic knowledge deployed independently. High-scoring candidates develop a systematic approach to identifying evaluation opportunities as they read.

Begin by identifying every economic concept, model, or policy mentioned explicitly in the data. These become your evaluation targets. For each target, ask three questions: Does this concept have known limitations? Are there conditions under which this policy might not work as predicted? Are there affected groups or time periods that the data might be obscuring?

The stimulus material often contains apparent contradictions or tensions that invite evaluation. If one data series shows growth while another shows increasing inequality, that tension is an evaluation opportunity. If a policy is described as successful in one paragraph but faces implementation challenges in another, that discrepancy is evaluatively significant. Train yourself to notice these structural features of the stimulus material rather than treating each paragraph as an isolated fact to be memorised.

The 8-mark and 10-mark questions: where evaluation determines the grade boundary

Paper 2 questions typically follow a progression: 4 marks for definition and identification, 6 marks for explanation, 8 marks for analysis and application, and 10 marks for extended evaluation. The highest-mark questions are where the greatest differentiation occurs, and evaluation is the primary vehicle for achieving marks in the upper levels of the markband.

An 8-mark question typically requires two to three paragraphs of well-structured analysis followed by a brief evaluative synthesis. A 10-mark question demands more sustained evaluation—often a dedicated paragraph or two that weighs evidence, acknowledges limitations, and reaches a substantiated conclusion. Candidates who treat the 10-mark question as simply 'more of the same' from the 8-mark question miss the opportunity to demonstrate the depth of evaluation that distinguishes level 5 from level 6 achievement.

When approaching extended questions, reserve the final section explicitly for evaluation. Do not allow evaluation to be absorbed into analysis paragraphs where it may be overlooked. A clearly signposted evaluation section—perhaps beginning with 'However, several factors limit the validity of this conclusion'—ensures examiners can identify your evaluative contribution precisely.

Common pitfalls and how to avoid them

The following errors appear repeatedly in Paper 2 responses across examination sessions. Each has a specific remedy that candidates can implement in their preparation.

Conclusion-only evaluation: Writing a single sentence at the end of an answer that reads like a verdict. This does not demonstrate sustained evaluative thinking. Fix: dedicate at least a paragraph to evaluation, not merely a sentence.
Generic evaluation: Using phrases such as 'there are pros and cons' or 'it depends on many factors' without specifying which factors or which pros and cons. Fix: always be specific. Name the particular mechanism, assumption, or condition that generates the limitation.
Evaluation of the question rather than the answer: Some candidates evaluate whether the stimulus material is a good source or whether the question is well-posed, rather than evaluating the economic content of their own response. Fix: direct all evaluation toward the economic concepts, policies, or models under discussion.
Ignoring the data: Deploying evaluation that could have been written without reading the stimulus material at all. Fix: ground every evaluative point in specific evidence from the data. Quote figures, compare data points, or reference particular claims from the stimulus.
Unbalanced evaluation: Offering only weaknesses of a position without acknowledging legitimate strengths, or vice versa. Fix: aim for genuine weighing rather than one-sided criticism. The most credible evaluation acknowledges what works and explains why the limitations ultimately outweigh those strengths.

Diagrams as evaluation tools: a frequently misunderstood opportunity

Many candidates treat diagrams purely as descriptive or illustrative tools—something to include because the syllabus mentions them or because the question asks for them. However, diagrams can serve significant evaluative functions in Paper 2 responses. Understanding how to use diagrams evaluatively adds another dimension to a candidate's answer.

A correctly drawn and labelled diagram demonstrating a deadweight loss can evaluate the inefficiency of a market intervention. A diagram showing shifts in aggregate demand and supply can evaluate the likely effectiveness of a policy under different assumptions. A comparative diagram illustrating two equilibria can evaluate the welfare implications of a price ceiling versus a price floor.

The critical principle is that the diagram must be integrated with evaluative prose. A standalone diagram that is neither referenced in the text nor explained in terms of its economic significance earns marks only for the diagram itself. When the text explicitly states what the diagram demonstrates—'this is shown by the shaded area in Diagram 2, which represents a welfare loss of X'—the candidate creates a coherent analytical narrative that satisfies both the analysis and evaluation criteria.

Planning time allocation in the examination room

Paper 2 comprises 40 marks across four questions, and candidates have 1 hour and 30 minutes to complete it. This implies approximately 22 minutes per question on average, but the distribution should be uneven. Higher-mark questions warrant more time because they carry greater weight and because evaluation is more time-intensive than description.

A practical framework is to spend 3 to 4 minutes planning before writing, with particular attention to identifying the two or three strongest evaluative points available for each question. Attempting to evaluate everything leads to shallow evaluation. Selecting the strongest two or three points and developing them fully is a more effective strategy.

During the examination, resist the temptation to write a long descriptive passage and then append evaluation. Instead, interweave evaluative insights throughout the response where they arise naturally, and reserve the final paragraph for a synthesising evaluative conclusion that weighs the various factors discussed. This approach demonstrates consistent evaluative engagement rather than evaluation as an afterthought.

Connecting Paper 2 evaluation skills to broader IB Economics preparation

The evaluation skills developed for Paper 2 transfer directly to the extended essay in economics and to Internal Assessment commentary writing. The ability to construct a reasoned argument, acknowledge limitations, weigh competing perspectives, and reach substantiated conclusions is assessed across all components of the IB Economics HL course. Investing time in mastering evaluation for Paper 2 therefore yields compounding benefits throughout the diploma programme.

Candidates who develop a habit of asking evaluative questions during their regular economics study—'what are the limitations of this model?', 'under what conditions would this policy fail?', 'who benefits and who loses?'—find that Paper 2 evaluation comes naturally because it mirrors the intellectual habits they have already cultivated. Evaluation is not a separate skill that appears only in examinations; it is a mode of economic thinking that should pervade all study and preparation activities.

Conclusion

Evaluation in IB Economics HL Paper 2 is not an optional enhancement—it is a core assessment objective that determines grade boundaries at every level. Candidates who approach Paper 2 with deployable evaluative structures, specific evaluative content drawn from the stimulus material, and a clear understanding of what the markbands reward consistently outperform those who rely on content knowledge alone. The four evaluation structures outlined here—strengths and limitations, short-run versus long-run, stakeholder analysis, and alternative perspectives—provide a reliable framework for generating quality evaluation on demand. Practice applying these structures to past Paper 2 questions, always grounding evaluation in the specific data provided, and the evaluation criterion will cease to be a source of anxiety and become instead the primary driver of examination success.

Question marks	Typical focus	Evaluation weighting	Recommended structure
4 marks	Definition and identification	Low—brief evaluative comment acceptable	One evaluative sentence integrated with definition
6 marks	Explanation of economic relationships	Moderate—include one evaluative insight	Analysis paragraphs with one evaluative sentence
8 marks	Extended analysis and application	Significant—evaluation contributes to top level	Dedicated evaluative paragraph after analysis
10 marks	Extended evaluation and synthesis	Primary—evaluation determines grade boundary	Multiple evaluative paragraphs with synthesising conclusion

Frequently asked questions

How is the evaluation criterion marked differently across Paper 2 questions of varying lengths?

Evaluation is assessed throughout Paper 2, but its relative contribution increases with question marks. In a 4-mark question, a brief evaluative comment earns some credit, but the primary demand is definition and identification. In 8-mark questions, evaluation becomes a significant differentiator between level 3 and level 4 achievement. In 10-mark extended questions, evaluation is the dominant criterion and determines whether a candidate reaches level 5 or level 6. Examiners apply the markband descriptors proportionally, so an answer to a 10-mark question must demonstrate sustained evaluative engagement, not merely a concluding sentence.

Can I still score well on Paper 2 evaluation if I struggle to think critically under examination conditions?

Yes, because evaluation in Paper 2 is structured rather than spontaneous. Candidates who prepare specific evaluative frameworks—such as strengths and limitations, short-run versus long-run effects, or stakeholder impacts—have ready-made intellectual tools to deploy when reading the stimulus material. The key preparation strategy is to practice past Paper 2 questions and consciously identify evaluative opportunities in each dataset. Over time, this becomes a habit of mind that functions automatically under examination conditions. Structural preparation reduces the cognitive load of evaluation significantly.

Should I always include a diagram in my Paper 2 evaluation?

Diagrams are not mandatory for every question, but they are highly effective evaluative tools when used correctly. A well-constructed diagram can illustrate a deadweight loss, demonstrate the effects of a policy shift, or compare two equilibria, thereby strengthening an evaluative argument. However, a diagram that is neither referenced in the text nor explained in terms of its economic significance contributes only diagram marks, not evaluation marks. The diagram must be integrated into the evaluative prose to serve the evaluation criterion. When time is limited, prioritising quality evaluative writing over an unnecessary diagram is usually the better strategy.

How do I avoid writing generic evaluation that earns no additional credit?

Generic evaluation—phrases such as 'there are advantages and disadvantages' or 'it depends on many factors'—earns no marks beyond what description would earn. The fix is specificity at the level of mechanism. Instead of saying 'the policy has limitations,' identify which particular limitation applies, why it matters in this context, and what evidence from the stimulus material supports this assessment. Evaluation is assessed on the quality of economic thinking, not the quantity of evaluative phrases used. One well-developed evaluative paragraph is worth more than five sentences of generic hedging.

What is the most effective way to practise Paper 2 evaluation before the examination?

The most effective practice method is to work through past Paper 2 questions under timed conditions, deliberately allocating planning time to identify evaluative opportunities in the stimulus material before writing. After completing a response, compare it against the mark scheme and examiner's notes, focusing specifically on how evaluation was rewarded or penalised. Additionally, during regular economics study, cultivate an evaluative habit by asking questions such as 'what are the limitations of this model?' or 'under what conditions would this policy fail?' This consistent practice builds the evaluative reflexes that operate effectively under examination pressure.

4 evaluation structures that boost IB Economics HL Paper 2 scores