TopMyGrade

GCSE/Mathematics/Edexcel

S6Interpret scatter graphs; correlation and causation

Notes

Scatter graphs and correlation

Scatter graphs appear on every Edexcel Paper 2 or 3 — typically Foundation 4–5 marks, Higher with an "is X correlation evidence Y caused Z" critique.

Drawing a scatter graph

  • Plot each (x, y) bivariate data point as a cross (×). Edexcel mark schemes typically award B1 for "all points plotted within ±½ small square".
  • Label both axes including units.

Types of correlation

  • Positive correlation — as x increases, y tends to increase. Line of best fit slopes upward.
  • Negative correlation — as x increases, y tends to decrease. Line slopes downward.
  • No correlation — no clear pattern. Do not draw a line of best fit.
  • Strong vs weak — strong = points close to a clear line; weak = points scattered around it.

Line of best fit

Draw a single straight line that:

  • Passes roughly through the middle of the data (about half the points each side).
  • Goes through the mean point (x̄, ȳ) when both means are easy to compute.
  • Does not need to pass through (0, 0).

Use a ruler. Free-hand wavy lines lose the C1 communication mark.

Using the line of best fit

  • Interpolation — reading a y-value for an x within the data range. Reliable.
  • Extrapolation — reading a y-value outside the data range. Unreliable; mention this when asked about reliability.

Outliers

A point that does not fit the trend. Edexcel asks "circle the outlier" B1 and may ask whether you would include it when drawing the line of best fit (typically, exclude it).

Correlation vs causation

This is the Higher exam question. Just because two variables correlate does not mean one causes the other.

Example: scatter graph of "ice cream sales" vs "drowning incidents" shows positive correlation. Cause is a third variable: hot weather. Both rise together but neither causes the other.

A correct exam answer is in three parts:

  1. Describe the correlation seen.
  2. State the relationship is not causation.
  3. Suggest a confounding variable or alternative explanation.

Common Edexcel exam tip

For "is there a relationship?" questions, always quote both the strength and direction: "strong negative correlation" earns 2 marks where "negative" alone earns only 1.

AI-generated · claude-opus-4-7 · v3-edexcel-maths-leaves

Practice questions

Try each before peeking at the worked solution.

  1. Question 16 marks

    Plot, identify, predict

    Edexcel Paper 2F (calculator) — Foundation

    A scatter graph shows the height (cm) and shoe size of 10 students. The points fall close to a straight line sloping upwards.

    (a) Describe the type and strength of correlation. (2 marks)
    (b) The line of best fit passes through (140, 4) and (180, 9). Estimate the shoe size of a student who is 165 cm tall. (2 marks)
    (c) State whether the line of best fit can reliably predict the shoe size of a 200 cm-tall student. Explain. (2 marks)

    Ask AI about this

    AI-generated · claude-opus-4-7 · v3-edexcel-maths-leaves

  2. Question 23 marks

    Correlation vs causation critique

    Edexcel Paper 3H — Higher

    A study finds a strong positive correlation between the number of fire engines sent to a fire and the amount of damage caused.

    A student concludes: "Sending more fire engines causes more damage."

    (a) State whether you agree with the student's conclusion. (1 mark)
    (b) Explain a possible reason for the correlation that does not involve causation. (2 marks)

    Ask AI about this

    AI-generated · claude-opus-4-7 · v3-edexcel-maths-leaves

  3. Question 34 marks

    Outlier identification

    Edexcel Paper 2F — Foundation

    A scatter graph shows the time spent revising (hours) and percentage exam score for 12 students. Most points lie close to a line of best fit, but one point at (1, 92%) is clearly above the line.

    (a) Circle the outlier. (1 mark)
    (b) Suggest a reason why this student might be an outlier. (1 mark)
    (c) The teacher draws the line of best fit. State whether the outlier should be included. Justify briefly. (2 marks)

    Ask AI about this

    AI-generated · claude-opus-4-7 · v3-edexcel-maths-leaves

Flashcards

S6 — Interpret scatter graphs; correlation and causation

7-card SR deck for Edexcel GCSE Mathematics (1MA1) — Leaves Batch 2 topic S6

7 cards · spaced repetition (SM-2)