TopMyGrade

GCSE/Mathematics/OCR

P5Empirical samples tend to theoretical distributions with sample size

Notes

Empirical samples and the law of large numbers

OCR J560 Statistics expects qualitative understanding: as sample size grows, the relative frequency of outcomes gets close to their true probability.

Statement

If a random experiment is repeated n times and an event has true probability p, then the relative frequency of the event in those n trials approaches p as n → ∞.

This is the law of large numbers. It justifies using relative frequency as an estimate of probability.

What this looks like in practice

For a fair coin, P(heads) = 0.5.

TrialsLikely range of heads count
103 to 7 (very wide)
10040 to 60
1000470 to 530
10 0004900 to 5100

The range as a proportion shrinks as n grows. Single trials fluctuate; long runs stabilise.

Implications for sampling

  • Small samples are noisy — be cautious about conclusions.
  • Larger samples give better estimates — so OCR questions often ask "is your estimate more or less reliable than…" with bigger n always more reliable.
  • You can never get the exact theoretical probability with a finite sample, only approach it.

Quantitative example

A bag contains an unknown ratio of red and blue counters. After 50 draws (with replacement), you find 30 red.

Estimate of P(red) = 30/50 = 0.6.

After 500 draws, 285 red. P(red) ≈ 0.57. Higher confidence; the 0.6 was a slight overestimate.

After 5000 draws, 2810 red. P(red) ≈ 0.562. Likely the true probability is close to this.

OCR phrasing patterns

OCR sets these as 1–3 mark questions, typically structured:

  • "Why is sample size of 1000 better than 50 for estimating?" → law of large numbers, less variance.
  • "Predict the relative frequency for 10 000 trials." → it will be close to (but not exactly) the theoretical probability.

OCR mark scheme conventions

  • B1 for "more reliable with larger samples" / "law of large numbers" / "less affected by random variation".
  • B1 for "approaches the theoretical probability".
  • "Closer to" or "approximately equal to" expected.

Common mistakes

  1. Saying the relative frequency will eventually EXACTLY EQUAL the theoretical probability — it only approaches.
  2. Concluding "fair/biased" from a small sample.
  3. Forgetting that the gambler's fallacy is wrong: past outcomes don't affect future probabilities.

AI-generated · claude-opus-4-7 · v3-ocr-maths-leaves

Practice questions

Try each before peeking at the worked solution.

  1. Question 14 marks

    Sample size and reliability

    OCR J560/02 — Foundation (calculator)

    A spinner is suspected of being biased. Two students do experiments to estimate P(spinner = 1).

    • Student A: 50 spins, 14 ones.
    • Student B: 500 spins, 132 ones.

    (a) Calculate the relative frequency of P(1) for each student. [2]
    (b) Whose estimate of P(1) is more reliable? Justify. [2]

    Ask AI about this

    AI-generated · claude-opus-4-7 · v3-ocr-maths-leaves

  2. Question 25 marks

    Predicting relative frequency

    OCR J560/03 — Foundation (calculator)

    A fair die is rolled.

    (a) State the theoretical probability of rolling a 6. [1]
    (b) Estimate the number of 6s in 1200 rolls. [2]
    (c) After 1200 rolls there are 218 sixes. State, with reason, whether this is consistent with a fair die. [2]

    Ask AI about this

    AI-generated · claude-opus-4-7 · v3-ocr-maths-leaves

  3. Question 34 marks

    Bias detection

    OCR J560/05 — Higher (calculator)

    A coin is flipped 10 000 times. There are 5800 heads.

    (a) Calculate the relative frequency of heads. [1]
    (b) Use this evidence to argue whether the coin is biased. [3]

    Ask AI about this

    AI-generated · claude-opus-4-7 · v3-ocr-maths-leaves

Flashcards

P5 — Empirical samples tend to theoretical distributions with sample size

8-card SR deck for OCR GCSE Mathematics J560 (leaf top-up) topic P5

8 cards · spaced repetition (SM-2)