P5Empirical samples tend to theoretical distributions with sample size

Study with AI tutor 8 flashcards →Snap a question →

Notes

Empirical samples and the law of large numbers

OCR J560 Statistics expects qualitative understanding: as sample size grows, the relative frequency of outcomes gets close to their true probability.

Statement

If a random experiment is repeated n times and an event has true probability p, then the relative frequency of the event in those n trials approaches p as n → ∞.

This is the law of large numbers. It justifies using relative frequency as an estimate of probability.

What this looks like in practice

For a fair coin, P(heads) = 0.5.

Trials	Likely range of heads count
10	3 to 7 (very wide)
100	40 to 60
1000	470 to 530
10 000	4900 to 5100

The range as a proportion shrinks as n grows. Single trials fluctuate; long runs stabilise.

Implications for sampling

Small samples are noisy — be cautious about conclusions.
Larger samples give better estimates — so OCR questions often ask "is your estimate more or less reliable than…" with bigger n always more reliable.
You can never get the exact theoretical probability with a finite sample, only approach it.

Quantitative example

A bag contains an unknown ratio of red and blue counters. After 50 draws (with replacement), you find 30 red.

Estimate of P(red) = 30/50 = 0.6.

After 500 draws, 285 red. P(red) ≈ 0.57. Higher confidence; the 0.6 was a slight overestimate.

After 5000 draws, 2810 red. P(red) ≈ 0.562. Likely the true probability is close to this.

OCR phrasing patterns

OCR sets these as 1–3 mark questions, typically structured:

"Why is sample size of 1000 better than 50 for estimating?" → law of large numbers, less variance.
"Predict the relative frequency for 10 000 trials." → it will be close to (but not exactly) the theoretical probability.

OCR mark scheme conventions

B1 for "more reliable with larger samples" / "law of large numbers" / "less affected by random variation".
B1 for "approaches the theoretical probability".
"Closer to" or "approximately equal to" expected.

⚠Common mistakes

Saying the relative frequency will eventually EXACTLY EQUAL the theoretical probability — it only approaches.
Concluding "fair/biased" from a small sample.
Forgetting that the gambler's fallacy is wrong: past outcomes don't affect future probabilities.

AI-generated · claude-opus-4-7 · v3-ocr-maths-leaves

Practice questions

Try each before peeking at the worked solution.

Question 14 marks
Sample size and reliability
OCR J560/02 — Foundation (calculator)

A spinner is suspected of being biased. Two students do experiments to estimate P(spinner = 1).
- Student A: 50 spins, 14 ones.
- Student B: 500 spins, 132 ones.
(a) Calculate the relative frequency of P(1) for each student. [2]
(b) Whose estimate of P(1) is more reliable? Justify. [2]
Ask AI about this
AI-generated · claude-opus-4-7 · v3-ocr-maths-leaves
Question 25 marks
Predicting relative frequency
OCR J560/03 — Foundation (calculator)

A fair die is rolled.

(a) State the theoretical probability of rolling a 6. [1]
(b) Estimate the number of 6s in 1200 rolls. [2]
(c) After 1200 rolls there are 218 sixes. State, with reason, whether this is consistent with a fair die. [2]
Ask AI about this
AI-generated · claude-opus-4-7 · v3-ocr-maths-leaves
Question 34 marks
Bias detection
OCR J560/05 — Higher (calculator)

A coin is flipped 10 000 times. There are 5800 heads.

(a) Calculate the relative frequency of heads. [1]
(b) Use this evidence to argue whether the coin is biased. [3]
Ask AI about this
AI-generated · claude-opus-4-7 · v3-ocr-maths-leaves

Flashcards

P5 — Empirical samples tend to theoretical distributions with sample size

8-card SR deck for OCR GCSE Mathematics J560 (leaf top-up) topic P5

8 cards · spaced repetition (SM-2)