Probability
Sample Space and Events
A random experiment is one whose outcome cannot be predicted with certainty in advance, even though every possible result is known beforehand. Tossing a coin, rolling a die, or drawing a card from a well-shuffled pack are all random experiments: you know the menu of possibilities, but not which one will turn up on a given trial.
Each individual result of the experiment is an outcome. The set of all possible outcomes is the sample space, written $S$. For a single die, $S = \{1, 2, 3, 4, 5, 6\}$; for one toss of a coin, $S = \{H, T\}$. Every element of $S$ is called a sample point, and $n(S)$ is the total number of sample points.
An event is any subset of the sample space — a collection of outcomes we choose to single out. "Getting an even number on a die" is the event $E = \{2, 4, 6\}$, a subset of $S$. This is the bridge to the previous chapter: events are sets, and the sample space is the universal set. Everything we do with probability is set theory wearing a different hat.
Because events are sets, the set operations carry direct meanings. The complement $\overline{A}$ (also written $A'$) is the event "$A$ does not happen". The union $A \cup B$ is "$A$ or $B$ (or both)", and the intersection $A \cap B$ is "$A$ and $B$ together".
| Type of event | Meaning | Example (single die) |
|---|---|---|
| Simple (elementary) | Exactly one outcome | $\{4\}$ — getting a 4 |
| Compound | Two or more outcomes | $\{2, 4, 6\}$ — an even number |
| Sure (certain) | The whole sample space $S$ | $\{1,2,3,4,5,6\}$ — a number $\le 6$ |
| Impossible | The empty set $\varnothing$ | getting a 7 |
| Mutually exclusive | $A \cap B = \varnothing$ (cannot both occur) | $\{1,3,5\}$ and $\{2,4,6\}$ |
| Exhaustive | Union covers all of $S$ | $\{1,2,3\}$ and $\{3,4,5,6\}$ |
Two events $A$ and $B$ are mutually exclusive when they share no outcome, so they can never happen on the same trial. A collection of events is exhaustive when their union is the entire sample space, so at least one of them must occur. When events are both mutually exclusive and exhaustive, they partition $S$ cleanly into non-overlapping pieces that together account for everything.
Deeper Insight — probability is set theory with a measuring tape: The reason Chapter 1 on Sets sits in the same syllabus as this chapter is no accident. A sample space is just a universal set, an event is just a subset, and "or", "and", "not" translate exactly into union, intersection, and complement. This is why the vocabulary feels familiar: "mutually exclusive" is the everyday name for disjoint sets, and "exhaustive" simply restates that a family of subsets covers the universal set. Once you accept that an event is a set of outcomes, the laws you already know — De Morgan's laws, the distributive laws — transfer wholesale. Probability then adds one new ingredient on top of this set machinery: a way to measure how large an event is, relative to the whole sample space. Get the set picture firm first, and the numerical rules in the next two topics will feel like bookkeeping rather than new ideas.
- Each coin shows H or T, so the outcomes pair up: $S = \{HH, HT, TH, TT\}$, giving $n(S) = 4$.
- "At least one head" means one or two heads — drop the all-tails outcome.
- $E = \{HH, HT, TH\}$.
Answer: $S = \{HH, HT, TH, TT\}$ and $E = \{HH, HT, TH\}$, with $n(E) = 3$.
- Intersection: $A \cap B = \{3\}$, which is not empty — so they are not mutually exclusive (the outcome 3 lies in both).
- Union: $A \cup B = \{1,2,3,4,5,6\} = S$, so together they cover the whole sample space.
Answer: Not mutually exclusive (they share 3), but they are exhaustive.
- Each die independently shows $1$ to $6$, so $n(S) = 6 \times 6 = 36$ ordered pairs.
- Pairs summing to $7$: $(1,6),(2,5),(3,4),(4,3),(5,2),(6,1)$.
- That is $6$ favourable sample points.
Answer: $n(S) = 36$, and the event "sum $= 7$" has $6$ outcomes.
- $A$ = "heart" contains the $13$ hearts, so $n(A) = 13$.
- The complement $\overline{A}$ = "not a heart" collects every other card.
- $n(\overline{A}) = 52 - 13 = 39$ (the spades, clubs and diamonds).
Answer: $\overline{A}$ = "the card is a spade, club or diamond", with $n(\overline{A}) = 39$.
- Pair each coin result with each die face: $S = \{H1,H2,H3,H4,H5,H6, T1,T2,T3,T4,T5,T6\}$.
- So $n(S) = 2 \times 6 = 12$.
- "Head with an even number" needs H paired with $2,4$ or $6$: $\{H2, H4, H6\}$.
Answer: $n(S) = 12$ and the event is $\{H2, H4, H6\}$, with $3$ outcomes.
- Primes on a die: $A = \{2, 3, 5\}$. Odd numbers: $B = \{1, 3, 5\}$.
- $A \cup B = \{1, 2, 3, 5\}$ — "prime or odd".
- $A \cap B = \{3, 5\}$ — "prime and odd".
Answer: $A \cup B = \{1,2,3,5\}$ (prime or odd) and $A \cap B = \{3,5\}$ (prime and odd).
- A random experiment has known possible outcomes but an unpredictable result; the set of all outcomes is the sample space $S$.
- An event is a subset of $S$; $\varnothing$ is the impossible event and $S$ itself is the sure event.
- Events are mutually exclusive when $A \cap B = \varnothing$, and exhaustive when their union equals $S$.
- Set operations carry meaning: $A \cup B$ = "A or B", $A \cap B$ = "A and B", $\overline{A}$ = "not A".
- Probability is set theory plus a way to measure how large an event is relative to $S$.
Axiomatic Probability
Once outcomes and events are in place, we attach a number — the probability — to each event, measuring how likely it is. The axiomatic approach, due to Kolmogorov, fixes three simple rules that any sensible probability must obey, and everything else follows from them.
For a sample space $S$ and any event $E$, the axioms of probability are:
In words: (1) every probability lies between $0$ and $1$, (2) the sure event has probability $1$, and (3) for mutually exclusive events the probabilities simply add. The impossible event follows immediately: $P(\varnothing) = 0$.
When all outcomes are equally likely — a fair coin, a fair die, a well-shuffled pack — the probability of an event reduces to a ratio of counts. This is the classical (Laplace) definition you will use most:
Notice that this formula only applies when the outcomes are equally likely. A loaded die or a bent coin breaks the assumption, and then you must use observed (empirical) frequencies instead.
A constant companion of any event is its complement $\overline{A}$, the event "$A$ does not occur". Since $A$ and $\overline{A}$ are mutually exclusive and together exhaust $S$, their probabilities add to $1$:
This complement rule is often the fastest route to an answer: when "at least one" appears in a question, computing $1 - P(\text{none})$ is usually far shorter than counting the favourable cases directly.
| Situation | Probability | Reason |
|---|---|---|
| Sure event | $P(S) = 1$ | contains every outcome |
| Impossible event | $P(\varnothing) = 0$ | contains no outcome |
| Single fair die shows a $4$ | $\dfrac{1}{6}$ | $1$ favourable of $6$ equally likely |
| Even number on a die | $\dfrac{3}{6} = \dfrac{1}{2}$ | $\{2,4,6\}$ of $6$ |
| Complement of $A$ | $1 - P(A)$ | $A$ and $\overline{A}$ partition $S$ |
Deeper Insight — why axioms beat a single "definition": Earlier you may have met probability as nothing more than favourable over total. That classical formula is genuinely useful, but it quietly assumes the outcomes are equally likely — an assumption that fails for a weighted coin, an unfair die, or a continuous quantity like a person's exact height. The axiomatic framework sidesteps this by not committing to how the numbers arise; it only demands that whatever numbers we assign are non-negative, sum to $1$ over the whole sample space, and add up over disjoint events. The familiar $\tfrac{n(A)}{n(S)}$ is then just one way to satisfy the axioms, the way that happens to work when symmetry makes every outcome equally likely. Because the complement rule, the addition theorem, and every later result are all derived from these three axioms, learning to reason from them — rather than memorising isolated formulas — is what makes harder problems tractable.
- $n(S) = 6$ equally likely outcomes.
- (a) Greater than $4$: $\{5, 6\}$, so $P = \dfrac{2}{6} = \dfrac{1}{3}$.
- (b) Multiples of $3$: $\{3, 6\}$, so $P = \dfrac{2}{6} = \dfrac{1}{3}$.
Answer: (a) $\dfrac{1}{3}$; (b) $\dfrac{1}{3}$.
- $n(S) = 52$ equally likely cards.
- (a) There are $4$ kings: $P = \dfrac{4}{52} = \dfrac{1}{13}$.
- (b) Face cards are J, Q, K in each suit: $3 \times 4 = 12$, so $P = \dfrac{12}{52} = \dfrac{3}{13}$.
Answer: (a) $\dfrac{1}{13}$; (b) $\dfrac{3}{13}$.
- Use the complement rule: $P(\overline{A}) = 1 - P(A)$.
- $P(\overline{A}) = 1 - \dfrac{2}{5} = \dfrac{3}{5}$.
Answer: $P(\overline{A}) = \dfrac{3}{5}$.
- $n(S) = 2^3 = 8$ equally likely outcomes.
- "At least one tail" is the complement of "no tail" (all heads, the single outcome $HHH$).
- $P(\text{no tail}) = \dfrac{1}{8}$, so $P(\text{at least one tail}) = 1 - \dfrac{1}{8} = \dfrac{7}{8}$.
Answer: $\dfrac{7}{8}$.
- $n(S) = 36$ equally likely ordered pairs.
- Pairs giving a sum of $8$: $(2,6),(3,5),(4,4),(5,3),(6,2)$ — that is $5$ favourable outcomes.
- $P = \dfrac{5}{36}$.
Answer: $\dfrac{5}{36}$.
- Total balls $n(S) = 4 + 6 = 10$, each equally likely to be drawn.
- (a) Red: $P(\text{red}) = \dfrac{4}{10} = \dfrac{2}{5}$.
- (b) By the complement rule, $P(\text{not red}) = 1 - \dfrac{2}{5} = \dfrac{3}{5}$ (check: $\dfrac{6}{10} = \dfrac{3}{5}$).
Answer: (a) $\dfrac{2}{5}$; (b) $\dfrac{3}{5}$.
- The axioms: $0 \le P(E) \le 1$, $P(S) = 1$, and probabilities add for mutually exclusive events.
- For equally likely outcomes, $P(A) = \dfrac{n(A)}{n(S)}$ — favourable over total.
- $P(\varnothing) = 0$ (impossible) and $P(S) = 1$ (sure).
- Complement rule: $P(\overline{A}) = 1 - P(A)$ — ideal for "at least one" problems.
- The classical formula needs equally likely outcomes; without that symmetry, use empirical frequencies.
Addition Theorem
The complement rule handles a single event and its negation. The addition theorem handles the probability that at least one of two events occurs — the event $A \cup B$, read "$A$ or $B$".
For any two events $A$ and $B$ of the same sample space:
The reason for the subtraction is geometric. If you simply add $P(A)$ and $P(B)$, the outcomes lying in both events — the overlap $A \cap B$ — get counted twice, once inside each event. Subtracting $P(A \cap B)$ removes the double count and restores the correct total. On a Venn diagram, you are adding two overlapping discs and then peeling off the lens-shaped middle that you covered twice.
When the two events are mutually exclusive, they have no common outcome, so $A \cap B = \varnothing$ and $P(A \cap B) = 0$. The theorem then collapses to the simple additive form, which is exactly the third axiom:
The result extends to three events, again alternately adding and subtracting overlaps:
A pairing with the complement rule is worth memorising: "neither $A$ nor $B$" is the complement of "$A$ or $B$", which by De Morgan's law gives a handy shortcut.
| Phrase in the question | Event in symbols | How to compute |
|---|---|---|
| $A$ or $B$ (at least one) | $A \cup B$ | $P(A) + P(B) - P(A \cap B)$ |
| $A$ and $B$ (both) | $A \cap B$ | given, or count directly |
| Exactly one of $A$, $B$ | $(A \cup B) \setminus (A \cap B)$ | $P(A \cup B) - P(A \cap B)$ |
| Neither $A$ nor $B$ | $\overline{A \cup B}$ | $1 - P(A \cup B)$ |
Deeper Insight — the addition theorem is inclusion–exclusion in probability dress: If the subtraction in $P(A \cup B) = P(A) + P(B) - P(A \cap B)$ feels familiar, that is because you have already met its set-counting twin: $n(A \cup B) = n(A) + n(B) - n(A \cap B)$ from the chapter on Sets. The two are the same identity. Divide every term of the counting version by $n(S)$ and each count becomes a probability, turning the inclusion–exclusion principle for sizes into the addition theorem for likelihoods. This is the clearest illustration of the chapter's central message: probability is built directly on set theory, and the overlap must be subtracted for exactly the same reason a Venn diagram cannot count its shared region twice. Recognising this lets you reuse one mental tool in two settings, and it explains why the three-event formula alternates signs in the inclusion–exclusion pattern.
- Let $A$ = "king" and $B$ = "heart". Then $P(A) = \dfrac{4}{52}$ and $P(B) = \dfrac{13}{52}$.
- The overlap is the king of hearts, a single card: $P(A \cap B) = \dfrac{1}{52}$.
- Addition theorem: $P(A \cup B) = \dfrac{4}{52} + \dfrac{13}{52} - \dfrac{1}{52} = \dfrac{16}{52} = \dfrac{4}{13}$.
Answer: $\dfrac{4}{13}$.
- $A$ = even $= \{2,4,6\}$, so $P(A) = \dfrac{3}{6}$. $B$ = greater than $4$ $= \{5,6\}$, so $P(B) = \dfrac{2}{6}$.
- Overlap $A \cap B = \{6\}$, so $P(A \cap B) = \dfrac{1}{6}$.
- $P(A \cup B) = \dfrac{3}{6} + \dfrac{2}{6} - \dfrac{1}{6} = \dfrac{4}{6} = \dfrac{2}{3}$.
Answer: $\dfrac{2}{3}$.
- Mutually exclusive means $A \cap B = \varnothing$, so $P(A \cap B) = 0$.
- The theorem reduces to $P(A \cup B) = P(A) + P(B)$.
- $P(A \cup B) = 0.5 + 0.3 = 0.8$.
Answer: $P(A \cup B) = 0.8$.
- $n(S) = 36$. Sum $= 7$: $6$ outcomes, so $P(A) = \dfrac{6}{36}$. Sum $= 11$: $(5,6),(6,5)$, so $P(B) = \dfrac{2}{36}$.
- A roll cannot total both $7$ and $11$, so the events are mutually exclusive: $P(A \cap B) = 0$.
- $P(A \cup B) = \dfrac{6}{36} + \dfrac{2}{36} = \dfrac{8}{36} = \dfrac{2}{9}$.
Answer: $\dfrac{2}{9}$.
- First find "cricket or football": $P(A \cup B) = 0.6 + 0.5 - 0.3 = 0.8$.
- "Neither" is the complement of "at least one": $P(\overline{A \cup B}) = 1 - P(A \cup B)$.
- $P(\text{neither}) = 1 - 0.8 = 0.2$.
Answer: $0.2$.
- $A$ = spade: $13$ cards, $P(A) = \dfrac{13}{52}$. $B$ = face card (J, Q, K of each suit): $12$ cards, $P(B) = \dfrac{12}{52}$.
- Overlap = face cards that are spades (J, Q, K of spades): $3$ cards, $P(A \cap B) = \dfrac{3}{52}$.
- $P(A \cup B) = \dfrac{13}{52} + \dfrac{12}{52} - \dfrac{3}{52} = \dfrac{22}{52} = \dfrac{11}{26}$.
Answer: $\dfrac{11}{26}$.
- Addition theorem: $P(A \cup B) = P(A) + P(B) - P(A \cap B)$ — subtract the overlap so it is counted once.
- For mutually exclusive events ($A \cap B = \varnothing$), this becomes $P(A \cup B) = P(A) + P(B)$.
- "Neither $A$ nor $B$" $= 1 - P(A \cup B)$, using the complement of the union.
- The theorem is the inclusion–exclusion principle from Sets, divided through by $n(S)$.
- Translate words first: "or" means $\cup$, "and" means $\cap$, then apply the formula.