Data Handling
Mean, Median, Mode & Range
Measures of Center describe the typical value in a dataset.
| Measure | Definition | Example: {2,4,4,6,9} |
|---|---|---|
| Mean | Sum divided by count | (2+4+4+6+9)=25 / 5 = 5 |
| Median | Middle value when data is sorted | Sorted: 2,4,4,6,9 — Middle = 4 |
| Mode | Most frequently occurring value | 4 appears twice — Mode = 4 |
| Range | Maximum minus minimum | 9 - 2 = 7 |
When dataset has even count: Median = average of the two middle values.
Outliers affect the mean significantly but not the median.
FINDING MEDIAN:
Odd dataset: {2, 4, 4, 6, 9}
Sorted: 2 4 [4] 6 9
^
Median = 4 (3rd of 5)
Even dataset: {2, 4, 6, 8}
Sorted: 2 [4 6] 8
^
Median = (4+6)/2 = 5
MEAN as balance point:
Data: {2, 4, 4, 6, 9} Mean=5
Deviations: -3, -1, -1, +1, +4
Sum of deviations = 0 (always!)- Mean = sum ÷ count; median = middle value; mode = most frequent.
- Range = highest − lowest.
- Mean is sensitive to outliers (extreme values)
- Median is resistant to outliers
- Mode is the most frequent value; a dataset can have no mode or multiple modes
- Range measures spread (not center)
Bar Graphs & Double Bar Graphs
A bar graph shows data with bars whose heights give the values. A double bar graph places two bars side by side for each item, so two sets of data can be compared directly.
- Bar height shows the value; the tallest is the most.
- A double bar graph compares two data sets.
Probability
Probability measures the chance of an event, from 0 (impossible) to 1 (certain). It is found by P = (favourable outcomes) ÷ (total outcomes). A fair coin gives P(head) = ½.
- Probability ranges from 0 (impossible) to 1 (certain).
- P = favourable outcomes ÷ total outcomes.