IMOClass 9 › Statistics

Statistics

Collecting and Presenting Data

What is Statistics?

Statistics is the branch of mathematics that deals with the collection, organization, analysis, and interpretation of data. It helps us make sense of information and draw meaningful conclusions.

What is Data?

Data is a collection of facts, numbers, or observations. Examples include:

  • Test scores of students in a class
  • Heights of players on a sports team
  • Number of cars passing a toll booth each hour

Types of Data:

TypeDefinitionExample
**Secondary Data**Obtained from published sourcesData from a government report

Steps in Data Collection and Presentation:

  1. Collect the data through observation, surveys, or experiments
  2. Organize the data in a systematic way
  3. Present the data in tables or graphs for easy understanding

Frequency Distribution Table:

A frequency distribution table shows how often each value (or range of values) occurs in the data.

Raw Data: 5, 3, 8, 5, 7, 5, 9, 8, 5, 7, 3, 8, 5, 6, 7

Frequency Table:

ValueTally MarksFrequency
5IIII5
6I1
7III3
8III3
9I1
**Total****15**
Data Collection & Frequency DistributionRaw Data: Marks of 20 students: 32,45,67,78,56,45,89,67,45,78,90,56,78,45,67,90,56,78,67,89MarksTallyFrequencyCumul. Freq32I1145IIII4556III3867IIII41278IIII41689II21890II220Class interval groups continuous data (e.g. 30-40, 40-50, ...)Class width = upper limit − lower limit; Tally marks group in 5s
Example 1: Class mark of 20–30?
(20 + 30) ÷ 2 = 25.
Example 2: Class size of 15.5–20.5?
20.5 − 15.5 = 5.
Quick recap
  • Grouped data uses classes and frequencies.
  • Class mark = (lower + upper) ÷ 2; size = upper − lower.
✓ Quick check
Which of these is the formula for class mark?
Class mark is the exact midpoint of the class interval, calculated as (Upper limit + Lower limit)/2.
The class mark of the class interval 120 − 150 is:
Class mark = (Lower limit + Upper limit)/2 = (120 + 150)/2 = 270/2 = 135.

Graphs of Data

What are Graphical Representations?

Graphical representations are visual ways to display data, making it easier to understand patterns, trends, and comparisons at a glance.

Types of Graphs:

Graph TypeBest Used ForKey Feature
**Histogram**Showing distribution of continuous dataBars touch each other (no gaps)
**Frequency Polygon**Comparing multiple distributionsLine connecting midpoints

Bar Graph:

  • Used for discrete data (separate categories)
  • Bars have equal width with gaps between them
  • Height of bar represents frequency
  • Can be vertical or horizontal

Histogram:

  • Used for continuous data grouped into class intervals
  • Bars touch each other (no gaps)
  • Width of bar represents class interval size
  • Area of bar represents frequency

Frequency Polygon:

  • Created by joining the midpoints of histogram bars
  • Starts and ends on the x-axis (at frequency 0)
  • Useful for comparing two or more data sets on same graph
Measures of Central TendencyMeanSum of all values / Total count(32+45+67)/3 = 48MMedianMiddle value when sorted32,45,67 → Median=45MoModeMost frequently occurring value32,45,45,67 → Mode=45Median Rules:Odd n: Median = ((n+1)/2)th valueEven n: Median = average of (n/2)th and (n/2+1)th valuesA dataset can have no mode, one mode, or multiple modes (bimodal/multimodal)Empirical relationship: Mode ≈ 3×Median − 2×Mean
Example 1: Which graph suits continuous grouped data?
A histogram (bars are adjacent, no gaps).
Example 2: A frequency polygon joins which points?
The midpoints of the tops of the histogram rectangles.
Quick recap
  • Bar graph: gaps; histogram: adjacent bars, area ∝ frequency.
  • Frequency polygon joins bar-top midpoints.
✓ Quick check
What is the true class limit for the class interval 10-14 if the next interval starts at 15?
For discontinuous classes, we subtract 0.5 from the lower limit and add 0.5 to the upper limit. So, 9.5 to 14.5.
A tally mark of four vertical lines crossed by a diagonal line represents the number:
A group of five tallies is conventionally drawn as four vertical lines and one diagonal line across them.

Mean, Median and Mode

What are Measures of Central Tendency?

Measures of central tendency are single values that describe the "center" or "typical value" of a data set. The three main measures are mean, median, and mode.

Mean (Average):

  • Sum of all values divided by the number of values
  • Formula: \(\bar{x} = \frac{\text{Sum of all observations}}{\text{Number of observations}}\)
  • Most commonly used measure

Median (Middle Value):

  • The middle value when data is arranged in order
  • For odd number of observations: middle value
  • For even number of observations: average of two middle values
  • Not affected by extreme values (outliers)

Mode (Most Frequent):

  • The value that occurs most frequently
  • A data set can have one mode (unimodal), two modes (bimodal), or no mode
  • Useful for categorical data

Comparison of Measures:

MeasureBest Used WhenAffected by Outliers?
MedianData has outliers or is skewedNo
ModeData is categorical or has repeated valuesNo
Bar Graph & HistogramBar Graph (Discrete Data)Math80Sci65Eng90Art50ScoreHistogram (Continuous Data)30-4040-5050-6060-7070-80Bar Graph: gaps between bars (discrete categories)Histogram: NO gaps (continuous data, class intervals)Area of each bar = Frequency × Class width
Example 1: Mean of 4, 6, 8, 10, 12?
(4+6+8+10+12) ÷ 5 = 40 ÷ 5 = 8.
Example 2: Mode of 2, 3, 3, 5, 3, 7?
3, since it appears most often.
Quick recap
  • Mean = sum ÷ count; median = middle value (ordered).
  • Mode = most frequent; Mode ≈ 3·Median − 2·Mean.
✓ Quick check
A milkman sells milk at ₹50, ₹55, ₹60, and ₹65 per litre in four different localities. If he sells equal quantities in each locality, the mean price is:
Mean = (50 + 55 + 60 + 65) / 4 = 230 / 4 = ₹57.5.
A class consists of 15 boys and 10 girls. The average weight of boys is 40 kg and that of girls is 35 kg. The average weight of the whole class is:
Total weight = (15 × 40) + (10 × 35) = 600 + 350 = 950. Total students = 25. Average = 950/25 = 38 kg.
Ready to test this chapter?
Take the Chapter Test →