The Logic Behind the Math: Measuring the "Spread"
The core logic of JEE Statistics is calculating Dispersion. The most important formula to master is the one for Variance, which tells us how far, on average, each data point is from the mean.$$\text{Variance } (\sigma^2) = \frac{\Sigma x_i^2}{n} - (\bar{x})^2$$
This is often called the "Mean of Squares minus the Square of the Mean." It is much faster than the standard $(x - \bar{x})^2$ method in an exam.
Step-by-Step Solved Example
Problem: Find the Mean and Variance of the first five natural numbers: $1, 2, 3, 4, 5$.
- Step 1: Calculate Mean ($\bar{x}$).$(1+2+3+4+5) / 5 = 15/5 = 3$.
- Step 2: Calculate Mean of Squares.$(1^2 + 2^2 + 3^2 + 4^2 + 5^2) / 5 = (1 + 4 + 9 + 16 + 25) / 5 = 55/5 = 11$.
- Step 3: Apply the Variance Formula.$\sigma^2 = (\text{Mean of Squares}) - (\text{Mean})^2$.$\sigma^2 = 11 - (3)^2 = 11 - 9 = 2$.
- Step 4: Find Standard Deviation ($\sigma$).$\sigma = \sqrt{2} \approx 1.414$.
Alternative Methods: Assumed Mean Method
When dealing with large numbers (e.g., $1005, 1010, 1015$), don't calculate from zero. Pick an "Assumed Mean" ($A = 1010$) and work with the deviations ($d_i = x_i - A$). This keeps the numbers small and prevents calculation errors. This is the Shift of Origin property.
Exam Trap Alert: The "Outlier" Effect
The Mean is very sensitive to outliers, but the Median is not.
JEE Trap: If a data set is $\{1, 2, 3, 4, 100\}$, the Mean is $22$, which doesn't represent most of the numbers. However, the Median is $3$. In exams, if the question asks for the "best" representation of skewed data, the answer is often the Median.
Practice Problem (JEE Main Level)
Question: If the mean of 10 observations is 50 and their standard deviation is 8, find the new mean and standard deviation if every observation is multiplied by 2 and then increased by 5.Hint: Mean follows both operations; SD only follows multiplication.