ABD 3e Chapter 3
Location (central tendency)
Width (spread)
Association* (correlation)
*later in term
We do statistics to learn about the World Out There (the population) from Data (the sample)
Location summarizes where values tend to fall in a distribution.
Different summaries answer different questions.
Mean
Median
Mode
Estimate = \(\bar{Y}\)
“Sample mean”
Parameter = \(\mu\)
“Population mean”
\[ \bar{Y} = \frac{\sum Y_i}{n} \]
where
The mode is the most common value in a dataset.
Different measures of width emphasize different aspects of spread.
Range
Difference between the largest and smallest values
Interquartile range (IQR)
Spread of the middle 50% of values
Variance and standard deviation
Typical deviation from the mean
Coefficient of variation (CV)
Variability expressed relative to the mean
Use as a simple measure of spread, when working with small datasets, or as a quick exploratory metric
Example:
Minimum value = \(1.25\)
Maximum value = \(3.55\)
Range = \(3.55−1.25=2.30\)
Because small samples tend to give lower estimates of the range than large samples, the sample range is a biased estimator of the true range of the population.
Median = middle observation, partitions data into two halves
Quartiles = Q1, Q2 (median), Q3, partition data into four quarters
Interquartile range (IQR) =“Q” 3−“Q” 1
Example: \(Q3−Q1 = 3.045−2.34=0.705\)
These measures quantify how far observations typically lie from the average.
Key idea:
Variability is a property of the data, not noise.
\[ \sigma^2 = \frac{\sum (x_i - \mu)^2}{n} \]
\[ s^2 = \frac{\sum (x_i - \bar{x})^2}{n - 1} \] where
\[ \sigma = \sqrt{\frac{\sum (x_i - \mu)^2}{n}} \]
\[ s = \sqrt{\frac{\sum (x_i - \bar{x})^2}{n - 1}} \]
\[ \mathrm{CV} = \frac{s}{\bar{x}} \times 100\% \]
where
Use different measures of variability depending on your goal.
Rounding affects how results are interpreted.
Do not round intermediate steps. Keep full precision during calculations to avoid compounding error
Round only final results. This preserves accuracy while improving readability
Report appropriate precision. Typically one–two decimal places more than the original measurements
Goal: Communicate results clearly without implying false precision.
Example: Values are integers, report mean and SD w/ 1 decimal place

BIOL 275 Biostatistics | Spring 2026