Training Statistics Random Variables and Normal Models
3 / 5

Random Variables and Normal Models

24 min Statistics

Random Variables and Normal Models

Many real-world phenomena — heights of adults, measurement errors in a laboratory, standardized test scores — follow a remarkably predictable pattern when you collect enough data. Their histograms form a symmetric, bell-shaped curve that mathematicians call the normal distribution. Understanding this distribution is one of the most powerful tools in all of statistics, because it lets you make precise probability statements about where individual observations or sample averages are likely to fall.

Before introducing the normal model, we need the concept of a random variable — a quantity whose value is determined by the outcome of a random process. A random variable can be discrete (like the number of heads in ten coin flips) or continuous (like the exact time it takes to run a mile). Every random variable has an expected value, which is its long-run average, and a standard deviation, which measures how much individual outcomes tend to differ from that average.

The normal distribution is a specific continuous model defined entirely by two parameters: its mean $\mu$ (the center of the bell) and its standard deviation $\sigma$ (how wide the bell spreads). The celebrated 68-95-99.7 rule tells you that roughly 68% of observations fall within one standard deviation of the mean, about 95% fall within two standard deviations, and nearly all — 99.7% — fall within three. This simple rule lets you quickly estimate probabilities without reaching for a calculator or a z-table.

In this lesson you will explore how changing $\mu$ and $\sigma$ reshapes the normal curve, practice applying the empirical rule to real-world scenarios, and build the intuition you need for the confidence intervals and hypothesis tests that follow in the next lesson.

Expected Value

For a discrete random variable, $$E(X)=\sum xP(x).$$

Normal Distribution

The normal model is bell-shaped, symmetric, and described by mean $\mu$ and standard deviation $\sigma$.

68-95-99.7 Rule

About 68% of observations lie within $1\sigma$, 95% within $2\sigma$, and 99.7% within $3\sigma$ of the mean.

Example 1

If test scores are normal with mean 70 and standard deviation 10, what interval captures about 95% of scores?

  1. $70 \pm 20$
  2. so about $50$ to $90$.
Interactive Explorer: Normal Distribution
68% interval: −10 to 10
95% interval: −20 to 20
99.7% interval: −30 to 30

Practice Problems

1. What does a random variable do?
2. State the 68-95-99.7 rule.
3. Mean $=100$, SD $=15$. Give the 68% interval.
4. Mean $=100$, SD $=15$. Give the 95% interval.
5. Is the normal distribution symmetric or skewed?
Show Answer Key

1. It assigns a number to each outcome

2. 68% within 1 SD, 95% within 2 SD, 99.7% within 3 SD

3. $85$ to $115$

4. $70$ to $130$

5. Symmetric