Misleading Graphs & Design Principles
Misleading Graphs & Design Principles
- Truncated y-axis: starting the axis above zero exaggerates differences
- Distorted scales: unequal intervals on an axis
- Cherry-picked range: showing only part of the data to support a narrative
- 3-D effects: perspective distortion makes comparisons unreliable
- Dual axes: two different y-scales can imply false correlations
- Area/volume encoding: humans misjudge areas (doubling radius quadruples area)
- Maximize data-ink ratio
- Avoid chart junk and non-data ink
- Show data variation, not design variation
- Use labels directly on the graphic when possible
- Small multiples over one complex chart
$$\text{Lie Factor} = \frac{\text{Size of effect shown in graphic}}{\text{Size of effect in data}}$$
Ideally = 1. Values far from 1 indicate distortion.
A bar chart shows profits of $\$100M$ vs. $\$105M$ but the y-axis starts at $\$98M$. Why is this misleading?
The 5% increase looks like a 70% increase visually. Starting at 0 would show the true proportion.
Sales doubled, but a graphic uses icons where the taller icon is 2× the height and 2× the width. Lie factor?
Area quadrupled (2×2). Lie factor $= 4/2 = 2$. The graphic exaggerates the change by a factor of 2.
A graph shows temperature rising from 14.0°C to 14.8°C over 100 years with a steep line. Is it misleading?
Depends on context. A truncated y-axis exaggerates the visual trend. Show the full scale or clearly label the narrow range.
Practice Problems
Show Answer Key
1. Decorative, non-data visual elements that distract
2. For bar charts yes; for line charts showing small changes, a truncated axis with clear labeling can be acceptable
3. Perspective makes front slices look larger than back slices
4. Area $\propto r^2$; visual effect $= 4\times$, data effect $= 2\times$. Lie factor $= 2$.
5. Two different y-scales; can make unrelated series appear correlated, scales can be manipulated
6. Repeating the same chart structure with different subsets of data for comparison
7. Poor color scales (rainbow) can create false boundaries; sequential scales are better for continuous data
8. Reduces eye movement and cognitive load
9. Growth rates, data spanning several orders of magnitude
10. Only if clearly indicated and if zero isn't meaningful for the context
11. Clear title, labeled axes, source attribution (also: consistent scale, minimal chart junk)
12. Start the y-axis at 0, or use an axis break with clear notation