9.118 min read
The Linear Reduction Problem
After centering, the linear reduction problem is: minimize over all matrices with orthonormal columns.
Equivalently, this maximizes the variance captured: . Since is fixed, minimizing the residual is the same as maximizing the captured variance — the key duality behind PCA.
Formal View
Theorem 9.4 — Equivalence: Minimize Residual = Maximize Captured Variance
For centered data matrix and orthonormal :
Minimizing reconstruction error is equivalent to maximizing .
Why This Matters
The equivalence between minimizing residuals and maximizing variance is what makes PCA "optimal" in the least-squares sense.
- PCA finds directions of maximum variance — the same as directions of minimum error.
- Sensor placement: find locations capturing the most variance in a field.
- Visualizing to 2D: maximize variance preserves the most structure.
Quiz
Question 1
Minimizing the total reconstruction error is equivalent to maximizing the total variance captured.
Question 2
The reconstruction error equals:
Common Mistakes
- Thinking we maximize variance of the original data — we maximize variance of the projected data .
- Confusing (coordinates, ) with (projection, ).