9.198 min read
Visualizing Low-Dimensional Structure
The principal coordinates give each data point's position in the low-dimensional space. For , plotting as a scatter reveals natural groupings. The fraction of variance explained tells how much information is preserved.
A "scree plot" graphs vs . A sharp "elbow" suggests the appropriate . Common heuristic: choose so that FVE (90% variance explained).
Formal View
Definition 9.8 — Fraction of Variance Explained
.
A scree plot graphs vs . An "elbow" suggests the appropriate .
Interactive Visualization
Span Visualizer
Why This Matters
PCA scatter plots compress high-dimensional data into a human-readable 2D picture.
- Genomics: 2D PCA of genetic data separates populations by ancestry.
- Quality control: PCA of sensor data identifies anomalous batches.
- Finance: PCA reveals market regimes.
Quiz
Question 1
If singular values are , what fraction of variance is explained by the top-2 components?
Question 2
A 2D PCA scatter plot is useful even for data in .
Common Mistakes
- Assuming 2D PCA always gives a useful plot — check FVE first.
- Forgetting axes in a PCA plot are principal components, not original features.