Motivation for SVD
The spectral theorem gives a beautiful decomposition for symmetric matrices. But what about a general matrix that might not even be square? We need a generalization — something that reveals the geometric structure of any linear transformation, not just symmetric ones.
The key insight: even if itself is not symmetric, the products (an matrix) and (an matrix) are always symmetric and PSD. We can apply the spectral theorem to each of them. The Singular Value Decomposition (SVD) packages this information into a single clean factorization that works for any matrix.
Geometrically, the SVD says every linear map is a composition of: (1) a rotation/reflection in the domain, (2) a scaling along coordinate axes, and (3) a rotation/reflection in the codomain. This is the most fundamental geometric description of any linear transformation.
Formal View
Why This Matters
SVD is the Swiss army knife of linear algebra — it solves least squares, compresses data, finds low-rank structure, and enables PCA.
- Image compression: keep only the top singular values and vectors to get a compact representation.
- Natural language processing: latent semantic analysis uses SVD to find topics in text.
- Recommendation systems: matrix factorization via SVD identifies latent factors in user-item matrices.
Quiz
The spectral theorem applies to all matrices, not just symmetric ones.
For any matrix , the product is:
Common Mistakes
- Thinking SVD is just for square matrices — it works for any matrix, even non-square.
- Confusing SVD with eigendecomposition — eigendecomposition applies to square matrices, SVD applies universally.