Computing SVD via Covariance Matrix

The algorithm for the SVD via $K = AA^\top$: (1) form $K = AA^\top$; (2) eigendecompose $K = U\Lambda_K U^\top$, with the eigenvalues sorted in descending order; (3) set $\sigma_i = \sqrt{(\Lambda_K)_{ii}}$; (4) compute $\mathbf{v}_i = \frac{1}{\sigma_i}A^\top \mathbf{u}_i$ for each nonzero $\sigma_i$; (5) for zero $\sigma_i$, extend the $\mathbf{v}_i$ to an orthonormal basis of $\mathbb{R}^n$.

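For an $m \times n$ matrix $A$, this approach is preferred when $m < n$, because the $m \times m$ covariance matrix $AA^\top$ is then smaller than the $n \times n$ Gram matrix $A^\top A$. Either path yields the same singular values; the singular vectors agree up to sign (and up to the choice of basis within the subspace of any repeated or zero singular value).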
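The reason steps (2) to (4) recover the SVD can be sketched by substituting $A = U\Sigma V^\top$ into $K$ and into $A^\top \mathbf{u}_i$. The derivation below assumes only that $U$ and $V$ are orthogonal and that $\Sigma$ is the $m \times n$ matrix with the $\sigma_i$ on its diagonal:

$$
AA^\top = U\Sigma V^\top V\Sigma^\top U^\top = U(\Sigma\Sigma^\top)U^\top, \qquad (\Sigma\Sigma^\top)_{ii} = \sigma_i^2
$$

$$
A^\top \mathbf{u}_i = V\Sigma^\top U^\top \mathbf{u}_i = \sigma_i \mathbf{v}_i \quad\Longrightarrow\quad \mathbf{v}_i = \frac{1}{\sigma_i}A^\top \mathbf{u}_i \quad (\sigma_i > 0)
$$

So the eigenvectors of $K$ are the left singular vectors, its eigenvalues are the squared singular values, and each right singular vector follows from one matrix-vector product.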

Formal View

Example 9.1 — SVD via Covariance — Algorithm

Given $A$ ($m \times n$), compute the SVD via $K = AA^\top$:

1. Form $K = AA^\top$ ($m \times m$).
2. Eigendecompose: $K = U\Lambda_K U^\top$, sort $\lambda_i$ descending.
3. $\sigma_i = \sqrt{\lambda_i}$; form the $m \times n$ matrix $\Sigma$.
4. $\mathbf{v}_i = \frac{1}{\sigma_i}A^\top \mathbf{u}_i$ for $\sigma_i > 0$; extend for zero $\sigma_i$.
5. Verify: $A = U\Sigma V^\top$.
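The steps above can be sketched in NumPy as follows. This is a minimal illustration, not a production routine: the function name `svd_via_aat` and the relative cutoff `tol` are assumptions introduced here, and the QR-based basis extension is one possible way to handle zero $\sigma_i$ rather than anything the algorithm prescribes.

```python
import numpy as np

def svd_via_aat(A, tol=1e-6):
    """Full SVD (U, Sigma, V) of an m x n matrix A via K = A A^T.

    Hypothetical helper; tol is an assumed relative cutoff below which
    a computed singular value is treated as zero.
    """
    A = np.asarray(A, dtype=float)
    m, n = A.shape
    k = min(m, n)

    # Step 1: form the m x m matrix K = A A^T.
    K = A @ A.T

    # Step 2: eigendecompose the symmetric K. eigh returns eigenvalues in
    # ascending order, so reverse both outputs to get descending order.
    lam, U = np.linalg.eigh(K)
    lam, U = lam[::-1], U[:, ::-1]

    # Step 3: singular values. Round-off can make tiny eigenvalues negative,
    # so clip at zero before the square root, then zero out values below tol.
    sigma = np.sqrt(np.clip(lam[:k], 0.0, None))
    r = int(np.sum(sigma > tol * sigma[0]))  # numerical rank
    sigma[r:] = 0.0
    Sigma = np.zeros((m, n))
    Sigma[np.arange(k), np.arange(k)] = sigma

    # Step 4: v_i = A^T u_i / sigma_i for the r nonzero singular values.
    V_r = (A.T @ U[:, :r]) / sigma[:r]

    # Extend to an orthonormal basis of R^n: in the QR factorization of
    # [V_r | I], the columns of Q after the first r are orthogonal to V_r.
    Q, _ = np.linalg.qr(np.hstack([V_r, np.eye(n)]))
    V = np.hstack([V_r, Q[:, r:n]])
    return U, Sigma, V
```

Step 5 is then a single check such as `np.allclose(U @ Sigma @ V.T, A)`. The cutoff is deliberately loose because a zero singular value computed as the square root of a round-off-sized eigenvalue is only accurate to roughly the square root of machine precision.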

Why This Matters

Knowing how to compute SVD from either $AA^\top$ or $A^\top A$ allows choosing the smaller matrix to reduce computational cost.

Word embeddings: tall matrices (many words, few documents) use the Gram path $A^\top A$, since the number of columns is the smaller dimension.
Scientific computing: $10000 \times 100$ matrices use the $100 \times 100$ Gram path.
Library routines such as MATLAB's `svd(A)` use optimized algorithms that operate on $A$ directly instead of forming either product.
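The choice between the two paths can be made from the shape of $A$ alone. The sketch below is a minimal illustration under stated assumptions: `thin_svd_smaller_path` and its `tol` cutoff are hypothetical names, and it returns only the thin SVD (the singular vectors belonging to nonzero singular values).

```python
import numpy as np

def thin_svd_smaller_path(A, tol=1e-6):
    """Thin SVD (U_r, sigma_r, V_r) of A via the smaller of A A^T and A^T A.

    Hypothetical helper; tol is an assumed relative cutoff below which
    a computed singular value is treated as zero.
    """
    A = np.asarray(A, dtype=float)
    m, n = A.shape
    if m > n:
        # Tall matrix: the covariance path applied to A^T forms A^T A (n x n),
        # which is the Gram path for A. Since A^T = V Sigma^T U^T, the roles
        # of U and V are swapped in the result.
        V, sigma, U = thin_svd_smaller_path(A.T, tol)
        return U, sigma, V

    # Wide or square matrix: covariance path on the m x m matrix A A^T.
    lam, U = np.linalg.eigh(A @ A.T)          # eigenvalues in ascending order
    lam, U = lam[::-1], U[:, ::-1]            # reorder to descending
    sigma = np.sqrt(np.clip(lam, 0.0, None))  # clip round-off negatives
    r = int(np.sum(sigma > tol * sigma[0]))   # numerical rank
    V = (A.T @ U[:, :r]) / sigma[:r]          # v_i = A^T u_i / sigma_i
    return U[:, :r], sigma[:r], V
```

For a $10000 \times 100$ input this eigendecomposes a $100 \times 100$ matrix, and for a $100 \times 10000$ input it does the same through the other product; `(U * sigma) @ V.T` reconstructs $A$ in both cases.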

Learning Resources

SVD computation methods

Steve Brunton

Practical SVD computation via covariance and Gram matrices.

20 min

Computing the SVD step by step

MIT OpenCourseWare

Worked examples of computing SVD from spectral decompositions.

45 min

Quiz

Question 1

For a $100 \times 10000$ matrix $A$, which path is more efficient: the covariance path ($AA^\top$) or the Gram path ($A^\top A$)?

Question 2

Both the covariance-path and Gram-path algorithms produce the same SVD.

Common Mistakes

Computing $AA^\top$ when you want the Gram matrix: here the Gram matrix is $A^\top A$ and the covariance matrix is $AA^\top$.
Not verifying $A = U\Sigma V^\top$ at the end.
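The final verification can be automated with a few checks. The helper below is a hypothetical sketch (the name `check_svd` and the tolerance `atol` are assumptions); it tests the reconstruction, the orthogonality of $U$ and $V$, and that the diagonal of $\Sigma$ is nonnegative and non-increasing.

```python
import numpy as np

def check_svd(A, U, Sigma, V, atol=1e-8):
    """Return True if U, Sigma, V look like a valid full SVD of A."""
    A = np.asarray(A, dtype=float)
    m, n = A.shape
    # Reconstruction: A should equal U Sigma V^T.
    ok_recon = np.allclose(U @ Sigma @ V.T, A, atol=atol)
    # Orthogonality: U is m x m and V is n x n with orthonormal columns.
    ok_U = np.allclose(U.T @ U, np.eye(m), atol=atol)
    ok_V = np.allclose(V.T @ V, np.eye(n), atol=atol)
    # Singular values: nonnegative and sorted in descending order.
    d = np.diag(Sigma)
    ok_sigma = bool(np.all(d >= 0) and np.all(np.diff(d) <= atol))
    return bool(ok_recon and ok_U and ok_V and ok_sigma)
```

A quick way to try it is to feed it factors from `np.linalg.svd(A)`, placing the returned singular values on the diagonal of an $m \times n$ zero matrix and passing the transpose of the returned `Vh` as `V`.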