Linear Algebra
12.148 min read

Quadratic Case: Critical Points

For a quadratic function f(x)=xTAx+bTx+cf(\mathbf{x}) = \mathbf{x}^T A \mathbf{x} + \mathbf{b}^T \mathbf{x} + c (symmetric AA), the gradient is f(x)=2Ax+b\nabla f(\mathbf{x}) = 2A\mathbf{x} + \mathbf{b}.

Setting the gradient to zero: 2Ax+b=02A\mathbf{x} + \mathbf{b} = \mathbf{0}, giving Ax=b/2A\mathbf{x} = -\mathbf{b}/2. If AA is invertible, the unique critical point is x=A1b/2\mathbf{x}^* = -A^{-1}\mathbf{b}/2.

If AA is singular (not invertible): the system either has no solution (if b/2col(A)\mathbf{b}/2 \notin \text{col}(A), meaning no critical points) or infinitely many solutions (if b/2col(A)\mathbf{b}/2 \in \text{col}(A), meaning an entire subspace of critical points).

Formal View

Theorem 12.8 — Critical Point of a Quadratic
For f(x)=xTAx+bTx+cf(\mathbf{x}) = \mathbf{x}^T A \mathbf{x} + \mathbf{b}^T \mathbf{x} + c with symmetric AA: the critical points satisfy 2Ax=b2A\mathbf{x} = -\mathbf{b}. If AA is invertible, the unique critical point is x=12A1b\mathbf{x}^* = -\frac{1}{2}A^{-1}\mathbf{b}.

Interactive Visualization

Eigenvector Explorer

Why This Matters

Quadratic optimization is the backbone of all second-order methods and regularized regression.

  • Least squares has a quadratic objective whose unique critical point is the normal equation solution
  • Ridge regression adds λx2\lambda\|\mathbf{x}\|^2 making the Hessian ATA+λIA^T A + \lambda I positive definite and invertible
  • All quadratic programming (QP) solvers exploit this structure

Quiz

Question 1

For f(x)=xTAx+bTxf(\mathbf{x}) = \mathbf{x}^T A \mathbf{x} + \mathbf{b}^T \mathbf{x} with invertible symmetric AA, the unique critical point is:

Question 2

If AA is singular and bcol(A)\mathbf{b} \notin \text{col}(A), then f(x)=xTAx+bTxf(\mathbf{x}) = \mathbf{x}^T A \mathbf{x} + \mathbf{b}^T \mathbf{x} has no critical points.

Common Mistakes

  • Forgetting the factor of 2 in (xTAx)=2Ax\nabla(\mathbf{x}^T A \mathbf{x}) = 2A\mathbf{x}.
  • Assuming a unique critical point exists without checking invertibility of AA.
  • Confusing the critical point with the Hessian — the critical point is a vector, the Hessian is a matrix.