16.37 min read

The Hessian Matrix

Just as the gradient vector organizes all first partial derivatives, the Hessian matrix organizes all second partial derivatives into an $n \times n$ matrix.

The $(i,j)$ entry of the Hessian is $\frac{\partial^2 f}{\partial x_i \partial x_j}$ . Because mixed partials are equal (Clairaut's theorem), the Hessian is always symmetric for $f \in D^2$ .

The Hessian captures the "curvature" of $f$ at a point. Just as the gradient tells you which direction $f$ increases fastest (first-order information), the Hessian tells you how fast that rate is changing (second-order information).

For $f(x,y) = xy^3 + x^2 - y$ , the Hessian function is $Hf(x,y) = \begin{bmatrix} 2 & 3y^2 \\ 3y^2 & 6xy \end{bmatrix}$ — a matrix-valued function of $(x,y)$ .

Formal View

Definition 16.3 — Hessian Matrix

Given

f(\mathbf{x})

\mathbb{R}^n

with all second partials at

\mathbf{x}_0

, the Hessian

Hf(\mathbf{x}_0)

is the

n \times n

matrix with

(i,j)

entry

\frac{\partial^2 f}{\partial x_i \partial x_j}(\mathbf{x}_0)

. When

f \in D^2

Hf

is symmetric.

Example 16.1 — Computing the Hessian

For

f(x,y) = xy^3 + x^2 - y

: first partials are

f_x = y^3 + 2x

and

f_y = 3xy^2 - 1

. Second partials:

f_{xx} = 2

f_{xy} = f_{yx} = 3y^2

f_{yy} = 6xy

. Hessian:

Hf(x,y) = \begin{bmatrix} 2 & 3y^2 \\ 3y^2 & 6xy \end{bmatrix}

Why This Matters

The Hessian is the multivariable analog of the second derivative $f''(x)$ in single-variable calculus — the key object for understanding curvature and classifying critical points.

Newton's method updates: $\mathbf{x}_{k+1} = \mathbf{x}_k - [Hf(\mathbf{x}_k)]^{-1} \nabla f(\mathbf{x}_k)$
Quasi-Newton methods (L-BFGS) approximate the Hessian to speed up optimization
Second-order sensitivity analysis in engineering and economics
Hessian-vector products used in efficient curvature computation in deep learning

Learning Resources

The Hessian matrix

Khan Academy

Introduction to the Hessian matrix with step-by-step computation.

10 min

Second derivatives and the Hessian matrix

Steve Brunton

Places the Hessian in the context of optimization and second-order methods.

14 min

Quiz

Question 1

The Hessian $Hf(\mathbf{x}_0)$ is always symmetric when:

Question 2

For $f$ on $\mathbb{R}^3$ , what is the size of the Hessian matrix?

Question 3

For $f(x,y) = 3x^2 + 2xy + 5y^2$ , what is $Hf$ ?

Question 4

The Hessian $Hf$ at a point $\mathbf{x}_0$ is a single number (scalar).

Common Mistakes

Confusing the Hessian with the Jacobian — the Jacobian is the matrix of first partials, the Hessian is the matrix of second partials.
Forgetting to divide by 2 when reconstructing $f$ from its Hessian (see the quadratic section).
Thinking the Hessian is always constant — it depends on position unless $f$ is quadratic.