The Gradient Vector

The gradient of a scalar function $f: \mathbb{R}^n \to \mathbb{R}$ at a point $\mathbf{a}$ is the column vector of all first-order partial derivatives of $f$, evaluated at $\mathbf{a}$: $\nabla f(\mathbf{a}) = \begin{bmatrix} \partial f/\partial x_1(\mathbf{a}) \\ \vdots \\ \partial f/\partial x_n(\mathbf{a}) \end{bmatrix}$.
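
The definition can be checked numerically. The following is a minimal sketch, assuming a hypothetical example function $f(x, y) = x^2 + 3xy$ and a central-difference approximation of each partial derivative; the helper name `numerical_gradient` and the step size `h` are illustrative choices, not part of the lesson.

```python
def numerical_gradient(f, a, h=1e-6):
    """Approximate the gradient of f at point a with central differences."""
    grad = []
    for i in range(len(a)):
        forward = list(a)
        backward = list(a)
        forward[i] += h   # nudge only the i-th coordinate up
        backward[i] -= h  # and down
        grad.append((f(forward) - f(backward)) / (2 * h))
    return grad


# Hypothetical example function: f(x, y) = x^2 + 3xy
def f(x):
    return x[0] ** 2 + 3 * x[0] * x[1]


# Its gradient worked out by hand: (2x + 3y, 3x)
def analytic_gradient(x):
    return [2 * x[0] + 3 * x[1], 3 * x[0]]


a = [1.0, 2.0]
approx = numerical_gradient(f, a)
exact = analytic_gradient(a)  # [8.0, 3.0]
```

Each entry of `approx` should agree with the matching entry of `exact` up to a small finite-difference error, one entry per coordinate of $\mathbf{a}$.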

The gradient has a fundamental geometric property: when $f$ is differentiable at $\mathbf{a}$ and $\nabla f(\mathbf{a}) \neq \mathbf{0}$, it points in the direction of steepest increase of $f$ at $\mathbf{a}$. More precisely, among all unit vectors $\mathbf{u}$, the one maximizing the directional derivative $D_\mathbf{u} f(\mathbf{a}) = \nabla f(\mathbf{a}) \cdot \mathbf{u}$ is $\mathbf{u} = \nabla f(\mathbf{a})/\|\nabla f(\mathbf{a})\|$, by the Cauchy-Schwarz inequality.
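
A brute-force check can make the steepest-increase property concrete. The sketch below assumes a hypothetical gradient value $(3, 4)$ at some point, samples unit directions in the plane, and records which one gives the largest value of $\nabla f \cdot \mathbf{u}$; the sample count `n` is arbitrary.

```python
import math

# Hypothetical gradient of f at some point a, chosen for illustration
grad = (3.0, 4.0)
grad_norm = math.hypot(grad[0], grad[1])  # 5.0

best_value = -math.inf
best_u = None
n = 3600  # number of sampled unit directions
for k in range(n):
    theta = 2 * math.pi * k / n
    u = (math.cos(theta), math.sin(theta))      # unit vector at angle theta
    value = grad[0] * u[0] + grad[1] * u[1]     # directional derivative grad . u
    if value > best_value:
        best_value, best_u = value, u

# Direction predicted by the theory: the normalized gradient
normalized = (grad[0] / grad_norm, grad[1] / grad_norm)
```

Up to the sampling resolution, `best_u` should match `normalized` and `best_value` should approach, but never exceed, `grad_norm`.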

The gradient is also perpendicular to the level sets of $f$: if $f(\mathbf{x}) = c$ defines a level surface, then $\nabla f$ is normal to that surface wherever it is nonzero. This is because $f$ stays constant along the level surface, so $D_\mathbf{u} f = \nabla f \cdot \mathbf{u} = 0$ for every direction $\mathbf{u}$ tangent to it, which means $\mathbf{u} \perp \nabla f$.
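
As an illustration of the perpendicularity claim, the sketch below assumes the hypothetical function $f(x, y) = x^2 + y^2$, whose level sets are circles centered at the origin. At several points on the circle of radius 2 it takes the dot product of the gradient with the circle's tangent direction.

```python
import math


def grad_f(x, y):
    # Gradient of the hypothetical function f(x, y) = x^2 + y^2
    return (2 * x, 2 * y)


r = 2.0  # the level set f = 4 is the circle of radius 2
dots = []
for k in range(8):
    t = 2 * math.pi * k / 8
    point = (r * math.cos(t), r * math.sin(t))  # point on the level set
    tangent = (-math.sin(t), math.cos(t))       # unit tangent to the circle there
    g = grad_f(*point)
    dots.append(g[0] * tangent[0] + g[1] * tangent[1])
```

Every entry of `dots` should be zero up to floating-point rounding, which is the statement $\mathbf{u} \perp \nabla f$ for tangent directions $\mathbf{u}$.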

Formal View

Definition 12.2 — Gradient Vector

The gradient of $f: D \subseteq \mathbb{R}^n \to \mathbb{R}$ at $\mathbf{a}$ is the column vector $\nabla f(\mathbf{a}) = \begin{bmatrix}\partial f/\partial x_1(\mathbf{a}) \\ \vdots \\ \partial f/\partial x_n(\mathbf{a})\end{bmatrix} = (Df(\mathbf{a}))^T$.

Theorem 12.5 — Gradient Points Toward Steepest Ascent

For differentiable $f$ with $\nabla f(\mathbf{a}) \neq \mathbf{0}$, $\max_{\|\mathbf{u}\|=1} D_\mathbf{u} f(\mathbf{a}) = \|\nabla f(\mathbf{a})\|$, achieved at $\mathbf{u}^* = \nabla f(\mathbf{a})/\|\nabla f(\mathbf{a})\|$.

The steepest descent direction is $-\nabla f(\mathbf{a})/\|\nabla f(\mathbf{a})\|$.
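
One common use of the steepest descent direction is iterative minimization. The sketch below is a plain fixed-step gradient descent on a hypothetical quadratic objective. The objective, the step size, and the iteration count are all assumptions chosen so the example converges. Each update moves along $-\nabla f$ scaled by the step size, not along the normalized direction.

```python
def f(x, y):
    # Hypothetical objective with its minimum at (1, -2)
    return (x - 1) ** 2 + 2 * (y + 2) ** 2


def grad_f(x, y):
    # Gradient of f, worked out by hand
    return (2 * (x - 1), 4 * (y + 2))


def gradient_descent(start, step=0.1, iterations=200):
    """Repeatedly step against the gradient from the given start point."""
    x, y = start
    for _ in range(iterations):
        gx, gy = grad_f(x, y)
        # Move opposite to the gradient; the move is proportional to the
        # gradient's magnitude because the direction is not normalized.
        x -= step * gx
        y -= step * gy
    return x, y


x_min, y_min = gradient_descent((5.0, 5.0))
```

For this objective and step size the iterates contract toward $(1, -2)$. A step that is too large can make the same loop diverge, so the step size here should be read as a tuning assumption.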

Why This Matters

The gradient is one of the central objects of applied mathematics: most continuous optimization methods either use it directly or approximate it.

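Machine learning: backpropagation computes the gradient of a loss function with respect to the model parameters
Physics: the electrostatic field is the negative gradient of the electric potential; a conservative force is the negative gradient of its potential energy
Computer graphics: the gradient of a signed distance function, evaluated on the surface, gives the surface normal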
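
The computer graphics case is easy to sketch. The example below assumes a hypothetical signed distance function for a unit sphere centered at the origin and estimates the normal at a surface point by normalizing a central-difference gradient. The names `sphere_sdf` and `sdf_normal` are illustrative and do not come from any particular graphics library.

```python
import math


def sphere_sdf(p, radius=1.0):
    # Signed distance from point p to a sphere centered at the origin
    return math.sqrt(p[0] ** 2 + p[1] ** 2 + p[2] ** 2) - radius


def sdf_normal(sdf, p, h=1e-5):
    """Estimate the unit normal as the normalized central-difference gradient."""
    grad = []
    for i in range(3):
        forward = list(p)
        backward = list(p)
        forward[i] += h
        backward[i] -= h
        grad.append((sdf(forward) - sdf(backward)) / (2 * h))
    length = math.sqrt(sum(g * g for g in grad))
    return [g / length for g in grad]


surface_point = [0.6, 0.0, 0.8]  # lies on the unit sphere
normal = sdf_normal(sphere_sdf, surface_point)
```

For a sphere centered at the origin the outward normal at a surface point is the point itself divided by its length, so `normal` should be close to `surface_point` here.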

Learning Resources

The Gradient (3Blue1Brown, 16 min): Essence of calculus perspective on gradients and their geometric meaning.

Gradient and Level Curves (Khan Academy, 8 min): Understanding gradients and their perpendicularity to level sets.

Quiz

Question 1

The gradient $\nabla f(\mathbf{a})$ is perpendicular to:

Question 2

The maximum rate of increase of $f$ at $\mathbf{a}$ is:

Common Mistakes

Confusing the gradient (a column vector) with the Jacobian of $f$ (a row vector): they are transposes of each other.
Thinking the gradient points toward the nearest maximum: it gives only the locally steepest direction, which need not be the direction of any maximum.
Forgetting that the gradient is zero at critical points, where the steepest-ascent interpretation breaks down.
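
The second mistake can be seen in a small example. The sketch below assumes the hypothetical function $f(x, y) = -(x^2 + 10y^2)$, whose only maximum is at the origin, and compares the gradient direction at $(1, 1)$ with the straight-line direction toward that maximum.

```python
import math


def grad_f(x, y):
    # Gradient of the hypothetical function f(x, y) = -(x^2 + 10 y^2)
    return (-2 * x, -20 * y)


point = (1.0, 1.0)
maximum = (0.0, 0.0)  # the unique maximizer of f

g = grad_f(*point)                                        # (-2.0, -20.0)
to_max = (maximum[0] - point[0], maximum[1] - point[1])   # (-1.0, -1.0)

# Cosine of the angle between the gradient and the direction to the maximum
cos_angle = (g[0] * to_max[0] + g[1] * to_max[1]) / (
    math.hypot(*g) * math.hypot(*to_max)
)
```

Here `cos_angle` comes out at about 0.77, clearly below 1, so the gradient at $(1, 1)$ is tilted away from the line to the maximum even though the function has no other critical points.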