12.610 min read
The Gradient Vector
The gradient of a scalar function at is the column vector of all partial derivatives:
The gradient has a fundamental geometric property: it points in the direction of steepest increase of at . More precisely, among all unit directions, the one maximizing is , by Cauchy-Schwarz.
The gradient is also perpendicular to level sets of : if defines a level surface, then is normal to that surface. This is because moving along the level surface gives , which means .
Formal View
Definition 12.2 — Gradient Vector
The gradient of at is the column vector
Theorem 12.5 — Gradient Points Toward Steepest Ascent
For differentiable with :
achieved at .
The steepest descent direction is .
Why This Matters
The gradient is arguably the most important object in applied mathematics, underpinning virtually all optimization methods.
- Machine learning: backpropagation computes gradients of loss functions
- Physics: electric field is the negative gradient of potential; force is negative gradient of energy
- Computer graphics: gradient of a signed distance function gives surface normals
Quiz
Question 1
The gradient is perpendicular to:
Question 2
The maximum rate of increase of at is:
Common Mistakes
- Confusing the gradient (column vector) with the Jacobian row vector — they are transposes of each other.
- Thinking the gradient points toward the nearest maximum — it gives the locally steepest direction, not global direction.
- Forgetting that the gradient is zero at critical points, where the steepest-ascent interpretation breaks down.