17.49 min read

The Lagrange Condition — One Constraint

We showed that at a constrained minimum, $\text{Null}(Dg(\mathbf{x}_0)) \subseteq \text{Null}(Df(\mathbf{x}_0))$ . In most situations (when $Df(\mathbf{x}_0) \neq \mathbf{0}$ ), both null spaces have the same dimension $n-1$ , so they must be equal.

Two row vectors with the same null space must be scalar multiples of each other (basic linear algebra). So $Df(\mathbf{x}_0) = \lambda \, Dg(\mathbf{x}_0)$ for some scalar $\lambda$ .

Equivalently: $\nabla f(\mathbf{x}_0) = \lambda \nabla g(\mathbf{x}_0)$ . The gradients are parallel. Geometrically: the $f$ -isosurface and the constraint surface $S$ are tangent to each other at $\mathbf{x}_0$ — they touch and share a tangent plane.

Intuition: if you were hiking and had to stay on a contour of $g$ , you reach a highest or lowest $f$ -contour exactly when the $f$ -contours become tangent to your allowed path. At that point, you cannot move along $g$ without crossing $f$ -contours in both directions.

The scalar $\lambda$ is called a Lagrange multiplier. We do not particularly care about its value — we just care that it exists.

Formal View

Theorem 17.2 — Lagrange Theorem (One Constraint)

Let

f, g \in C^1

. Suppose

\mathbf{x}_0

is a minimum of the constrained problem

\min f

s.t.

g(\mathbf{x}) = k

, and

\text{Rank}(Dg(\mathbf{x}_0)) = 1

. Then there exists a scalar

\lambda

(the Lagrange multiplier) such that:

\lambda \, Dg(\mathbf{x}_0) = Df(\mathbf{x}_0) \qquad \text{equivalently:} \qquad \nabla f(\mathbf{x}_0) = \lambda \nabla g(\mathbf{x}_0)

This is a necessary condition only — not every point satisfying it is a constrained minimum. Such points are called Lagrange points; constrained optima must be among them.

Interactive Visualization

Lagrange Multiplier Visualizer

Why This Matters

The Lagrange condition converts a constrained optimization problem into a system of equations — which can be solved to find all candidate optima.

Finding shortest path on a surface (geodesic problems)
Maximizing utility subject to a budget constraint in economics
Principal component analysis: finds the eigenvector of the covariance matrix
Proving symmetric matrices have eigenvalues (spectral theorem — next section)

Learning Resources

Lagrange multipliers introduction

3Blue1Brown

Beautiful visual derivation of the Lagrange condition via tangent level curves.

13 min

Lagrange multipliers, using tangent planes

Khan Academy

Derives and applies the Lagrange condition step by step.

12 min

Quiz

Question 1

The Lagrange condition $\nabla f = \lambda \nabla g$ means:

Question 2

How many equations does the Lagrange condition $\nabla f(\mathbf{x}) = \lambda \nabla g(\mathbf{x})$ give (for $\mathbf{x} \in \mathbb{R}^n$ )?

Question 3

Every point satisfying the Lagrange condition is a constrained minimum.

Question 4

Geometrically, the Lagrange condition at $\mathbf{x}_0$ means:

Question 5

Why does it not matter what value $\lambda$ takes?

Common Mistakes

Solving $\nabla f = \lambda \nabla g$ and reporting those points as the answer without checking them against the constraint $g(\mathbf{x}) = k$ .
Treating $\lambda$ as meaningful — it is just the proportionality constant, often discarded.
Forgetting the necessary vs. sufficient direction: the Lagrange condition is necessary for a minimum, not sufficient.