Linear Algebra
12.47 min read

U-Derivative from the Jacobian

We now have a clean computational workflow for directional derivatives. Given ff and a point a\mathbf{a}: (1) compute all partial derivatives at a\mathbf{a}, assembling them into the Jacobian row vector Df(a)=[1f(a),,nf(a)]Df(\mathbf{a}) = [\partial_1 f(\mathbf{a}), \ldots, \partial_n f(\mathbf{a})]; (2) for any unit direction u\mathbf{u}, compute Duf(a)=Df(a)uD_\mathbf{u} f(\mathbf{a}) = Df(\mathbf{a})\mathbf{u} (matrix-vector product).

The matrix-vector product interpretation is powerful: the Jacobian row vector "dot products" with any direction to give the directional derivative. This is exactly the action of a linear functional on a vector.

For a vector-valued function f:RnRm\mathbf{f}: \mathbb{R}^n \to \mathbb{R}^m, the directional derivative in direction u\mathbf{u} is Duf(a)=Jf(a)uRmD_\mathbf{u} \mathbf{f}(\mathbf{a}) = J\mathbf{f}(\mathbf{a})\mathbf{u} \in \mathbb{R}^m — a vector of directional derivatives, one for each component.

Formal View

Theorem 12.3 — U-Derivative via Jacobian Multiplication
For differentiable f:RnRm\mathbf{f}: \mathbb{R}^n \to \mathbb{R}^m and unit vector u\mathbf{u}:
Duf(a)=Jf(a)uD_\mathbf{u}\mathbf{f}(\mathbf{a}) = J\mathbf{f}(\mathbf{a})\,\mathbf{u}
For scalar ff (m=1m=1), this gives the scalar Duf(a)=f(a)uD_\mathbf{u} f(\mathbf{a}) = \nabla f(\mathbf{a}) \cdot \mathbf{u}.

Interactive Visualization

Matrix-Vector Multiplication

Why This Matters

Framing directional derivatives as matrix-vector products connects analysis to linear algebra.

  • Automatic differentiation: Jacobian-vector products (JVPs) and vector-Jacobian products (VJPs) are the two modes of AD
  • Adjoint methods in PDE-constrained optimization use Jacobian transposes
  • Sensitivity analysis: how does the output change when inputs change in a specific direction?

Quiz

Question 1

For f:R3R2\mathbf{f}: \mathbb{R}^3 \to \mathbb{R}^2 with Jacobian J=[120301]J = \begin{bmatrix}1&2&0\\3&0&1\end{bmatrix} at a\mathbf{a}, what is Duf(a)D_\mathbf{u}\mathbf{f}(\mathbf{a}) for u=(1,0,0)T\mathbf{u}=(1,0,0)^T?

Common Mistakes

  • Computing uJ\mathbf{u} J (wrong order) instead of JuJ\mathbf{u} — matrix multiplication order matters.
  • For scalar functions, confusing the gradient column vector with the Jacobian row vector in the dot product.