
The Jacobian Matrix

For a scalar function $f: \mathbb{R}^n \to \mathbb{R}$, the derivative at a point is a $1 \times n$ row vector of partial derivatives. The Jacobian matrix generalizes this to vector-valued functions $\mathbf{f}: \mathbb{R}^n \to \mathbb{R}^m$.

If $\mathbf{f}(\mathbf{x}) = (f_1(\mathbf{x}), \ldots, f_m(\mathbf{x}))$, then the Jacobian is the $m \times n$ matrix whose $(i,j)$ entry is $\partial f_i / \partial x_j$:

$D\mathbf{f}(\mathbf{a}) = J\mathbf{f}(\mathbf{a}) = \begin{bmatrix} \partial f_1/\partial x_1 & \cdots & \partial f_1/\partial x_n \\ \vdots & \ddots & \vdots \\ \partial f_m/\partial x_1 & \cdots & \partial f_m/\partial x_n \end{bmatrix}_{\mathbf{x}=\mathbf{a}}$

Row $i$ of the Jacobian is the (transposed) gradient of component $f_i$. Column $j$ is the vector of partial derivatives with respect to $x_j$: it shows how each output changes when only input $x_j$ changes.
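
As a concrete illustration (the function here is chosen for the example and does not come from the text above), take $\mathbf{f}: \mathbb{R}^2 \to \mathbb{R}^2$ with $\mathbf{f}(x, y) = (x^2 y,\; 5x + \sin y)$. Differentiating each component with respect to each input gives

$J\mathbf{f}(x, y) = \begin{bmatrix} 2xy & x^2 \\ 5 & \cos y \end{bmatrix}$

The first row is the gradient of $f_1 = x^2 y$ and the second row is the gradient of $f_2 = 5x + \sin y$. The first column collects the derivatives with respect to $x$, the second those with respect to $y$. At $\mathbf{a} = (1, 2)$ the matrix evaluates to $\begin{bmatrix} 4 & 1 \\ 5 & \cos 2 \end{bmatrix}$.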

Formal View

Definition 11.10 — Jacobian Matrix

Let $\mathbf{f}: D \subseteq \mathbb{R}^n \to \mathbb{R}^m$ with component functions $f_1, \ldots, f_m$. The Jacobian matrix of $\mathbf{f}$ at $\mathbf{a}$ is

$J\mathbf{f}(\mathbf{a}) = \begin{bmatrix} \nabla^T f_1(\mathbf{a}) \\ \vdots \\ \nabla^T f_m(\mathbf{a}) \end{bmatrix} \in \mathbb{R}^{m \times n}$

where each row is the gradient (transposed) of one output component.

Also written $D\mathbf{f}(\mathbf{a})$, $\partial\mathbf{f}/\partial\mathbf{x}(\mathbf{a})$, or $J_\mathbf{f}(\mathbf{a})$.
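
The definition can be checked numerically. The following is a minimal sketch, not a reference implementation: it approximates each partial derivative with a central difference and compares the result with a hand-computed Jacobian. The map `f`, the helper names `numerical_jacobian` and `analytic_jacobian`, the point `a`, and the step size `eps` are all assumptions made for this illustration.

```python
import math


def numerical_jacobian(f, a, eps=1e-6):
    """Approximate the m x n Jacobian of f at a with central differences.

    f maps a list of n floats to a list of m floats. The result is a list
    of m rows with n entries each, so J[i][j] approximates df_i/dx_j.
    """
    n = len(a)
    m = len(f(a))
    J = [[0.0] * n for _ in range(m)]
    for j in range(n):
        # Perturb only input x_j; this fills column j of the Jacobian.
        forward = list(a)
        backward = list(a)
        forward[j] += eps
        backward[j] -= eps
        f_forward = f(forward)
        f_backward = f(backward)
        for i in range(m):
            J[i][j] = (f_forward[i] - f_backward[i]) / (2 * eps)
    return J


def f(x):
    # Illustrative map from R^3 to R^2 (not taken from the text).
    return [x[0] * x[1], math.sin(x[2]) + x[0] ** 2]


def analytic_jacobian(x):
    # Row i is the gradient of f_i; column j holds derivatives in x_j.
    return [
        [x[1], x[0], 0.0],
        [2 * x[0], 0.0, math.cos(x[2])],
    ]


a = [1.0, 2.0, 0.5]
J_num = numerical_jacobian(f, a)
J_exact = analytic_jacobian(a)
max_error = max(
    abs(J_num[i][j] - J_exact[i][j]) for i in range(2) for j in range(3)
)
print(len(J_num), "x", len(J_num[0]))  # rows x columns of the Jacobian
print(max_error)
```

Because `f` takes three inputs and returns two outputs, the matrix has two rows and three columns. Filling it one column at a time mirrors the column interpretation: each column records the response of all outputs to a change in a single input.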

Theorem 11.3 — LLA via Jacobian

If $\mathbf{f}: \mathbb{R}^n \to \mathbb{R}^m$ is differentiable at $\mathbf{a}$, then

$\mathbf{f}(\mathbf{a}+\mathbf{h}) \approx \mathbf{f}(\mathbf{a}) + J\mathbf{f}(\mathbf{a})\,\mathbf{h}$

with error $o(\|\mathbf{h}\|)$. The Jacobian is the unique $m \times n$ matrix satisfying this approximation.
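
One way to see what an $o(\|\mathbf{h}\|)$ error means is to shrink $\mathbf{h}$ and watch the ratio of error to $\|\mathbf{h}\|$. The sketch below does this for an illustrative map that is not taken from the text. The function `f`, its hand-computed `jacobian`, the base point `a`, and the step direction are assumptions chosen for the example.

```python
import math


def f(x):
    # Illustrative map from R^2 to R^2 (not taken from the text).
    return [x[0] ** 2 * x[1], 5 * x[0] + math.sin(x[1])]


def jacobian(x):
    # Analytic Jacobian of f: entry (i, j) is df_i/dx_j.
    return [
        [2 * x[0] * x[1], x[0] ** 2],
        [5.0, math.cos(x[1])],
    ]


def linear_approx(a, h):
    # f(a) + Jf(a) h, with the matrix-vector product written out.
    fa = f(a)
    J = jacobian(a)
    return [
        fa[i] + sum(J[i][j] * h[j] for j in range(len(h)))
        for i in range(len(fa))
    ]


def norm(v):
    return math.sqrt(sum(c * c for c in v))


a = [1.0, 2.0]
direction = [0.6, -0.8]  # unit vector, so ||h|| is the scale t
ratios = []
for t in [1e-1, 1e-2, 1e-3]:
    h = [t * d for d in direction]
    exact = f([a[0] + h[0], a[1] + h[1]])
    approx = linear_approx(a, h)
    error = norm([exact[i] - approx[i] for i in range(2)])
    ratios.append(error / norm(h))
    print(t, error, ratios[-1])
```

The ratio `error / norm(h)` should shrink as `t` shrinks, which is the behavior the theorem describes: the error vanishes faster than $\|\mathbf{h}\|$ itself.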

[Interactive visualization: Matrix-Vector Multiplication]

Why This Matters

The Jacobian is a central object of multivariable calculus: it encodes all first-order information about a vector-valued map.

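Neural networks: backpropagation propagates gradients through the layers by multiplying by each layer's Jacobian, usually as vector-Jacobian products rather than explicit matrices
Robotics: the manipulator Jacobian maps joint velocities to end-effector velocities
Integration: for $m = n$, the absolute value of the Jacobian determinant, $|\det J\mathbf{f}|$, is the local volume scaling factor in the change-of-variables formula for integrals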
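
The change-of-variables role can be made concrete with polar coordinates, a standard example that is not worked in the text above. For the map $\mathbf{g}(r, \theta) = (r\cos\theta,\; r\sin\theta)$, whose name $\mathbf{g}$ is introduced here only for illustration, the Jacobian and its determinant are

$J\mathbf{g}(r, \theta) = \begin{bmatrix} \cos\theta & -r\sin\theta \\ \sin\theta & r\cos\theta \end{bmatrix}, \qquad \det J\mathbf{g}(r, \theta) = r\cos^2\theta + r\sin^2\theta = r$

so for $r > 0$ an area element transforms as $dx\,dy = r\,dr\,d\theta$, which is where the familiar factor of $r$ in polar integrals comes from.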

Learning Resources

The Jacobian Matrix (3Blue1Brown, 14 min): Visual explanation of the Jacobian as the derivative for vector-valued functions.
Jacobian Matrix and Determinant (Khan Academy, 12 min): Computing the Jacobian matrix and understanding its geometric meaning.

Quiz

Question 1

For $\mathbf{f}: \mathbb{R}^3 \to \mathbb{R}^2$, what is the shape of the Jacobian matrix $J\mathbf{f}$?

Question 2

The $(i,j)$ entry of $J\mathbf{f}(\mathbf{a})$ is:

Common Mistakes

Transposing the Jacobian — rows correspond to output components, columns to input variables.
Confusing the Jacobian (matrix) with the gradient (vector): for a scalar-valued function the Jacobian is the $1 \times n$ row vector $\nabla^T f$, that is, the transpose of the gradient, which is conventionally written as a column vector.
Computing entries as $\partial f_j/\partial x_i$ instead of $\partial f_i/\partial x_j$ — the first index is the output, the second is the input.