Jacobian: General Definition
The Jacobian matrix can be defined for functions between arbitrary finite-dimensional vector spaces. For , the Jacobian is the unique linear map from to that best approximates near .
The defining property is: . In coordinates, the matrix of this linear map has entry .
For scalar functions (), the Jacobian is a row vector — the gradient transposed. The gradient itself is the column vector . Many authors write for the row vector and for the column vector; they contain the same information.
Formal View
The matrix of in the standard basis is the Jacobian matrix with entry .
Why This Matters
Understanding the Jacobian as a linear map rather than just a matrix clarifies why the chain rule takes the form it does.
- The chain rule for Jacobians mirrors matrix multiplication
- Automatic differentiation systems (JAX, PyTorch) compute Jacobians using the coordinate-free definition
- The inverse function theorem characterizes when a function is locally invertible via the Jacobian determinant
Quiz
For a scalar function , the Jacobian is:
The chain rule for Jacobians states that , which corresponds to matrix multiplication.
Common Mistakes
- Treating the Jacobian and gradient as identical — the gradient is a column vector, the Jacobian is a row vector for scalar functions.
- Forgetting that the Jacobian matrix dimensions depend on both input and output dimensions.
- Not recognizing that the one-variable derivative is a special case of the Jacobian (a matrix).