Approximation Error and Differentiability
We can be precise about how good the local linear approximation is. Define the approximation error . The LLA is "good" if this error is small compared to the size of the displacement .
Formally, is differentiable at if the error is — meaning as . This says the error goes to zero faster than the displacement itself.
In the single-variable case, this reduces to the usual definition of differentiability. The multivariable version is stronger because it requires the linear approximation to be good in every direction simultaneously, not just along the coordinate axes.
Formal View
This definition makes the Jacobian unique when it exists: no other linear map can satisfy this approximation property.
Why This Matters
The error analysis perspective reveals why some numerical methods work and why gradient-based optimization is justified.
- Taylor expansion error bounds: the LLA error tells you when higher-order terms are needed
- Finite-difference approximation accuracy: the condition quantifies truncation error
- Validating that machine-learning training signals (gradients) are meaningful approximations of true descent directions
Quiz
The approximation error satisfies what condition when is differentiable at ?
If for some constant , then is differentiable at .
Common Mistakes
- Confusing (continuity-like) with (differentiability) — the latter is much stronger.
- Assuming differentiability just because the LLA "looks right" at a point without verifying the error condition.
- Thinking the Jacobian uniqueness is obvious — it actually requires proof that no two distinct linear maps can both satisfy the approximation condition.