14.77 min read
Multivariate Setup
The full multivariate chain rule handles the general case: and . The intermediate variables are , and the underlying variables are .
Each output depends on all intermediate variables, which in turn depend on all underlying variables. The total effect of on sums contributions through all intermediate variables .
This "sum of paths" interpretation is the essence of the Leibniz form: . Each term in the sum is the contribution through one intermediate variable.
Formal View
Remark 14.2 — Multivariate Chain Rule: Paths Interpretation
The entry of is:
Each term is a "path" from to through intermediate variable . This is literally the matrix product formula.
Interactive Visualization
Chain Rule Circuit Diagram
Why This Matters
The "sum over paths" interpretation of the chain rule generalizes naturally to computational graphs and backpropagation.
- Computational graphs in deep learning: each path from input to output contributes to the gradient
- Dynamic programming: the Bellman equation sums contributions through all next states
- Chemical kinetics: rate equations sum contributions through all reaction pathways
Quiz
Question 1
In the Leibniz chain rule , how many terms are in the sum for and ?
Common Mistakes
- Summing over the wrong index — the sum is over intermediate variables , not over inputs or outputs .
- Forgetting to evaluate partial derivatives of at , not at .