Not signed in

Want to take part in these discussions? Sign in if you have an account, or apply for one below

Site Tag Cloud

Vanilla 1.1.10 is a product of Lussumo. More Information: Documentation, Community Support.

Welcome to nForum
If you want to take part in these discussions either sign in now (if you have an account), apply for one now (if you don't).

nLab > nLab General Discussions: soft proof of the chain rule

Bottom of Page

1 to 11 of 11

- CommentRowNumber1.
- CommentAuthorTodd_Trimble
- CommentTimeOct 14th 2018
- PermaLink
Author: Todd_Trimble
Format: MarkdownItexThis is more for the sake of amusement than anything else: perhaps the softest imaginable proof of the chain rule for formal power series. Working over a commutative ring $A$, the statement is that if $q, p \in A[ [x] ]$ are power series, with $0$ the constant coefficient of $p$, then $(q \circ p)'(x) = q'(p(x))p'(x)$ under the usual definitions. Let $D = A[y]/(y^2)$ be the representing object for derivations. Let $\delta: A[ [x] ] \to A[ [x] ] \otimes_A D \cong A[ [x] ][y]/(y^2)$ be the unique topological $A$-algebra map (under the $(x)$-adic topologies) that sends $x$ to $x + y$. (If it helps, think $\delta(q) = q(x + y)$.) For $p \in A[ [x] ]$, define $p'$ via the equation $\delta(p) = p(x) + p'(x)y$. Let $\pi: A[ [x] ] \otimes_A D \to A[ [x] ] \otimes_A D$ be the unique topological algebra map taking $x$ to $p(x)$ and $y$ to $p'(x)y$. Let $- \circ p: A[ [x] ] \to A[ [x] ]$ denote the unique topological algebra map that takes $x$ to $p$. Then the diagram $$\array{ A[ [x] ] & \stackrel{\delta}{\to} & A[ [x] ] \otimes_A D \\ \mathllap{- \circ p} \downarrow & & \downarrow \mathrlap{\pi} \\ A[ [x] ] & \underset{\delta}{\to} & A[ [x] ] \otimes_A D }$$ commutes in the category of topological algebras, since the two legs agree when evaluated at the generator $x$. But then, evaluating each leg at a power series $q \in A [ [x]]$, we have $$[\delta(- \circ p)](q) = \delta(q \circ p) = (q \circ p)(x) + (q \circ p)'(x)y$$ and $$[\pi \delta](q) = \pi(\delta(q)) = \pi(q(x) + q'(x)y) = q(p(x)) + q'(p(x))(p'(x)y)$$ whence the coefficients of $y$ agree: $(q \circ p)'(x) = q'(p(x))p'(x)$.

This is more for the sake of amusement than anything else: perhaps the softest imaginable proof of the chain rule for formal power series.

Working over a commutative ring $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>A</mi></mrow><annotation encoding="application/x-tex">A</annotation></semantics></math>$ , the statement is that if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>q</mi><mo>,</mo><mi>p</mi><mo>∈</mo><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">q, p \in A[ [x] ]</annotation></semantics></math>$ are power series, with $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn></mrow><annotation encoding="application/x-tex">0</annotation></semantics></math>$ the constant coefficient of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi></mrow><annotation encoding="application/x-tex">p</annotation></semantics></math>$ , then $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi>q</mi><mo>∘</mo><mi>p</mi><mo stretchy="false">)</mo><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>q</mi><mo>′</mo><mo stretchy="false">(</mo><mi>p</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mi>p</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">(q \circ p)'(x) = q'(p(x))p'(x)</annotation></semantics></math>$ under the usual definitions.

Let $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>D</mi><mo>=</mo><mi>A</mi><mo stretchy="false">[</mo><mi>y</mi><mo stretchy="false">]</mo><mo stretchy="false">/</mo><mo stretchy="false">(</mo><msup><mi>y</mi> <mn>2</mn></msup><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">D = A[y]/(y^2)</annotation></semantics></math>$ be the representing object for derivations. Let $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>δ</mi><mo>:</mo><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo><mo>→</mo><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo><msub><mo>⊗</mo> <mi>A</mi></msub><mi>D</mi><mo>≅</mo><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo><mo stretchy="false">[</mo><mi>y</mi><mo stretchy="false">]</mo><mo stretchy="false">/</mo><mo stretchy="false">(</mo><msup><mi>y</mi> <mn>2</mn></msup><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">\delta: A[ [x] ] \to A[ [x] ] \otimes_A D \cong A[ [x] ][y]/(y^2)</annotation></semantics></math>$ be the unique topological $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>A</mi></mrow><annotation encoding="application/x-tex">A</annotation></semantics></math>$ -algebra map (under the $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">(x)</annotation></semantics></math>$ -adic topologies) that sends $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>+</mo><mi>y</mi></mrow><annotation encoding="application/x-tex">x + y</annotation></semantics></math>$ . (If it helps, think $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>δ</mi><mo stretchy="false">(</mo><mi>q</mi><mo stretchy="false">)</mo><mo>=</mo><mi>q</mi><mo stretchy="false">(</mo><mi>x</mi><mo>+</mo><mi>y</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">\delta(q) = q(x + y)</annotation></semantics></math>$ .) For $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>∈</mo><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">p \in A[ [x] ]</annotation></semantics></math>$ , define $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>′</mo></mrow><annotation encoding="application/x-tex">p'</annotation></semantics></math>$ via the equation $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>δ</mi><mo stretchy="false">(</mo><mi>p</mi><mo stretchy="false">)</mo><mo>=</mo><mi>p</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mi>p</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mi>y</mi></mrow><annotation encoding="application/x-tex">\delta(p) = p(x) + p'(x)y</annotation></semantics></math>$ .

Let $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>π</mi><mo>:</mo><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo><msub><mo>⊗</mo> <mi>A</mi></msub><mi>D</mi><mo>→</mo><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo><msub><mo>⊗</mo> <mi>A</mi></msub><mi>D</mi></mrow><annotation encoding="application/x-tex">\pi: A[ [x] ] \otimes_A D \to A[ [x] ] \otimes_A D</annotation></semantics></math>$ be the unique topological algebra map taking $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">p(x)</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi></mrow><annotation encoding="application/x-tex">y</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mi>y</mi></mrow><annotation encoding="application/x-tex">p'(x)y</annotation></semantics></math>$ . Let $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo lspace="0.11111em" rspace="0em">−</mo><mo>∘</mo><mi>p</mi><mo>:</mo><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo><mo>→</mo><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">- \circ p: A[ [x] ] \to A[ [x] ]</annotation></semantics></math>$ denote the unique topological algebra map that takes $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi></mrow><annotation encoding="application/x-tex">p</annotation></semantics></math>$ . Then the diagram
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mrow><mtable><mtr><mtd><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo></mtd> <mtd><mover><mo>→</mo><mi>δ</mi></mover></mtd> <mtd><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo><msub><mo>⊗</mo> <mi>A</mi></msub><mi>D</mi></mtd></mtr> <mtr><mtd><mpadded width="0px" lspace="-100%width"><mrow><mo lspace="0.11111em" rspace="0em">−</mo><mo>∘</mo><mi>p</mi></mrow></mpadded><mo stretchy="false">↓</mo></mtd> <mtd/> <mtd><mo stretchy="false">↓</mo><mpadded width="0px"><mi>π</mi></mpadded></mtd></mtr> <mtr><mtd><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo></mtd> <mtd><munder><mo>→</mo><mi>δ</mi></munder></mtd> <mtd><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo><msub><mo>⊗</mo> <mi>A</mi></msub><mi>D</mi></mtd></mtr></mtable></mrow></mrow><annotation encoding="application/x-tex">\array{ A[ [x] ] &amp; \stackrel{\delta}{\to} &amp; A[ [x] ] \otimes_A D \\ \mathllap{- \circ p} \downarrow &amp; &amp; \downarrow \mathrlap{\pi} \\ A[ [x] ] &amp; \underset{\delta}{\to} &amp; A[ [x] ] \otimes_A D }</annotation></semantics></math>$
commutes in the category of topological algebras, since the two legs agree when evaluated at the generator $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ . But then, evaluating each leg at a power series $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>q</mi><mo>∈</mo><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">q \in A [ [x]]</annotation></semantics></math>$ , we have
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mo stretchy="false">[</mo><mi>δ</mi><mo stretchy="false">(</mo><mo lspace="0.11111em" rspace="0em">−</mo><mo>∘</mo><mi>p</mi><mo stretchy="false">)</mo><mo stretchy="false">]</mo><mo stretchy="false">(</mo><mi>q</mi><mo stretchy="false">)</mo><mo>=</mo><mi>δ</mi><mo stretchy="false">(</mo><mi>q</mi><mo>∘</mo><mi>p</mi><mo stretchy="false">)</mo><mo>=</mo><mo stretchy="false">(</mo><mi>q</mi><mo>∘</mo><mi>p</mi><mo stretchy="false">)</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mo stretchy="false">(</mo><mi>q</mi><mo>∘</mo><mi>p</mi><mo stretchy="false">)</mo><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mi>y</mi></mrow><annotation encoding="application/x-tex">[\delta(- \circ p)](q) = \delta(q \circ p) = (q \circ p)(x) + (q \circ p)'(x)y</annotation></semantics></math>$
and
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mo stretchy="false">[</mo><mi>π</mi><mi>δ</mi><mo stretchy="false">]</mo><mo stretchy="false">(</mo><mi>q</mi><mo stretchy="false">)</mo><mo>=</mo><mi>π</mi><mo stretchy="false">(</mo><mi>δ</mi><mo stretchy="false">(</mo><mi>q</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mo>=</mo><mi>π</mi><mo stretchy="false">(</mo><mi>q</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mi>q</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mi>y</mi><mo stretchy="false">)</mo><mo>=</mo><mi>q</mi><mo stretchy="false">(</mo><mi>p</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mo>+</mo><mi>q</mi><mo>′</mo><mo stretchy="false">(</mo><mi>p</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mo stretchy="false">(</mo><mi>p</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mi>y</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">[\pi \delta](q) = \pi(\delta(q)) = \pi(q(x) + q'(x)y) = q(p(x)) + q'(p(x))(p'(x)y)</annotation></semantics></math>$
whence the coefficients of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi></mrow><annotation encoding="application/x-tex">y</annotation></semantics></math>$ agree: $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi>q</mi><mo>∘</mo><mi>p</mi><mo stretchy="false">)</mo><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>q</mi><mo>′</mo><mo stretchy="false">(</mo><mi>p</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mi>p</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">(q \circ p)'(x) = q'(p(x))p'(x)</annotation></semantics></math>$ .
- CommentRowNumber2.
- CommentAuthorTodd_Trimble
- CommentTimeOct 14th 2018
- PermaLink
Author: Todd_Trimble
Format: MarkdownItexWell, I'll admit it's a little silly, almost parodic in the manner of *Mathematics Made Difficult*. A *halfway* normal proof might start the same way but then observe that in the ring with square-nilpotent $y$ we have $$\left(p(x) + p'(x)y\right)^n = p(x) + n p(x)^{n-1} p'(x)y$$ say by the binomial theorem, or by an inductive argument. This gives the chain rule $(q \circ p)'(x) = q'(p(x)) \cdot p'(x)$ at least for $q(x) = x^n$. Then extend to all power series $q(x) = \sum_{n \geq 0} a_n x^n$ by linearity, continuity, etc. Of course that's just one approach; there are others (such as 'meta' approaches that observe that the usual analysis proof implies the result in formal algebra). Any of those could be easier to follow than what I produced above. So what's the point? Nothing, really, except that it amused me that one could prove the statement with virtually zero calculation (like the binomial theorem) *anywhere* -- just by sticking to pure concepts and using the universal property of $A[ [x] ]$.

Well, I’ll admit it’s a little silly, almost parodic in the manner of Mathematics Made Difficult. A halfway normal proof might start the same way but then observe that in the ring with square-nilpotent $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi></mrow><annotation encoding="application/x-tex">y</annotation></semantics></math>$ we have
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><msup><mrow><mo>(</mo><mi>p</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mi>p</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mi>y</mi><mo>)</mo></mrow> <mi>n</mi></msup><mo>=</mo><mi>p</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mi>n</mi><mi>p</mi><mo stretchy="false">(</mo><mi>x</mi><msup><mo stretchy="false">)</mo> <mrow><mi>n</mi><mo>−</mo><mn>1</mn></mrow></msup><mi>p</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mi>y</mi></mrow><annotation encoding="application/x-tex">\left(p(x) + p'(x)y\right)^n = p(x) + n p(x)^{n-1} p'(x)y</annotation></semantics></math>$
say by the binomial theorem, or by an inductive argument. This gives the chain rule $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi>q</mi><mo>∘</mo><mi>p</mi><mo stretchy="false">)</mo><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>q</mi><mo>′</mo><mo stretchy="false">(</mo><mi>p</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mo>⋅</mo><mi>p</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">(q \circ p)'(x) = q'(p(x)) \cdot p'(x)</annotation></semantics></math>$ at least for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>q</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><msup><mi>x</mi> <mi>n</mi></msup></mrow><annotation encoding="application/x-tex">q(x) = x^n</annotation></semantics></math>$ . Then extend to all power series $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>q</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><msub><mo lspace="0.16667em" rspace="0.16667em">∑</mo> <mrow><mi>n</mi><mo>≥</mo><mn>0</mn></mrow></msub><msub><mi>a</mi> <mi>n</mi></msub><msup><mi>x</mi> <mi>n</mi></msup></mrow><annotation encoding="application/x-tex">q(x) = \sum_{n \geq 0} a_n x^n</annotation></semantics></math>$ by linearity, continuity, etc. Of course that’s just one approach; there are others (such as ’meta’ approaches that observe that the usual analysis proof implies the result in formal algebra).

Any of those could be easier to follow than what I produced above. So what’s the point? Nothing, really, except that it amused me that one could prove the statement with virtually zero calculation (like the binomial theorem) anywhere – just by sticking to pure concepts and using the universal property of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">A[ [x] ]</annotation></semantics></math>$ .
- CommentRowNumber3.
- CommentAuthorRichard Williamson
- CommentTimeOct 14th 2018
- PermaLink
Author: Richard Williamson
Format: MarkdownItexNice, and good fun! I think that your argument is perfectly easy to follow, indeed a very nice to proceed! However, regarding avoiding use of calculation, I'm not sure this is really the case, for in the following line... > For $p \in A[ [x] ]$, define $p'$ via the equation $\delta(p) = p(x) + p'(x)y$. ...I think one would need the binomial theorem or similar to identify $p'$ with one of the usual definitions. But perhaps I'm missing something?

Nice, and good fun! I think that your argument is perfectly easy to follow, indeed a very nice to proceed! However, regarding avoiding use of calculation, I’m not sure this is really the case, for in the following line…

For $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>∈</mo><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">p \in A[ [x] ]</annotation></semantics></math>$ , define $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>′</mo></mrow><annotation encoding="application/x-tex">p'</annotation></semantics></math>$ via the equation $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>δ</mi><mo stretchy="false">(</mo><mi>p</mi><mo stretchy="false">)</mo><mo>=</mo><mi>p</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mi>p</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mi>y</mi></mrow><annotation encoding="application/x-tex">\delta(p) = p(x) + p'(x)y</annotation></semantics></math>$ .

…I think one would need the binomial theorem or similar to identify $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>′</mo></mrow><annotation encoding="application/x-tex">p'</annotation></semantics></math>$ with one of the usual definitions. But perhaps I’m missing something?
- CommentRowNumber4.
- CommentAuthorTodd_Trimble
- CommentTimeOct 14th 2018
- PermaLink
Author: Todd_Trimble
Format: MarkdownItexThanks! I just meant that with that definition of $p'$, we prove the chain rule with a minimum of calculation. Other properties might require more calculation. Of course, the definition is just the algebraic analogue of something familiar from SDG: if $p: R \to R$ is a function on the line and $D$ denotes the walking tangent vector, then under the Kock-Lawvere axiom which asserts an isomorphism $R \times R \cong R^D$ we may form a composite $$R \stackrel{\Delta}{\to} R \times R \cong R^D \stackrel{p^D}{\to} R^D \cong R \times R$$ which is just $\langle p, p' \rangle: R \to R \times R$. (It's a little easy here to get turned around with algebro-geometric duality, but I believe it works as advertised.) Even with all these hints, though, plus the fact that the chain rule is trying to express the functoriality of the tangent bundle vector $(-)^D$, it *still* took me some time to massage it into the form above.

Thanks! I just meant that with that definition of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>′</mo></mrow><annotation encoding="application/x-tex">p'</annotation></semantics></math>$ , we prove the chain rule with a minimum of calculation. Other properties might require more calculation.

Of course, the definition is just the algebraic analogue of something familiar from SDG: if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>:</mo><mi>R</mi><mo>→</mo><mi>R</mi></mrow><annotation encoding="application/x-tex">p: R \to R</annotation></semantics></math>$ is a function on the line and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>D</mi></mrow><annotation encoding="application/x-tex">D</annotation></semantics></math>$ denotes the walking tangent vector, then under the Kock-Lawvere axiom which asserts an isomorphism $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>R</mi><mo>×</mo><mi>R</mi><mo>≅</mo><msup><mi>R</mi> <mi>D</mi></msup></mrow><annotation encoding="application/x-tex">R \times R \cong R^D</annotation></semantics></math>$ we may form a composite
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>R</mi><mover><mo>→</mo><mi>Δ</mi></mover><mi>R</mi><mo>×</mo><mi>R</mi><mo>≅</mo><msup><mi>R</mi> <mi>D</mi></msup><mover><mo>→</mo><mrow><msup><mi>p</mi> <mi>D</mi></msup></mrow></mover><msup><mi>R</mi> <mi>D</mi></msup><mo>≅</mo><mi>R</mi><mo>×</mo><mi>R</mi></mrow><annotation encoding="application/x-tex">R \stackrel{\Delta}{\to} R \times R \cong R^D \stackrel{p^D}{\to} R^D \cong R \times R</annotation></semantics></math>$
which is just $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">⟨</mo><mi>p</mi><mo>,</mo><mi>p</mi><mo>′</mo><mo stretchy="false">⟩</mo><mo>:</mo><mi>R</mi><mo>→</mo><mi>R</mi><mo>×</mo><mi>R</mi></mrow><annotation encoding="application/x-tex">\langle p, p' \rangle: R \to R \times R</annotation></semantics></math>$ . (It’s a little easy here to get turned around with algebro-geometric duality, but I believe it works as advertised.) Even with all these hints, though, plus the fact that the chain rule is trying to express the functoriality of the tangent bundle vector $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mo lspace="0.11111em" rspace="0em">−</mo><msup><mo stretchy="false">)</mo> <mi>D</mi></msup></mrow><annotation encoding="application/x-tex">(-)^D</annotation></semantics></math>$ , it still took me some time to massage it into the form above.
- CommentRowNumber5.
- CommentAuthorRichard Williamson
- CommentTimeOct 14th 2018
- PermaLink
Author: Richard Williamson
Format: MarkdownItexAh, I didn't realise that this definition of $p'$ was one of the novel aspects, even if inspired by SDG. Then the observation is nicer still!

Ah, I didn’t realise that this definition of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>′</mo></mrow><annotation encoding="application/x-tex">p'</annotation></semantics></math>$ was one of the novel aspects, even if inspired by SDG. Then the observation is nicer still!
- CommentRowNumber6.
- CommentAuthorTodd_Trimble
- CommentTimeOct 14th 2018
- PermaLink
Author: Todd_Trimble
Format: MarkdownItexSorry, I might not have made myself clear. I don't think "my" definition of $p'$ is novel. The part that took massaging was the proof of the chain rule I came up with; specifically, the end run (American slang) around the diagram where I sneak in this map I called $\pi$. That technique might not be novel either, but I can at least say it's not something I'd seen before. I've now added this demonstration to [[chain rule]].

Sorry, I might not have made myself clear. I don’t think “my” definition of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>′</mo></mrow><annotation encoding="application/x-tex">p'</annotation></semantics></math>$ is novel. The part that took massaging was the proof of the chain rule I came up with; specifically, the end run (American slang) around the diagram where I sneak in this map I called $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>π</mi></mrow><annotation encoding="application/x-tex">\pi</annotation></semantics></math>$ . That technique might not be novel either, but I can at least say it’s not something I’d seen before.

I’ve now added this demonstration to chain rule.
- CommentRowNumber7.
- CommentAuthorRichard Williamson
- CommentTimeOct 15th 2018
- PermaLink
Author: Richard Williamson
Format: MarkdownItexAh, I see. Would it be possible to provide some reference for this definition of $p'$? I suppose in a way it is tied up with the identification of the representing object for derivations, but it would be good to make that more explicit.

Ah, I see. Would it be possible to provide some reference for this definition of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>′</mo></mrow><annotation encoding="application/x-tex">p'</annotation></semantics></math>$ ? I suppose in a way it is tied up with the identification of the representing object for derivations, but it would be good to make that more explicit.
- CommentRowNumber8.
- CommentAuthorTodd_Trimble
- CommentTimeOct 15th 2018
- (edited Oct 15th 2018)
- PermaLink
Author: Todd_Trimble
Format: MarkdownItexGiving a reference is a slight embarrassment of riches problem. (If Urs is reading this, he might know a canonical reference right away.) Let me show the definition gives the correct answer in the mother of all contexts which is that of real-valued $C^\infty$ functions, rather than power series with coefficients in a commutative ring $A$. In other words, replace $A$ with $\mathbb{R}$ and $A[ [x] ]$ with the ring $C = C^\infty(\mathbb{R})$ of $C^\infty$ functions on $\mathbb{R}$. (Doing it with polynomials would probably be equally convincing.) So again we have the ring of dual numbers $D = \mathbb{R}[y]/(y^2)$. Given a point $x_0 \in \mathbb{R}$, [[Hadamard's lemma]] allows us to expand any function $f(x) \in C$ in the form $f(x_0) + a_1(x - x_0) + g(x)(x - x_0)^2$ for some unique number $a_1$ and function $g(x) \in C$, and $f'(x_0)$ is according to standard definition this value $a_1$. In other words, there is by this lemma a canonical isomorphism $$C/(x - x_0)^2C \cong D$$ which maps the residue class of $f(x)$ to $f(x_0) + f'(x_0)y \in D$. In fact any homomorphism $\phi: C \to D$, written in the form $\phi(f) = \phi_0(f) + \phi_1(f)y$, is given by (1) a homomorphism $\phi_0: C \to \mathbb{R}$, with any such on $C = C^\infty(\mathbb{R})$ being evaluation at a uniquely determined point $x_0$, and (2) a [[derivation]] $\phi_1: C \to \mathbb{R}$, meaning a linear function satisfying $\phi_1(f g) = \phi_1(f)\phi_0(g) + \phi_0(f)\phi_1(g) = \phi_1(f)g(x_0) + f(x_0)\phi_1(g)$. There is a whole tangent line's worth of possible derivations, but the particular one we want will take the function $f(x) = x$ to its correct derivative $1$ at $x_0$. Hence we would want the $\phi: C \to D$ such that $\phi(x) = x_0 + 1 \cdot y$. Since $C$ is the free $C^\infty$ ring on a generator $x$, this condition uniquely determines $\phi$ (the book by Moerdijk and Reyes would be a good reference for the theory of $C^\infty$ rings). The set-up I used just packages all these $\phi$ into the single homomorphism $\delta: C \to C \otimes_\mathbb{R} D \cong C[y]/(y^2)$ that sends $x$ to $x + y$, so that when we compute the composite $$C \stackrel{\delta}{\to} C \otimes_\mathbb{R} D \stackrel{ev_{x_0} \otimes 1}{\to} \mathbb{R} \otimes_\mathbb{R} D \cong D$$ we find that it sends $f(x)$ to $f(x_0) + f'(x_0)y$, as promised. I haven't checked the nLab to see to what extent we've already attested to these facts, but it's pretty standard material in SDG and algebraic geometry.

Giving a reference is a slight embarrassment of riches problem. (If Urs is reading this, he might know a canonical reference right away.) Let me show the definition gives the correct answer in the mother of all contexts which is that of real-valued $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>C</mi> <mn>∞</mn></msup></mrow><annotation encoding="application/x-tex">C^\infty</annotation></semantics></math>$ functions, rather than power series with coefficients in a commutative ring $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>A</mi></mrow><annotation encoding="application/x-tex">A</annotation></semantics></math>$ . In other words, replace $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>A</mi></mrow><annotation encoding="application/x-tex">A</annotation></semantics></math>$ with $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">\mathbb{R}</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>A</mi><mo stretchy="false">[</mo><mo stretchy="false">[</mo><mi>x</mi><mo stretchy="false">]</mo><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">A[ [x] ]</annotation></semantics></math>$ with the ring $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi><mo>=</mo><msup><mi>C</mi> <mn>∞</mn></msup><mo stretchy="false">(</mo><mi>ℝ</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">C = C^\infty(\mathbb{R})</annotation></semantics></math>$ of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>C</mi> <mn>∞</mn></msup></mrow><annotation encoding="application/x-tex">C^\infty</annotation></semantics></math>$ functions on $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">\mathbb{R}</annotation></semantics></math>$ . (Doing it with polynomials would probably be equally convincing.)

So again we have the ring of dual numbers $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>D</mi><mo>=</mo><mi>ℝ</mi><mo stretchy="false">[</mo><mi>y</mi><mo stretchy="false">]</mo><mo stretchy="false">/</mo><mo stretchy="false">(</mo><msup><mi>y</mi> <mn>2</mn></msup><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">D = \mathbb{R}[y]/(y^2)</annotation></semantics></math>$ . Given a point $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>x</mi> <mn>0</mn></msub><mo>∈</mo><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">x_0 \in \mathbb{R}</annotation></semantics></math>$ , Hadamard’s lemma allows us to expand any function $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>∈</mo><mi>C</mi></mrow><annotation encoding="application/x-tex">f(x) \in C</annotation></semantics></math>$ in the form $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><msub><mi>x</mi> <mn>0</mn></msub><mo stretchy="false">)</mo><mo>+</mo><msub><mi>a</mi> <mn>1</mn></msub><mo stretchy="false">(</mo><mi>x</mi><mo>−</mo><msub><mi>x</mi> <mn>0</mn></msub><mo stretchy="false">)</mo><mo>+</mo><mi>g</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">(</mo><mi>x</mi><mo>−</mo><msub><mi>x</mi> <mn>0</mn></msub><msup><mo stretchy="false">)</mo> <mn>2</mn></msup></mrow><annotation encoding="application/x-tex">f(x_0) + a_1(x - x_0) + g(x)(x - x_0)^2</annotation></semantics></math>$ for some unique number $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>a</mi> <mn>1</mn></msub></mrow><annotation encoding="application/x-tex">a_1</annotation></semantics></math>$ and function $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>g</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>∈</mo><mi>C</mi></mrow><annotation encoding="application/x-tex">g(x) \in C</annotation></semantics></math>$ , and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><msub><mi>x</mi> <mn>0</mn></msub><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f'(x_0)</annotation></semantics></math>$ is according to standard definition this value $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>a</mi> <mn>1</mn></msub></mrow><annotation encoding="application/x-tex">a_1</annotation></semantics></math>$ . In other words, there is by this lemma a canonical isomorphism
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>C</mi><mo stretchy="false">/</mo><mo stretchy="false">(</mo><mi>x</mi><mo>−</mo><msub><mi>x</mi> <mn>0</mn></msub><msup><mo stretchy="false">)</mo> <mn>2</mn></msup><mi>C</mi><mo>≅</mo><mi>D</mi></mrow><annotation encoding="application/x-tex">C/(x - x_0)^2C \cong D</annotation></semantics></math>$
which maps the residue class of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f(x)</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><msub><mi>x</mi> <mn>0</mn></msub><mo stretchy="false">)</mo><mo>+</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><msub><mi>x</mi> <mn>0</mn></msub><mo stretchy="false">)</mo><mi>y</mi><mo>∈</mo><mi>D</mi></mrow><annotation encoding="application/x-tex">f(x_0) + f'(x_0)y \in D</annotation></semantics></math>$ . In fact any homomorphism $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ϕ</mi><mo>:</mo><mi>C</mi><mo>→</mo><mi>D</mi></mrow><annotation encoding="application/x-tex">\phi: C \to D</annotation></semantics></math>$ , written in the form $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ϕ</mi><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">)</mo><mo>=</mo><msub><mi>ϕ</mi> <mn>0</mn></msub><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">)</mo><mo>+</mo><msub><mi>ϕ</mi> <mn>1</mn></msub><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">)</mo><mi>y</mi></mrow><annotation encoding="application/x-tex">\phi(f) = \phi_0(f) + \phi_1(f)y</annotation></semantics></math>$ , is given by (1) a homomorphism $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>ϕ</mi> <mn>0</mn></msub><mo>:</mo><mi>C</mi><mo>→</mo><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">\phi_0: C \to \mathbb{R}</annotation></semantics></math>$ , with any such on $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi><mo>=</mo><msup><mi>C</mi> <mn>∞</mn></msup><mo stretchy="false">(</mo><mi>ℝ</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">C = C^\infty(\mathbb{R})</annotation></semantics></math>$ being evaluation at a uniquely determined point $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>x</mi> <mn>0</mn></msub></mrow><annotation encoding="application/x-tex">x_0</annotation></semantics></math>$ , and (2) a derivation $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>ϕ</mi> <mn>1</mn></msub><mo>:</mo><mi>C</mi><mo>→</mo><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">\phi_1: C \to \mathbb{R}</annotation></semantics></math>$ , meaning a linear function satisfying $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>ϕ</mi> <mn>1</mn></msub><mo stretchy="false">(</mo><mi>f</mi><mi>g</mi><mo stretchy="false">)</mo><mo>=</mo><msub><mi>ϕ</mi> <mn>1</mn></msub><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">)</mo><msub><mi>ϕ</mi> <mn>0</mn></msub><mo stretchy="false">(</mo><mi>g</mi><mo stretchy="false">)</mo><mo>+</mo><msub><mi>ϕ</mi> <mn>0</mn></msub><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">)</mo><msub><mi>ϕ</mi> <mn>1</mn></msub><mo stretchy="false">(</mo><mi>g</mi><mo stretchy="false">)</mo><mo>=</mo><msub><mi>ϕ</mi> <mn>1</mn></msub><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">)</mo><mi>g</mi><mo stretchy="false">(</mo><msub><mi>x</mi> <mn>0</mn></msub><mo stretchy="false">)</mo><mo>+</mo><mi>f</mi><mo stretchy="false">(</mo><msub><mi>x</mi> <mn>0</mn></msub><mo stretchy="false">)</mo><msub><mi>ϕ</mi> <mn>1</mn></msub><mo stretchy="false">(</mo><mi>g</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">\phi_1(f g) = \phi_1(f)\phi_0(g) + \phi_0(f)\phi_1(g) = \phi_1(f)g(x_0) + f(x_0)\phi_1(g)</annotation></semantics></math>$ . There is a whole tangent line’s worth of possible derivations, but the particular one we want will take the function $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>x</mi></mrow><annotation encoding="application/x-tex">f(x) = x</annotation></semantics></math>$ to its correct derivative $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ at $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>x</mi> <mn>0</mn></msub></mrow><annotation encoding="application/x-tex">x_0</annotation></semantics></math>$ . Hence we would want the $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ϕ</mi><mo>:</mo><mi>C</mi><mo>→</mo><mi>D</mi></mrow><annotation encoding="application/x-tex">\phi: C \to D</annotation></semantics></math>$ such that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ϕ</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><msub><mi>x</mi> <mn>0</mn></msub><mo>+</mo><mn>1</mn><mo>⋅</mo><mi>y</mi></mrow><annotation encoding="application/x-tex">\phi(x) = x_0 + 1 \cdot y</annotation></semantics></math>$ . Since $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ is the free $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>C</mi> <mn>∞</mn></msup></mrow><annotation encoding="application/x-tex">C^\infty</annotation></semantics></math>$ ring on a generator $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ , this condition uniquely determines $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ϕ</mi></mrow><annotation encoding="application/x-tex">\phi</annotation></semantics></math>$ (the book by Moerdijk and Reyes would be a good reference for the theory of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>C</mi> <mn>∞</mn></msup></mrow><annotation encoding="application/x-tex">C^\infty</annotation></semantics></math>$ rings).

The set-up I used just packages all these $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ϕ</mi></mrow><annotation encoding="application/x-tex">\phi</annotation></semantics></math>$ into the single homomorphism $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>δ</mi><mo>:</mo><mi>C</mi><mo>→</mo><mi>C</mi><msub><mo>⊗</mo> <mi>ℝ</mi></msub><mi>D</mi><mo>≅</mo><mi>C</mi><mo stretchy="false">[</mo><mi>y</mi><mo stretchy="false">]</mo><mo stretchy="false">/</mo><mo stretchy="false">(</mo><msup><mi>y</mi> <mn>2</mn></msup><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">\delta: C \to C \otimes_\mathbb{R} D \cong C[y]/(y^2)</annotation></semantics></math>$ that sends $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>+</mo><mi>y</mi></mrow><annotation encoding="application/x-tex">x + y</annotation></semantics></math>$ , so that when we compute the composite
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>C</mi><mover><mo>→</mo><mi>δ</mi></mover><mi>C</mi><msub><mo>⊗</mo> <mi>ℝ</mi></msub><mi>D</mi><mover><mo>→</mo><mrow><msub><mi>ev</mi> <mrow><msub><mi>x</mi> <mn>0</mn></msub></mrow></msub><mo>⊗</mo><mn>1</mn></mrow></mover><mi>ℝ</mi><msub><mo>⊗</mo> <mi>ℝ</mi></msub><mi>D</mi><mo>≅</mo><mi>D</mi></mrow><annotation encoding="application/x-tex">C \stackrel{\delta}{\to} C \otimes_\mathbb{R} D \stackrel{ev_{x_0} \otimes 1}{\to} \mathbb{R} \otimes_\mathbb{R} D \cong D</annotation></semantics></math>$
we find that it sends $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f(x)</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><msub><mi>x</mi> <mn>0</mn></msub><mo stretchy="false">)</mo><mo>+</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><msub><mi>x</mi> <mn>0</mn></msub><mo stretchy="false">)</mo><mi>y</mi></mrow><annotation encoding="application/x-tex">f(x_0) + f'(x_0)y</annotation></semantics></math>$ , as promised.

I haven’t checked the nLab to see to what extent we’ve already attested to these facts, but it’s pretty standard material in SDG and algebraic geometry.
- CommentRowNumber9.
- CommentAuthorRichard Williamson
- CommentTimeOct 15th 2018
- (edited Oct 15th 2018)
- PermaLink
Author: Richard Williamson
Format: MarkdownItexIt would be very nice to include this somewhere if we do not already have it, perhaps at [[differentiation]], and then link to it from your argument [[chain rule]]. It looks like we may have something at [[differentiation]] which is similar to what you wrote in #8, but probably you could incorporate some more of #8 there. With regard to actual formal power series, it follows very easily, as far as I see, from the binomial theorem/induction that we have the right thing (i.e. the usual formal derivative of power series). But again it would be good to put this observation in somewhere, if we do not already have it, and link to it from your argument at [[chain rule]].

It would be very nice to include this somewhere if we do not already have it, perhaps at differentiation, and then link to it from your argument chain rule. It looks like we may have something at differentiation which is similar to what you wrote in #8, but probably you could incorporate some more of #8 there.

With regard to actual formal power series, it follows very easily, as far as I see, from the binomial theorem/induction that we have the right thing (i.e. the usual formal derivative of power series). But again it would be good to put this observation in somewhere, if we do not already have it, and link to it from your argument at chain rule.
- CommentRowNumber10.
- CommentAuthorTodd_Trimble
- CommentTimeOct 15th 2018
- PermaLink
Author: Todd_Trimble
Format: MarkdownItexThanks for the suggestions, Richard -- they make sense. I'll look into it and think about how to do it decorously.

Thanks for the suggestions, Richard – they make sense. I’ll look into it and think about how to do it decorously.
- CommentRowNumber11.
- CommentAuthorRichard Williamson
- CommentTimeOct 15th 2018
- PermaLink
Author: Richard Williamson
Format: MarkdownItexGreat! Thanks again for the nice additions!

Great! Thanks again for the nice additions!

1 to 11 of 11

nForum

Discussion Feed

Not signed in

Site Tag Cloud

nLab > nLab General Discussions: soft proof of the chain rule