Not signed in

Want to take part in these discussions? Sign in if you have an account, or apply for one below

Site Tag Cloud

Vanilla 1.1.10 is a product of Lussumo. More Information: Documentation, Community Support.

Welcome to nForum
If you want to take part in these discussions either sign in now (if you have an account), apply for one now (if you don't).

Atrium > Mathematics, Physics & Philosophy: differentials

Bottom of Page

1 to 71 of 71

- CommentRowNumber1.
- CommentAuthorMike Shulman
- CommentTimeNov 28th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexI just got around to reading [Dray and Manogue](http://www.physics.orst.edu/bridge/papers/CMJdifferentials.pdf)'s paean to differentials. Most of it I agree with, but I am confused by their antagonism towards differentials for linear approximation. Some of what they say seems just wrong, e.g. they claim (bottom of p95) that between a functional relationship $y=f(x)$ and its inverse $x = f^{-1}(y)$ the resulting notions of $dx$ and $dy$ are different, but I don't see it. It seems to me what's different is rather the relationship between $\Delta x$ and $\Delta y$ and $dx$ and $dy$. Specifically, for $y=f(x)$ we have $\Delta x = dx$ but $\Delta y \neq dy$, while for $x = f^{-1}(y)$ we have $\Delta y = dy$ and $\Delta x \neq dx$, but in both cases the $d$'s represent changes along the tangent line to the curve. They also say that using differentials for linear approximation obstructs their use as infinitesimals, but I don't see that either. Quite the opposite, in fact: I would say that infinitesimals are a way of making precise exactly what a linear approximation is. The idea of a linear approximation is that when $x$ is close to $a$, then $f(x)$ is close to $f(a) + f'(a)(x-a)$, but what does that mean exactly? You can say it with epsilons and deltas, but it's more intuitive to say it with infinitesimals: when $x$ is first-order close to $a$, then $f(x)$ is second-order close to $f(a) + f'(a)(x-a)$. Isn't it important in applications outside of mathematics that a Taylor series approximates a function even for appreciable (non-infinitesimal) changes? Frequently it seems that in practice we use the smooth (infinitesimal change) to approximate the discrete (appreciable change). And the notation isn't contradictory either: $dy = f'(x) dx$ makes sense as a relationship between $dx$ and $dy$ as they range over both infinitesimals and appreciable values; we use the infinitesimal version to *define* the relationship and work with it formally, but the appreciable one in applications.

I just got around to reading Dray and Manogue’s paean to differentials. Most of it I agree with, but I am confused by their antagonism towards differentials for linear approximation.

Some of what they say seems just wrong, e.g. they claim (bottom of p95) that between a functional relationship $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi><mo>=</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">y=f(x)</annotation></semantics></math>$ and its inverse $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>=</mo><msup><mi>f</mi> <mrow><mo lspace="0.11111em" rspace="0em">−</mo><mn>1</mn></mrow></msup><mo stretchy="false">(</mo><mi>y</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">x = f^{-1}(y)</annotation></semantics></math>$ the resulting notions of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dx</mi></mrow><annotation encoding="application/x-tex">dx</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dy</mi></mrow><annotation encoding="application/x-tex">dy</annotation></semantics></math>$ are different, but I don’t see it. It seems to me what’s different is rather the relationship between $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>Δ</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\Delta x</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>Δ</mi><mi>y</mi></mrow><annotation encoding="application/x-tex">\Delta y</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dx</mi></mrow><annotation encoding="application/x-tex">dx</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dy</mi></mrow><annotation encoding="application/x-tex">dy</annotation></semantics></math>$ . Specifically, for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi><mo>=</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">y=f(x)</annotation></semantics></math>$ we have $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>Δ</mi><mi>x</mi><mo>=</mo><mi>dx</mi></mrow><annotation encoding="application/x-tex">\Delta x = dx</annotation></semantics></math>$ but $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>Δ</mi><mi>y</mi><mo>≠</mo><mi>dy</mi></mrow><annotation encoding="application/x-tex">\Delta y \neq dy</annotation></semantics></math>$ , while for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>=</mo><msup><mi>f</mi> <mrow><mo lspace="0.11111em" rspace="0em">−</mo><mn>1</mn></mrow></msup><mo stretchy="false">(</mo><mi>y</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">x = f^{-1}(y)</annotation></semantics></math>$ we have $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>Δ</mi><mi>y</mi><mo>=</mo><mi>dy</mi></mrow><annotation encoding="application/x-tex">\Delta y = dy</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>Δ</mi><mi>x</mi><mo>≠</mo><mi>dx</mi></mrow><annotation encoding="application/x-tex">\Delta x \neq dx</annotation></semantics></math>$ , but in both cases the $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>d</mi></mrow><annotation encoding="application/x-tex">d</annotation></semantics></math>$ ’s represent changes along the tangent line to the curve.

They also say that using differentials for linear approximation obstructs their use as infinitesimals, but I don’t see that either. Quite the opposite, in fact: I would say that infinitesimals are a way of making precise exactly what a linear approximation is. The idea of a linear approximation is that when $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ is close to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>a</mi></mrow><annotation encoding="application/x-tex">a</annotation></semantics></math>$ , then $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f(x)</annotation></semantics></math>$ is close to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>a</mi><mo stretchy="false">)</mo><mo>+</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>a</mi><mo stretchy="false">)</mo><mo stretchy="false">(</mo><mi>x</mi><mo>−</mo><mi>a</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f(a) + f'(a)(x-a)</annotation></semantics></math>$ , but what does that mean exactly? You can say it with epsilons and deltas, but it’s more intuitive to say it with infinitesimals: when $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ is first-order close to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>a</mi></mrow><annotation encoding="application/x-tex">a</annotation></semantics></math>$ , then $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f(x)</annotation></semantics></math>$ is second-order close to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>a</mi><mo stretchy="false">)</mo><mo>+</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>a</mi><mo stretchy="false">)</mo><mo stretchy="false">(</mo><mi>x</mi><mo>−</mo><mi>a</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f(a) + f'(a)(x-a)</annotation></semantics></math>$ . Isn’t it important in applications outside of mathematics that a Taylor series approximates a function even for appreciable (non-infinitesimal) changes? Frequently it seems that in practice we use the smooth (infinitesimal change) to approximate the discrete (appreciable change). And the notation isn’t contradictory either: $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dy</mi><mo>=</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mi>dx</mi></mrow><annotation encoding="application/x-tex">dy = f'(x) dx</annotation></semantics></math>$ makes sense as a relationship between $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dx</mi></mrow><annotation encoding="application/x-tex">dx</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dy</mi></mrow><annotation encoding="application/x-tex">dy</annotation></semantics></math>$ as they range over both infinitesimals and appreciable values; we use the infinitesimal version to define the relationship and work with it formally, but the appreciable one in applications.
- CommentRowNumber2.
- CommentAuthorUrs
- CommentTimeNov 28th 2013
- PermaLink
Author: Urs
Format: MarkdownItex> I would say that infinitesimals are a way of making precise exactly what a linear approximation is. Yes, indeed. The finite order in "nilpotent element" is precisely the finite order in "linear approximation to some order". It is also striking that their list of formalizations of infinitesimals on p. 96 (7 of 11) omits what is probably the best way, namely Grothendieck's way as later highlighted in its essence by Lawvere and as used all the time by all algebraic geometers (and in fact intuitively by many physicists without formal mathematical training).

I would say that infinitesimals are a way of making precise exactly what a linear approximation is.

Yes, indeed. The finite order in “nilpotent element” is precisely the finite order in “linear approximation to some order”.

It is also striking that their list of formalizations of infinitesimals on p. 96 (7 of 11) omits what is probably the best way, namely Grothendieck’s way as later highlighted in its essence by Lawvere and as used all the time by all algebraic geometers (and in fact intuitively by many physicists without formal mathematical training).
- CommentRowNumber3.
- CommentAuthorMike Shulman
- CommentTimeNov 28th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexIt is odd that they omit nilpotent infinitesimals. FWIW, though, my current opinion is that for *pedagogical* purposes, and perhaps for many applied fields as well, invertible infinitesimals are preferable to nilpotent ones.

It is odd that they omit nilpotent infinitesimals. FWIW, though, my current opinion is that for pedagogical purposes, and perhaps for many applied fields as well, invertible infinitesimals are preferable to nilpotent ones.
- CommentRowNumber4.
- CommentAuthorTodd_Trimble
- CommentTimeNov 28th 2013
- PermaLink
Author: Todd_Trimble
Format: MarkdownItexIt is probably worth adding that one can formalize the Dirac distribution via invertible infinitesimals, but not via nilpotent ones.

It is probably worth adding that one can formalize the Dirac distribution via invertible infinitesimals, but not via nilpotent ones.
- CommentRowNumber5.
- CommentAuthorTobyBartels
- CommentTimeNov 30th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexTo me, the most annoying error is right there on the front page, where they put ‘dividing’ in scare quotes. It is quite literally the operation of division! To shy away from that is exactly the kind of thinking that leads to banishing differentials in the first place.

To me, the most annoying error is right there on the front page, where they put ‘dividing’ in scare quotes. It is quite literally the operation of division! To shy away from that is exactly the kind of thinking that leads to banishing differentials in the first place.
- CommentRowNumber6.
- CommentAuthorTobyBartels
- CommentTimeNov 30th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexI also object to their division between differentials of equations and differentials of functions. There\'s only one kind of differential (in their paper), which is the differential of an expression/quantity. As applications, these are different; the first starts with two quantities $u$ and $v$ and uses the theorem that $\mathrm{d}u = \mathrm{d}v$ if $u = v$ (and either $\mathrm{d}u$ or $\mathrm{d}v$ exists), while the second starts with a quantity $u$ and a function $f$ and uses the theorem that $\mathrm{d}f(u) = f'(u) \,\mathrm{d}u$ (if $\mathrm{d}u$ exists and $f'$ is defined at $u$). But it\'s the same operation $\mathrm{d}$.

I also object to their division between differentials of equations and differentials of functions. There's only one kind of differential (in their paper), which is the differential of an expression/quantity. As applications, these are different; the first starts with two quantities $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi></mrow><annotation encoding="application/x-tex">u</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>v</mi></mrow><annotation encoding="application/x-tex">v</annotation></semantics></math>$ and uses the theorem that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>u</mi><mo>=</mo><mi mathvariant="normal">d</mi><mi>v</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}u = \mathrm{d}v</annotation></semantics></math>$ if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi><mo>=</mo><mi>v</mi></mrow><annotation encoding="application/x-tex">u = v</annotation></semantics></math>$ (and either $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>u</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}u</annotation></semantics></math>$ or $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>v</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}v</annotation></semantics></math>$ exists), while the second starts with a quantity $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi></mrow><annotation encoding="application/x-tex">u</annotation></semantics></math>$ and a function $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ and uses the theorem that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">(</mo><mi>u</mi><mo stretchy="false">)</mo><mo>=</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>u</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>u</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}f(u) = f'(u) \,\mathrm{d}u</annotation></semantics></math>$ (if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>u</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}u</annotation></semantics></math>$ exists and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>′</mo></mrow><annotation encoding="application/x-tex">f'</annotation></semantics></math>$ is defined at $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi></mrow><annotation encoding="application/x-tex">u</annotation></semantics></math>$ ). But it's the same operation $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}</annotation></semantics></math>$ .
- CommentRowNumber7.
- CommentAuthorTobyBartels
- CommentTimeNov 30th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexI would also consider their calculation for the problem on pages 92&93 to be in error, in the step where they move from the last equation with differentials to the following one. The basic principle of optimization is that the maximum or minimum value of $u$ can only occur when $\mathrm{d}u$ is $0$ or undefined; so having established that $\mathrm{d}\ell = (a/p - b/q) \,\mathrm{d}a$, they should conclude that the extreme values of $\ell$ occur only when $a/p - b/q = 0$, $p = 0$, $q = 0$, $\mathrm{d}a = 0$, or $\mathrm{d}a$ is undefined. The last two possibilities can only be dealt with by examining the nature of $a$ in the context of the original problem, to see that it is possible to vary $a$ smoothly (so $\mathrm{d}a$ is defined) and without pausing (so $\mathrm{d}a \ne 0$) except for the extreme cases where $a = 0$ or $b = 0$. (After all, it would be illegitimate to write $\mathrm{d}\ell = 1 \,\mathrm{d}\ell$ and conclude that $\ell$ has no extreme values because $1 = 0$ has no solution. At some point you must check that you\'ve used a differential of a quantity whose critical behaviour you already understand.) Since $p = 0$ and $q = 0$ are impossible, this leaves us with (only) these extreme cases in addition to the one considered in the paper. As it happens, while they derive the minimum value of $\ell$, one of the extreme cases gives us the maximum value of $\ell$; both extremes occur.

I would also consider their calculation for the problem on pages 92&93 to be in error, in the step where they move from the last equation with differentials to the following one. The basic principle of optimization is that the maximum or minimum value of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi></mrow><annotation encoding="application/x-tex">u</annotation></semantics></math>$ can only occur when $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>u</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}u</annotation></semantics></math>$ is $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn></mrow><annotation encoding="application/x-tex">0</annotation></semantics></math>$ or undefined; so having established that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>ℓ</mi><mo>=</mo><mo stretchy="false">(</mo><mi>a</mi><mo stretchy="false">/</mo><mi>p</mi><mo>−</mo><mi>b</mi><mo stretchy="false">/</mo><mi>q</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>a</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}\ell = (a/p - b/q) \,\mathrm{d}a</annotation></semantics></math>$ , they should conclude that the extreme values of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℓ</mi></mrow><annotation encoding="application/x-tex">\ell</annotation></semantics></math>$ occur only when $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>a</mi><mo stretchy="false">/</mo><mi>p</mi><mo>−</mo><mi>b</mi><mo stretchy="false">/</mo><mi>q</mi><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">a/p - b/q = 0</annotation></semantics></math>$ , $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">p = 0</annotation></semantics></math>$ , $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>q</mi><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">q = 0</annotation></semantics></math>$ , $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>a</mi><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">\mathrm{d}a = 0</annotation></semantics></math>$ , or $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>a</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}a</annotation></semantics></math>$ is undefined. The last two possibilities can only be dealt with by examining the nature of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>a</mi></mrow><annotation encoding="application/x-tex">a</annotation></semantics></math>$ in the context of the original problem, to see that it is possible to vary $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>a</mi></mrow><annotation encoding="application/x-tex">a</annotation></semantics></math>$ smoothly (so $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>a</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}a</annotation></semantics></math>$ is defined) and without pausing (so $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>a</mi><mo>≠</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">\mathrm{d}a \ne 0</annotation></semantics></math>$ ) except for the extreme cases where $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>a</mi><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">a = 0</annotation></semantics></math>$ or $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>b</mi><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">b = 0</annotation></semantics></math>$ . (After all, it would be illegitimate to write $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>ℓ</mi><mo>=</mo><mn>1</mn><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>ℓ</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}\ell = 1 \,\mathrm{d}\ell</annotation></semantics></math>$ and conclude that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℓ</mi></mrow><annotation encoding="application/x-tex">\ell</annotation></semantics></math>$ has no extreme values because $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">1 = 0</annotation></semantics></math>$ has no solution. At some point you must check that you've used a differential of a quantity whose critical behaviour you already understand.) Since $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">p = 0</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>q</mi><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">q = 0</annotation></semantics></math>$ are impossible, this leaves us with (only) these extreme cases in addition to the one considered in the paper. As it happens, while they derive the minimum value of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℓ</mi></mrow><annotation encoding="application/x-tex">\ell</annotation></semantics></math>$ , one of the extreme cases gives us the maximum value of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℓ</mi></mrow><annotation encoding="application/x-tex">\ell</annotation></semantics></math>$ ; both extremes occur.
- CommentRowNumber8.
- CommentAuthorTobyBartels
- CommentTimeNov 30th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexAs to the point at hand … I teach that an equation involving infinitesimals leads to an approximate equation involving finitesimal (or ‘appreciable’, that\'s a good word) differences. So for example, from $v = u^2$, we derive $\mathrm{d}v = 2 u \,\mathrm{d}u$ and so $\Delta{v} \approx 2 u \,\Delta{u}$.[^bug] (I delay discussion of the precision of this approximation to the treatment of Taylor polynomials in the sequence on infinite series, although in principle that could be done earlier.) Dray & Manogue\'s discussion here seems particularly confused, especially the bit about $f$ and $y$ agreeing on the graph of the function (which is largely moot) and their use of $\Delta{y}$ for the approximate change in $y$ (which I would consider unforgivable). But I can understand their objection to the textbook treatment; if you motivate differentials as infinitesimal changes, then using $\mathrm{d}x$ and $\mathrm{d}y$ for appreciable quantities (whether $\Delta{y}$ and $\Delta{x}$ themselves or merely approximations thereto) seems wrong. If you instead motivate differentials as changes in a linear approximation, then this is not a problem, but this is not a motivation that I would give students when I introduce them (even though ultimately it underlies the rigorous definition). Still, they seem to say that linear approximation requires giving a name to the function that $y$ is of $x$, and that\'s just not true. Particularly in applications, there is no need to do this (just as there is no need to give a name to the function that $\ell$ is of $a$ in the optimization problem). So, we need to use differentials of equations here, and that\'s what I do; but rather than identify a differential with either a difference or an approximation thereto, I give the rule (as I gave it above) that you can change differentials to differences in an equation so long as you also change equality to approximate equality. [^bug]: Here\'s a bug in iTeX; `\Delta` comes out in italics by default (which I don\'t mind) yet `\mathrm` may not be applied to it (which I do mind).
As to the point at hand … I teach that an equation involving infinitesimals leads to an approximate equation involving finitesimal (or ‘appreciable’, that's a good word) differences. So for example, from $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>v</mi><mo>=</mo><msup><mi>u</mi> <mn>2</mn></msup></mrow><annotation encoding="application/x-tex">v = u^2</annotation></semantics></math>$ , we derive $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>v</mi><mo>=</mo><mn>2</mn><mi>u</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>u</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}v = 2 u \,\mathrm{d}u</annotation></semantics></math>$ and so $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>Δ</mi><mi>v</mi><mo>≈</mo><mn>2</mn><mi>u</mi><mspace width="0.16667em"/><mi>Δ</mi><mi>u</mi></mrow><annotation encoding="application/x-tex">\Delta{v} \approx 2 u \,\Delta{u}</annotation></semantics></math>$ .¹ (I delay discussion of the precision of this approximation to the treatment of Taylor polynomials in the sequence on infinite series, although in principle that could be done earlier.) Dray & Manogue's discussion here seems particularly confused, especially the bit about $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi></mrow><annotation encoding="application/x-tex">y</annotation></semantics></math>$ agreeing on the graph of the function (which is largely moot) and their use of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>Δ</mi><mi>y</mi></mrow><annotation encoding="application/x-tex">\Delta{y}</annotation></semantics></math>$ for the approximate change in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi></mrow><annotation encoding="application/x-tex">y</annotation></semantics></math>$ (which I would consider unforgivable).

But I can understand their objection to the textbook treatment; if you motivate differentials as infinitesimal changes, then using $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}x</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>y</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}y</annotation></semantics></math>$ for appreciable quantities (whether $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>Δ</mi><mi>y</mi></mrow><annotation encoding="application/x-tex">\Delta{y}</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>Δ</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\Delta{x}</annotation></semantics></math>$ themselves or merely approximations thereto) seems wrong. If you instead motivate differentials as changes in a linear approximation, then this is not a problem, but this is not a motivation that I would give students when I introduce them (even though ultimately it underlies the rigorous definition).

Still, they seem to say that linear approximation requires giving a name to the function that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi></mrow><annotation encoding="application/x-tex">y</annotation></semantics></math>$ is of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ , and that's just not true. Particularly in applications, there is no need to do this (just as there is no need to give a name to the function that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℓ</mi></mrow><annotation encoding="application/x-tex">\ell</annotation></semantics></math>$ is of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>a</mi></mrow><annotation encoding="application/x-tex">a</annotation></semantics></math>$ in the optimization problem). So, we need to use differentials of equations here, and that's what I do; but rather than identify a differential with either a difference or an approximation thereto, I give the rule (as I gave it above) that you can change differentials to differences in an equation so long as you also change equality to approximate equality.
1. Here's a bug in iTeX; \Delta comes out in italics by default (which I don't mind) yet \mathrm may not be applied to it (which I do mind). ↩
- CommentRowNumber9.
- CommentAuthorTobyBartels
- CommentTimeNov 30th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexI don\'t like to identify the differentials of Calculus with either nilpotent infinitesimals or nonstandard infinitesimals. I want them to be invertible (in appropriate contexts), so that I can write $\mathrm{d}y/\mathrm{d}x$, and I also want $\mathrm{d}y/\mathrm{d}x$ to be equal (not merely adequal) to the derivative. So to me, they are differential forms in the sense of standard differential geometry; and even though I use Lawvere\'s ideas to explain (or to avoid explaining) what space they are differential forms *on*, I\'m not doing <small>SDG</small>.

I don't like to identify the differentials of Calculus with either nilpotent infinitesimals or nonstandard infinitesimals. I want them to be invertible (in appropriate contexts), so that I can write $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>y</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}y/\mathrm{d}x</annotation></semantics></math>$ , and I also want $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>y</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}y/\mathrm{d}x</annotation></semantics></math>$ to be equal (not merely adequal) to the derivative. So to me, they are differential forms in the sense of standard differential geometry; and even though I use Lawvere's ideas to explain (or to avoid explaining) what space they are differential forms on, I'm not doing SDG.
- CommentRowNumber10.
- CommentAuthorUrs
- CommentTimeNov 30th 2013
- (edited Nov 30th 2013)
- PermaLink
Author: Urs
Format: MarkdownItexTodd, one can formalize the Dirac distribution also without any infinitesimals at all. Seriously, I think there is overwhelming pactical evidence that nilpotent infinitesimals is the right way to do differential calculus, while there is close to no practical evidence for the use of non-nilpotent infinitesimals.

Todd, one can formalize the Dirac distribution also without any infinitesimals at all.

Seriously, I think there is overwhelming pactical evidence that nilpotent infinitesimals is the right way to do differential calculus, while there is close to no practical evidence for the use of non-nilpotent infinitesimals.
- CommentRowNumber11.
- CommentAuthorTobyBartels
- CommentTimeNov 30th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexOf course, their statement on page 98 about $\mathrm{d}x \,\mathrm{d}y = r \,\mathrm{d}r \,\mathrm{d}\theta$, that it\'s at best shorthand for an equality of integrals, does not go far enough either. It is rather an equation between [[absolute differential forms]] which may be calculated as follows: $$ \mathrm{d}x \,\mathrm{d}y = {|\mathrm{d}x \wedge \mathrm{d}y|} = {|\mathrm{d}(r \cos\theta) \wedge \mathrm{d}(r \sin\theta)|} = {|(\cos\theta \,\mathrm{d}r - r \sin\theta \,\mathrm{d}\theta) \wedge (\sin\theta \,\mathrm{d}r + r \cos\theta \,\mathrm{d}\theta)|} = {|\sin\theta \cos\theta \,\mathrm{d}r \wedge \mathrm{d}r + r \cos^2\theta \,\mathrm{d}r \wedge \mathrm{d}\theta - r \sin^2\theta \,\mathrm{d}\theta \wedge \mathrm{d}r - r^2 \sin\theta \cos\theta \,\mathrm{d}\theta \wedge \mathrm{d}\theta|} = {|\sin\theta \cos\theta \,0 + r \cos^2\theta \,\mathrm{d}r \wedge \mathrm{d}\theta + r \sin^2\theta \mathrm{d}r \wedge \mathrm{d}\theta + r^2 \sin\theta \cos\theta \,0|} = {|r \,\mathrm{d}r \wedge \mathrm{d}\theta|} = {|r|} {|\mathrm{d}r \wedge \mathrm{d}\theta|} = r \,\mathrm{d}r \mathrm{d}\theta .$$ I teach this in the multivariable term in place of the Jacobian determinant (although there are tricks to speed it up).

Of course, their statement on page 98 about $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>x</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>y</mi><mo>=</mo><mi>r</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>r</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>θ</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}x \,\mathrm{d}y = r \,\mathrm{d}r \,\mathrm{d}\theta</annotation></semantics></math>$ , that it's at best shorthand for an equality of integrals, does not go far enough either. It is rather an equation between absolute differential forms which may be calculated as follows:
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi mathvariant="normal">d</mi><mi>x</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>y</mi><mo>=</mo><mrow><mo stretchy="false">|</mo><mi mathvariant="normal">d</mi><mi>x</mi><mo>∧</mo><mi mathvariant="normal">d</mi><mi>y</mi><mo stretchy="false">|</mo></mrow><mo>=</mo><mrow><mo stretchy="false">|</mo><mi mathvariant="normal">d</mi><mo stretchy="false">(</mo><mi>r</mi><mi>cos</mi><mi>θ</mi><mo stretchy="false">)</mo><mo>∧</mo><mi mathvariant="normal">d</mi><mo stretchy="false">(</mo><mi>r</mi><mi>sin</mi><mi>θ</mi><mo stretchy="false">)</mo><mo stretchy="false">|</mo></mrow><mo>=</mo><mrow><mo stretchy="false">|</mo><mo stretchy="false">(</mo><mi>cos</mi><mi>θ</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>r</mi><mo>−</mo><mi>r</mi><mi>sin</mi><mi>θ</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>θ</mi><mo stretchy="false">)</mo><mo>∧</mo><mo stretchy="false">(</mo><mi>sin</mi><mi>θ</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>r</mi><mo>+</mo><mi>r</mi><mi>cos</mi><mi>θ</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>θ</mi><mo stretchy="false">)</mo><mo stretchy="false">|</mo></mrow><mo>=</mo><mrow><mo stretchy="false">|</mo><mi>sin</mi><mi>θ</mi><mi>cos</mi><mi>θ</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>r</mi><mo>∧</mo><mi mathvariant="normal">d</mi><mi>r</mi><mo>+</mo><mi>r</mi><msup><mi>cos</mi> <mn>2</mn></msup><mi>θ</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>r</mi><mo>∧</mo><mi mathvariant="normal">d</mi><mi>θ</mi><mo>−</mo><mi>r</mi><msup><mi>sin</mi> <mn>2</mn></msup><mi>θ</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>θ</mi><mo>∧</mo><mi mathvariant="normal">d</mi><mi>r</mi><mo>−</mo><msup><mi>r</mi> <mn>2</mn></msup><mi>sin</mi><mi>θ</mi><mi>cos</mi><mi>θ</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>θ</mi><mo>∧</mo><mi mathvariant="normal">d</mi><mi>θ</mi><mo stretchy="false">|</mo></mrow><mo>=</mo><mrow><mo stretchy="false">|</mo><mi>sin</mi><mi>θ</mi><mi>cos</mi><mi>θ</mi><mspace width="0.16667em"/><mn>0</mn><mo>+</mo><mi>r</mi><msup><mi>cos</mi> <mn>2</mn></msup><mi>θ</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>r</mi><mo>∧</mo><mi mathvariant="normal">d</mi><mi>θ</mi><mo>+</mo><mi>r</mi><msup><mi>sin</mi> <mn>2</mn></msup><mi>θ</mi><mi mathvariant="normal">d</mi><mi>r</mi><mo>∧</mo><mi mathvariant="normal">d</mi><mi>θ</mi><mo>+</mo><msup><mi>r</mi> <mn>2</mn></msup><mi>sin</mi><mi>θ</mi><mi>cos</mi><mi>θ</mi><mspace width="0.16667em"/><mn>0</mn><mo stretchy="false">|</mo></mrow><mo>=</mo><mrow><mo stretchy="false">|</mo><mi>r</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>r</mi><mo>∧</mo><mi mathvariant="normal">d</mi><mi>θ</mi><mo stretchy="false">|</mo></mrow><mo>=</mo><mrow><mo stretchy="false">|</mo><mi>r</mi><mo stretchy="false">|</mo></mrow><mrow><mo stretchy="false">|</mo><mi mathvariant="normal">d</mi><mi>r</mi><mo>∧</mo><mi mathvariant="normal">d</mi><mi>θ</mi><mo stretchy="false">|</mo></mrow><mo>=</mo><mi>r</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>r</mi><mi mathvariant="normal">d</mi><mi>θ</mi><mo>.</mo></mrow><annotation encoding="application/x-tex"> \mathrm{d}x \,\mathrm{d}y = {|\mathrm{d}x \wedge \mathrm{d}y|} = {|\mathrm{d}(r \cos\theta) \wedge \mathrm{d}(r \sin\theta)|} = {|(\cos\theta \,\mathrm{d}r - r \sin\theta \,\mathrm{d}\theta) \wedge (\sin\theta \,\mathrm{d}r + r \cos\theta \,\mathrm{d}\theta)|} = {|\sin\theta \cos\theta \,\mathrm{d}r \wedge \mathrm{d}r + r \cos^2\theta \,\mathrm{d}r \wedge \mathrm{d}\theta - r \sin^2\theta \,\mathrm{d}\theta \wedge \mathrm{d}r - r^2 \sin\theta \cos\theta \,\mathrm{d}\theta \wedge \mathrm{d}\theta|} = {|\sin\theta \cos\theta \,0 + r \cos^2\theta \,\mathrm{d}r \wedge \mathrm{d}\theta + r \sin^2\theta \mathrm{d}r \wedge \mathrm{d}\theta + r^2 \sin\theta \cos\theta \,0|} = {|r \,\mathrm{d}r \wedge \mathrm{d}\theta|} = {|r|} {|\mathrm{d}r \wedge \mathrm{d}\theta|} = r \,\mathrm{d}r \mathrm{d}\theta .</annotation></semantics></math>$
I teach this in the multivariable term in place of the Jacobian determinant (although there are tricks to speed it up).
- CommentRowNumber12.
- CommentAuthorTodd_Trimble
- CommentTimeNov 30th 2013
- PermaLink
Author: Todd_Trimble
Format: MarkdownItexUrs #10, surely you don't believe I don't know that?? That's why I used the phrase "can be formalized" -- not "are formalized".

Urs #10, surely you don’t believe I don’t know that?? That’s why I used the phrase “can be formalized” – not “are formalized”.
- CommentRowNumber13.
- CommentAuthorTodd_Trimble
- CommentTimeNov 30th 2013
- PermaLink
Author: Todd_Trimble
Format: MarkdownItexActually, though, I think the assertions in the second paragraph of #10 are deserving of more careful articulation and supporting evidence, since I am sure there are many mathematicians who would instinctively disagree with such dogma. To be clear: maybe you're right, Urs, but such a sweeping dismissal of invertible infinitesimals does deserve at least some explanation by *someone*.

Actually, though, I think the assertions in the second paragraph of #10 are deserving of more careful articulation and supporting evidence, since I am sure there are many mathematicians who would instinctively disagree with such dogma. To be clear: maybe you’re right, Urs, but such a sweeping dismissal of invertible infinitesimals does deserve at least some explanation by someone.
- CommentRowNumber14.
- CommentAuthorzskoda
- CommentTimeNov 30th 2013
- (edited Nov 30th 2013)
- PermaLink
Author: zskoda
Format: MarkdownItexWhen I studied nonstandard analysis, I read at several places that there are places in applied mathematics where they have several infinitesimal scales involved which are not functionally dependent, and that there are subtle kinds of convergence suited to deal with such situations; so it is intuitively easier to indeed have infinitesimals whose smallness s not scaled as powers of a fixed infinitesimal. It does not look to me that this is straightforward to treat with nilpotent infinitesimals. On the other hand, nonstandard analysis is not only bringing the infinitesimals but the transfer principle which makes automatic transfer of one whole class of theorems. In SDG one needs to work in a special way with infinitesimals and prove many theorems from scratch. By no means SDG replaces nonstandard analysis in all important applications. For example, Keistler is emphasising a power of very rich functional spaces of nonstandard analysis, e.g. Loeb probability spaces.

When I studied nonstandard analysis, I read at several places that there are places in applied mathematics where they have several infinitesimal scales involved which are not functionally dependent, and that there are subtle kinds of convergence suited to deal with such situations; so it is intuitively easier to indeed have infinitesimals whose smallness s not scaled as powers of a fixed infinitesimal. It does not look to me that this is straightforward to treat with nilpotent infinitesimals.

On the other hand, nonstandard analysis is not only bringing the infinitesimals but the transfer principle which makes automatic transfer of one whole class of theorems. In SDG one needs to work in a special way with infinitesimals and prove many theorems from scratch. By no means SDG replaces nonstandard analysis in all important applications. For example, Keistler is emphasising a power of very rich functional spaces of nonstandard analysis, e.g. Loeb probability spaces.
- CommentRowNumber15.
- CommentAuthorMike Shulman
- CommentTimeNov 30th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexGood points, Zoran. In general, I am skeptical of claims that any one way to do something is "the right way".

Good points, Zoran. In general, I am skeptical of claims that any one way to do something is “the right way”.
- CommentRowNumber16.
- CommentAuthorzskoda
- CommentTimeNov 30th 2013
- (edited Nov 30th 2013)
- PermaLink
Author: zskoda
Format: MarkdownItexEven if one accepts not to do nonstandard kind of differentals, there are other kinds of nonnilpotent infinitesmals, namely it is useful also to work with full completion rather than finite nilpotent thickenings; e.g. theorems on formal functions around a subvariety, which are supported on the completion; they are not supported at any nilpotent level but at the colimit where the differentials are still infinitesimal in the sense that the series does not converge in finitary sense, and they are not nilpotent as all powers contribute. Zariski's algebraic geometry could not make sense of those and Grothendieck did in his related work. I do not know if formal functions along (=normal to) submanifold in SDG have their status already at the axiomatic level or one needs to go to a specific model ?

Even if one accepts not to do nonstandard kind of differentals, there are other kinds of nonnilpotent infinitesmals, namely it is useful also to work with full completion rather than finite nilpotent thickenings; e.g. theorems on formal functions around a subvariety, which are supported on the completion; they are not supported at any nilpotent level but at the colimit where the differentials are still infinitesimal in the sense that the series does not converge in finitary sense, and they are not nilpotent as all powers contribute. Zariski’s algebraic geometry could not make sense of those and Grothendieck did in his related work.

I do not know if formal functions along (=normal to) submanifold in SDG have their status already at the axiomatic level or one needs to go to a specific model ?
- CommentRowNumber17.
- CommentAuthorMike Shulman
- CommentTimeNov 30th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexThe word "appreciable" is from nonstandard analysis literature. I'm not sure how standard (npi) it is.

The word “appreciable” is from nonstandard analysis literature. I’m not sure how standard (npi) it is.
- CommentRowNumber18.
- CommentAuthorMike Shulman
- CommentTimeDec 2nd 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexI actually don't even know how to do ordinary 1-variable integration with nilpotent infinitesimals. The only SDG treatments of integration that I've seen basically postulate it as an axiom, which is not really satisfying, especially pedagogically.

I actually don’t even know how to do ordinary 1-variable integration with nilpotent infinitesimals. The only SDG treatments of integration that I’ve seen basically postulate it as an axiom, which is not really satisfying, especially pedagogically.
- CommentRowNumber19.
- CommentAuthorMike Shulman
- CommentTimeDec 2nd 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexRe #8 and #9, my inclination would be to say that the differential of a quantity is another quantity that represents the *first-order change* in the first quantity. Since the meaning of "first-order" is relative to some (explicitly or implicitly) chosen scale, it could be either infinitesimal or appreciable depending on context, but in either case there is still the intuition of it being "small". We can define precisely what differentials mean by using (invertible) infinitesimals --- or using appreciables, with epsilons and deltas --- but once we've done that then there's nothing wrong with plugging in either kind of value. And because the differential only represents the first-order change, the quotient $dy/dx$ is always equal to the derivative.

Re #8 and #9, my inclination would be to say that the differential of a quantity is another quantity that represents the first-order change in the first quantity. Since the meaning of “first-order” is relative to some (explicitly or implicitly) chosen scale, it could be either infinitesimal or appreciable depending on context, but in either case there is still the intuition of it being “small”. We can define precisely what differentials mean by using (invertible) infinitesimals — or using appreciables, with epsilons and deltas — but once we’ve done that then there’s nothing wrong with plugging in either kind of value. And because the differential only represents the first-order change, the quotient $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dy</mi><mo stretchy="false">/</mo><mi>dx</mi></mrow><annotation encoding="application/x-tex">dy/dx</annotation></semantics></math>$ is always equal to the derivative.
- CommentRowNumber20.
- CommentAuthorMichael_Bachtold
- CommentTimeDec 3rd 2013
- PermaLink
Author: Michael_Bachtold
Format: MarkdownItex@Toby #9 >So to me, they are differential forms in the sense of standard differential geometry; and even though I use Lawvere's ideas to explain (or to avoid explaining) what space they are differential forms on, I'm not doing SDG. I'm not sure if differential forms in standard differential geometry are superior to non-invertible differentials when it comes to making sense of a fraction $\frac{dy}{dx}$. In fact a single differential form $dx$ is not invertible ($\frac{1}{dx}$ is not defined) just as for nilsquare infintesimals. What we mean by $\frac{dy}{dx}$ is the ratio of differentials, which makes sense whenever there is a variable quantity $f$ such that $dy=fdx$. This probably makes sense for any flavour of differentials.

@Toby #9

So to me, they are differential forms in the sense of standard differential geometry; and even though I use Lawvere’s ideas to explain (or to avoid explaining) what space they are differential forms on, I’m not doing SDG.

I’m not sure if differential forms in standard differential geometry are superior to non-invertible differentials when it comes to making sense of a fraction $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mfrac><mi>dy</mi><mi>dx</mi></mfrac></mrow><annotation encoding="application/x-tex">\frac{dy}{dx}</annotation></semantics></math>$ . In fact a single differential form $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dx</mi></mrow><annotation encoding="application/x-tex">dx</annotation></semantics></math>$ is not invertible ( $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mfrac><mn>1</mn><mi>dx</mi></mfrac></mrow><annotation encoding="application/x-tex">\frac{1}{dx}</annotation></semantics></math>$ is not defined) just as for nilsquare infintesimals. What we mean by $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mfrac><mi>dy</mi><mi>dx</mi></mfrac></mrow><annotation encoding="application/x-tex">\frac{dy}{dx}</annotation></semantics></math>$ is the ratio of differentials, which makes sense whenever there is a variable quantity $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ such that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dy</mi><mo>=</mo><mi>fdx</mi></mrow><annotation encoding="application/x-tex">dy=fdx</annotation></semantics></math>$ . This probably makes sense for any flavour of differentials.
- CommentRowNumber21.
- CommentAuthorMichael_Bachtold
- CommentTimeDec 3rd 2013
- (edited Dec 3rd 2013)
- PermaLink
Author: Michael_Bachtold
Format: MarkdownItex@Zoran 14: >When I studied nonstandard analysis, I read at several places that there are places in applied mathematics where they have several infinitesimal scales involved which are not functionally dependent, and that there are subtle kinds of convergence suited to deal with such situations; That sounds interesting, do you remember where you read that or what examples they where talking about? @Mike 18: >The only SDG treatments of integration that I’ve seen basically postulate it as an axiom, which is not really satisfying, especially pedagogically. I've also been wondering about this recently. If I recall correctly SDG postulates the existence of antiderivatives by an axiom and defines the definite integral by the fundamental theorem of calculus. So actually there is no fundamental theorem of calculus in SDG (correct me if I'm wrong). So how could one restore a fundamental theorem of calculus inside SDG? Related question: how to do things like numerical integration inside SDG? Is there a notion of "discrete approximation" of a space inside SDG? The limit (in the categorical sense) of a family of discrete spaces approximating another space? Edit: I had a look at the 2009 Book by Kock Synthetic geometry of manifolds where he says p106: "Integration theory in SDG is not very well developed; in most places, like in [36], the theory depends on anti-derivatives. The present text is no exception, and the theory here is even more primitive than in [36]" Here [36] is the older book by Kock on SDG.

@Zoran 14:

When I studied nonstandard analysis, I read at several places that there are places in applied mathematics where they have several infinitesimal scales involved which are not functionally dependent, and that there are subtle kinds of convergence suited to deal with such situations;

That sounds interesting, do you remember where you read that or what examples they where talking about?

@Mike 18:

The only SDG treatments of integration that I’ve seen basically postulate it as an axiom, which is not really satisfying, especially pedagogically.

I’ve also been wondering about this recently. If I recall correctly SDG postulates the existence of antiderivatives by an axiom and defines the definite integral by the fundamental theorem of calculus. So actually there is no fundamental theorem of calculus in SDG (correct me if I’m wrong).

So how could one restore a fundamental theorem of calculus inside SDG? Related question: how to do things like numerical integration inside SDG? Is there a notion of “discrete approximation” of a space inside SDG? The limit (in the categorical sense) of a family of discrete spaces approximating another space?

Edit: I had a look at the 2009 Book by Kock Synthetic geometry of manifolds where he says p106: “Integration theory in SDG is not very well developed; in most places, like in [36], the theory depends on anti-derivatives. The present text is no exception, and the theory here is even more primitive than in [36]” Here [36] is the older book by Kock on SDG.
- CommentRowNumber22.
- CommentAuthorzskoda
- CommentTimeDec 3rd 2013
- PermaLink
Author: zskoda
Format: MarkdownItex> do you remember where you read that or what examples they where talking about I think it was first time from Hoegh-Krohn, but came across later as well. Once I am back from next week conference I will be glad to search for his examples.

do you remember where you read that or what examples they where talking about

I think it was first time from Hoegh-Krohn, but came across later as well. Once I am back from next week conference I will be glad to search for his examples.
- CommentRowNumber23.
- CommentAuthorTobyBartels
- CommentTimeDec 3rd 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItex@Michael #20: The reciprocal of a nowhere-$0$ differential on a $1$-dimensional manifold is defined (and the reciprocal of any differential on a $1$-dimensional manifold is partially defined); this generalizes to any line bundle. (In fact, since the reciprocal line bundle is the dual line bundle, the reciprocal of $\mathrm{d}x$ is the vector field $\partial/\partial{x}$ (no subscripts necessary). The notation $\partial/\partial{x}$ (or $\mathrm{d}/\mathrm{d}x$) is for the application of this vector field to scalar fields, which is really the combination of applying the differential and then the pairing of vector fields with covector fields (aka differential $1$-forms). So $1/\mathrm{d}x$ is appropriate notation when we\'re going to multiply directly by a differential.) However, you are correct that $\mathrm{d}y/\mathrm{d}x$ can mean the unique solution $f$ to $\mathrm{d}y = f \,\mathrm{d}x$, without any meaning given to $1/\mathrm{d}x$. This is important, since sometimes we want $\mathrm{d}y/\mathrm{d}x$ without assuming that the unspecified underlying space is $1$-dimensional. So your broader point, that the notation works just fine with nilpotent infinitesimals, is correct.

@Michael #20: The reciprocal of a nowhere- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn></mrow><annotation encoding="application/x-tex">0</annotation></semantics></math>$ differential on a $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ -dimensional manifold is defined (and the reciprocal of any differential on a $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ -dimensional manifold is partially defined); this generalizes to any line bundle. (In fact, since the reciprocal line bundle is the dual line bundle, the reciprocal of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}x</annotation></semantics></math>$ is the vector field $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo>∂</mo><mo stretchy="false">/</mo><mo>∂</mo><mi>x</mi></mrow><annotation encoding="application/x-tex">\partial/\partial{x}</annotation></semantics></math>$ (no subscripts necessary). The notation $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo>∂</mo><mo stretchy="false">/</mo><mo>∂</mo><mi>x</mi></mrow><annotation encoding="application/x-tex">\partial/\partial{x}</annotation></semantics></math>$ (or $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}/\mathrm{d}x</annotation></semantics></math>$ ) is for the application of this vector field to scalar fields, which is really the combination of applying the differential and then the pairing of vector fields with covector fields (aka differential $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ -forms). So $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">1/\mathrm{d}x</annotation></semantics></math>$ is appropriate notation when we're going to multiply directly by a differential.)

However, you are correct that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>y</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}y/\mathrm{d}x</annotation></semantics></math>$ can mean the unique solution $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>y</mi><mo>=</mo><mi>f</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}y = f \,\mathrm{d}x</annotation></semantics></math>$ , without any meaning given to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">1/\mathrm{d}x</annotation></semantics></math>$ . This is important, since sometimes we want $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>y</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}y/\mathrm{d}x</annotation></semantics></math>$ without assuming that the unspecified underlying space is $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ -dimensional. So your broader point, that the notation works just fine with nilpotent infinitesimals, is correct.
- CommentRowNumber24.
- CommentAuthorMichael_Bachtold
- CommentTimeDec 3rd 2013
- PermaLink
Author: Michael_Bachtold
Format: MarkdownItexI see, thanks for clarifying.

I see, thanks for clarifying.
- CommentRowNumber25.
- CommentAuthorMike Shulman
- CommentTimeDec 11th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexIt's a thousand pities that the phrase "differential equation" has come to mean what should really be called a "derivative equation". So what kind of equation is, say, $2x \, dx + 2y \, dy = 0$?

It’s a thousand pities that the phrase “differential equation” has come to mean what should really be called a “derivative equation”. So what kind of equation is, say, $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>2</mn><mi>x</mi><mspace width="0.16667em"/><mi>dx</mi><mo>+</mo><mn>2</mn><mi>y</mi><mspace width="0.16667em"/><mi>dy</mi><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">2x \, dx + 2y \, dy = 0</annotation></semantics></math>$ ?
- CommentRowNumber26.
- CommentAuthorZhen Lin
- CommentTimeDec 11th 2013
- PermaLink
Author: Zhen Lin
Format: MarkdownItexThe two notions are not so different in the one-variable, first-order case. So I would still call that a differential equation.

The two notions are not so different in the one-variable, first-order case. So I would still call that a differential equation.
- CommentRowNumber27.
- CommentAuthorTobyBartels
- CommentTimeDec 11th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexYes, I agree with Zhen. Yesterday I told my students ‘A differential equation is an equation with differentials or derivatives in it.’ and gave these three examples, all essentially equivalent: * $\mathrm{d}y = 3 y \,\mathrm{d}x$, * $\displaystyle \frac{\mathrm{d}y}{\mathrm{d}x} = 3 y$, * $f'(x) = 3 f(x)$.
Yes, I agree with Zhen. Yesterday I told my students ‘A differential equation is an equation with differentials or derivatives in it.’ and gave these three examples, all essentially equivalent:
- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>y</mi><mo>=</mo><mn>3</mn><mi>y</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}y = 3 y \,\mathrm{d}x</annotation></semantics></math>$ ,
- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mstyle displaystyle="true"><mfrac><mrow><mi mathvariant="normal">d</mi><mi>y</mi></mrow><mrow><mi mathvariant="normal">d</mi><mi>x</mi></mrow></mfrac><mo>=</mo><mn>3</mn><mi>y</mi></mstyle></mrow><annotation encoding="application/x-tex">\displaystyle \frac{\mathrm{d}y}{\mathrm{d}x} = 3 y</annotation></semantics></math>$ ,
- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mn>3</mn><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f'(x) = 3 f(x)</annotation></semantics></math>$ .
- CommentRowNumber28.
- CommentAuthorTobyBartels
- CommentTimeDec 11th 2013
- (edited Dec 11th 2013)
- PermaLink
Author: TobyBartels
Format: MarkdownItexA higher order differential equation (even partial) is equivalent to a system of equations in which differentials (and no derivatives) appear. For example, $$ \frac{\partial^2{u}}{\partial{x}^2} + \frac{\partial^2{u}}{\partial{y}^2} = 0 $$ is equivalent to the system $$ \mathrm{d}u = A \,\mathrm{d}x + B \,\mathrm{d}y ,$$ $$ \mathrm{d}A = C \,\mathrm{d}x + D \,\mathrm{d}y ,$$ $$ \mathrm{d}B = D \,\mathrm{d}x + E \,\mathrm{d}y ,$$ $$ C + E = 0 .$$ I don\'t know that this is worth it! Alternatively, using higher differentials, we could write $$ \mathrm{d}^2u = C \,\mathrm{d}^2x + A \,\mathrm{d}x^2 + 2 D \,\mathrm{d}x \,\mathrm{d}y + E \,\mathrm{d}^2y + B \,\mathrm{d}y^2 ,$$ $$ C + E = 0 .$$ Or even $$ \mathrm{d}^2u = C \,\mathrm{d}^2x + A \,\mathrm{d}x^2 + 2 D \,\mathrm{d}x \,\mathrm{d}y - C \,\mathrm{d}^2y + B \,\mathrm{d}y^2 ;$$ there you go, a second-order--differential equation!

A higher order differential equation (even partial) is equivalent to a system of equations in which differentials (and no derivatives) appear. For example,
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mfrac><mrow><msup><mo>∂</mo> <mn>2</mn></msup><mi>u</mi></mrow><mrow><mo>∂</mo><msup><mi>x</mi> <mn>2</mn></msup></mrow></mfrac><mo>+</mo><mfrac><mrow><msup><mo>∂</mo> <mn>2</mn></msup><mi>u</mi></mrow><mrow><mo>∂</mo><msup><mi>y</mi> <mn>2</mn></msup></mrow></mfrac><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex"> \frac{\partial^2{u}}{\partial{x}^2} + \frac{\partial^2{u}}{\partial{y}^2} = 0 </annotation></semantics></math>$
is equivalent to the system
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi mathvariant="normal">d</mi><mi>u</mi><mo>=</mo><mi>A</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mo>+</mo><mi>B</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>y</mi><mo>,</mo></mrow><annotation encoding="application/x-tex"> \mathrm{d}u = A \,\mathrm{d}x + B \,\mathrm{d}y ,</annotation></semantics></math>$ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi mathvariant="normal">d</mi><mi>A</mi><mo>=</mo><mi>C</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mo>+</mo><mi>D</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>y</mi><mo>,</mo></mrow><annotation encoding="application/x-tex"> \mathrm{d}A = C \,\mathrm{d}x + D \,\mathrm{d}y ,</annotation></semantics></math>$ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi mathvariant="normal">d</mi><mi>B</mi><mo>=</mo><mi>D</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mo>+</mo><mi>E</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>y</mi><mo>,</mo></mrow><annotation encoding="application/x-tex"> \mathrm{d}B = D \,\mathrm{d}x + E \,\mathrm{d}y ,</annotation></semantics></math>$ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>C</mi><mo>+</mo><mi>E</mi><mo>=</mo><mn>0</mn><mo>.</mo></mrow><annotation encoding="application/x-tex"> C + E = 0 .</annotation></semantics></math>$
I don't know that this is worth it!

Alternatively, using higher differentials, we could write
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>u</mi><mo>=</mo><mi>C</mi><mspace width="0.16667em"/><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>x</mi><mo>+</mo><mi>A</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><msup><mi>x</mi> <mn>2</mn></msup><mo>+</mo><mn>2</mn><mi>D</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>y</mi><mo>+</mo><mi>E</mi><mspace width="0.16667em"/><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>y</mi><mo>+</mo><mi>B</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><msup><mi>y</mi> <mn>2</mn></msup><mo>,</mo></mrow><annotation encoding="application/x-tex"> \mathrm{d}^2u = C \,\mathrm{d}^2x + A \,\mathrm{d}x^2 + 2 D \,\mathrm{d}x \,\mathrm{d}y + E \,\mathrm{d}^2y + B \,\mathrm{d}y^2 ,</annotation></semantics></math>$ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>C</mi><mo>+</mo><mi>E</mi><mo>=</mo><mn>0</mn><mo>.</mo></mrow><annotation encoding="application/x-tex"> C + E = 0 .</annotation></semantics></math>$
Or even
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>u</mi><mo>=</mo><mi>C</mi><mspace width="0.16667em"/><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>x</mi><mo>+</mo><mi>A</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><msup><mi>x</mi> <mn>2</mn></msup><mo>+</mo><mn>2</mn><mi>D</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>y</mi><mo>−</mo><mi>C</mi><mspace width="0.16667em"/><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>y</mi><mo>+</mo><mi>B</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><msup><mi>y</mi> <mn>2</mn></msup><mo>;</mo></mrow><annotation encoding="application/x-tex"> \mathrm{d}^2u = C \,\mathrm{d}^2x + A \,\mathrm{d}x^2 + 2 D \,\mathrm{d}x \,\mathrm{d}y - C \,\mathrm{d}^2y + B \,\mathrm{d}y^2 ;</annotation></semantics></math>$
there you go, a second-order–differential equation!
- CommentRowNumber29.
- CommentAuthorMike Shulman
- CommentTimeDec 11th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexOkay, if anyone comes complaining to me about calling equations-with-differentials "differential equations", I'll cite you guys. (-: But I'm not entirely happy with > A differential equation is an equation with differentials or derivatives in it because it would include equations like $$ (dy)^2 = \sqrt{dx} + e^{dx}$$

Okay, if anyone comes complaining to me about calling equations-with-differentials “differential equations”, I’ll cite you guys. (-: But I’m not entirely happy with

A differential equation is an equation with differentials or derivatives in it

because it would include equations like
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mo stretchy="false">(</mo><mi>dy</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup><mo>=</mo><msqrt><mi>dx</mi></msqrt><mo>+</mo><msup><mi>e</mi> <mi>dx</mi></msup></mrow><annotation encoding="application/x-tex"> (dy)^2 = \sqrt{dx} + e^{dx}</annotation></semantics></math>$
- CommentRowNumber30.
- CommentAuthorTobyBartels
- CommentTimeDec 12th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexWould you say that a vector equation is an equation with vectors in it? How about $$ \vec{v} + x = 0 $$ ?

Would you say that a vector equation is an equation with vectors in it? How about
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mover><mi>v</mi><mo stretchy="false">→</mo></mover><mo>+</mo><mi>x</mi><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex"> \vec{v} + x = 0 </annotation></semantics></math>$
?
- CommentRowNumber31.
- CommentAuthorMike Shulman
- CommentTimeDec 12th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexNo, I would not. In fact, I don't think I've ever had occasion to use the phrase "vector equation".

No, I would not. In fact, I don’t think I’ve ever had occasion to use the phrase “vector equation”.
- CommentRowNumber32.
- CommentAuthorTobyBartels
- CommentTimeDec 13th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexOh. Well, I have. I guess that my point is that your example is ill formed.

Oh. Well, I have.

I guess that my point is that your example is ill formed.
- CommentRowNumber33.
- CommentAuthorMike Shulman
- CommentTimeDec 13th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexHow do you define "ill formed"?

How do you define “ill formed”?
- CommentRowNumber34.
- CommentAuthorTobyBartels
- CommentTimeDec 15th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexI\'m inclined to say that your example fails to be a differential equation for the same reason that $$ x/ = {}^2y + e^( $$ fails to be an equation at all. There is no universal definition of what makes something well formed or ill formed; but one must establish that one\'s equation has meaning before writing it down. That said, in both the vector-equation and differential-equation case, the problem is a matter of homogeneity or dimensionalysis. Don\'t add vectors to scalars, don\'t add first-order differentials to second-order ones, don\'t add distances to speeds, etc. Of course, this is not a universal rule; the geometric algebraists break it all the time, but this only splits things up into several independent equations. From that perspective, my equation $$ \vec{v} + x = 0 $$ splits into this system of equations: * $\vec{v} = 0$ (vector), * $x = 0$ (scalar). (The unique solution is now immediate.) So following this, your example $$ (dy)^2 = \sqrt{dx} + e^{dx} $$ splits into the following infinite system of differential equations: * $0 = 1$ (order $0$) * $0 = \sqrt{dx}$ (order $1/2$), * $0 = dx$ (order $1$), * $dy^2 = dx^2/2$ (order $2$), * $0 = dx^3/6$ (order $3$), * $0 = dx^4/24$ (order $4$), * $0 = dx^5/120$ (order $5$), * etc. (Thanks to the first equation, this system has no solutions.) Possibly $e^{dx}$ should be given some other interpretation; but that\'s the job of the person writing down the equation, I\'m just trying to be generous by coming up with something.
I'm inclined to say that your example fails to be a differential equation for the same reason that
x/=2y+e(

fails to be an equation at all. There is no universal definition of what makes something well formed or ill formed; but one must establish that one's equation has meaning before writing it down.

That said, in both the vector-equation and differential-equation case, the problem is a matter of homogeneity or dimensionalysis. Don't add vectors to scalars, don't add first-order differentials to second-order ones, don't add distances to speeds, etc. Of course, this is not a universal rule; the geometric algebraists break it all the time, but this only splits things up into several independent equations. From that perspective, my equation
→v+x=0

splits into this system of equations:
- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mover><mi>v</mi><mo stretchy="false">→</mo></mover><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">\vec{v} = 0</annotation></semantics></math>$ (vector),
- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">x = 0</annotation></semantics></math>$ (scalar).
(The unique solution is now immediate.)

So following this, your example
(dy)2=√dx+edx

splits into the following infinite system of differential equations:
- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn><mo>=</mo><mn>1</mn></mrow><annotation encoding="application/x-tex">0 = 1</annotation></semantics></math>$ (order $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn></mrow><annotation encoding="application/x-tex">0</annotation></semantics></math>$ )
- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn><mo>=</mo><msqrt><mi>dx</mi></msqrt></mrow><annotation encoding="application/x-tex">0 = \sqrt{dx}</annotation></semantics></math>$ (order $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn><mo stretchy="false">/</mo><mn>2</mn></mrow><annotation encoding="application/x-tex">1/2</annotation></semantics></math>$ ),
- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn><mo>=</mo><mi>dx</mi></mrow><annotation encoding="application/x-tex">0 = dx</annotation></semantics></math>$ (order $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ ),
- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>dy</mi> <mn>2</mn></msup><mo>=</mo><msup><mi>dx</mi> <mn>2</mn></msup><mo stretchy="false">/</mo><mn>2</mn></mrow><annotation encoding="application/x-tex">dy^2 = dx^2/2</annotation></semantics></math>$ (order $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>2</mn></mrow><annotation encoding="application/x-tex">2</annotation></semantics></math>$ ),
- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn><mo>=</mo><msup><mi>dx</mi> <mn>3</mn></msup><mo stretchy="false">/</mo><mn>6</mn></mrow><annotation encoding="application/x-tex">0 = dx^3/6</annotation></semantics></math>$ (order $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>3</mn></mrow><annotation encoding="application/x-tex">3</annotation></semantics></math>$ ),
- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn><mo>=</mo><msup><mi>dx</mi> <mn>4</mn></msup><mo stretchy="false">/</mo><mn>24</mn></mrow><annotation encoding="application/x-tex">0 = dx^4/24</annotation></semantics></math>$ (order $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>4</mn></mrow><annotation encoding="application/x-tex">4</annotation></semantics></math>$ ),
- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn><mo>=</mo><msup><mi>dx</mi> <mn>5</mn></msup><mo stretchy="false">/</mo><mn>120</mn></mrow><annotation encoding="application/x-tex">0 = dx^5/120</annotation></semantics></math>$ (order $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>5</mn></mrow><annotation encoding="application/x-tex">5</annotation></semantics></math>$ ),
- etc.
(Thanks to the first equation, this system has no solutions.) Possibly $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>e</mi> <mi>dx</mi></msup></mrow><annotation encoding="application/x-tex">e^{dx}</annotation></semantics></math>$ should be given some other interpretation; but that's the job of the person writing down the equation, I'm just trying to be generous by coming up with something.
- CommentRowNumber35.
- CommentAuthorMike Shulman
- CommentTimeDec 15th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexThat makes sense to me, but do you explain it that way to your students? I would expect that $dx$ looks just like another variable to them. How do you define "differential" for them in such a way that "don't add first-order differentials to second-order ones" makes sense?

That makes sense to me, but do you explain it that way to your students? I would expect that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dx</mi></mrow><annotation encoding="application/x-tex">dx</annotation></semantics></math>$ looks just like another variable to them. How do you define “differential” for them in such a way that “don’t add first-order differentials to second-order ones” makes sense?
- CommentRowNumber36.
- CommentAuthorTobyBartels
- CommentTimeDec 16th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexNow I see your point; I *don\'t* really explain that to the students. That said, I do tell them (much earlier, when discussing how to spot errors in the calculation of the differential of an expression) that if they see an expression with two terms, one of which has a differential factor and one of which doesn\'t, then there has been a mistake. (Especially in the context of an equation, I can explain this by saying that something infinitely small can\'t be equal to something finitely small.) So if somebody did see $$ (dy)^2 = \sqrt{dx} + e^{dx} ,$$ then I could explain that the problem is similar (especially since $e^{dx}$ is finitesimal). We only really deal with first-order differential equations in any of the classes that I teach.

Now I see your point; I don't really explain that to the students.

That said, I do tell them (much earlier, when discussing how to spot errors in the calculation of the differential of an expression) that if they see an expression with two terms, one of which has a differential factor and one of which doesn't, then there has been a mistake. (Especially in the context of an equation, I can explain this by saying that something infinitely small can't be equal to something finitely small.) So if somebody did see
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mo stretchy="false">(</mo><mi>dy</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup><mo>=</mo><msqrt><mi>dx</mi></msqrt><mo>+</mo><msup><mi>e</mi> <mi>dx</mi></msup><mo>,</mo></mrow><annotation encoding="application/x-tex"> (dy)^2 = \sqrt{dx} + e^{dx} ,</annotation></semantics></math>$
then I could explain that the problem is similar (especially since $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>e</mi> <mi>dx</mi></msup></mrow><annotation encoding="application/x-tex">e^{dx}</annotation></semantics></math>$ is finitesimal).

We only really deal with first-order differential equations in any of the classes that I teach.
- CommentRowNumber37.
- CommentAuthorZhen Lin
- CommentTimeDec 16th 2013
- PermaLink
Author: Zhen Lin
Format: MarkdownItexCan't this be tackled with traditional dimensional analysis, at least in physically meaningful cases? You can't add an area to a volume, after all.

Can’t this be tackled with traditional dimensional analysis, at least in physically meaningful cases? You can’t add an area to a volume, after all.
- CommentRowNumber38.
- CommentAuthorTobyBartels
- CommentTimeDec 16th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexIt\'s certainly the same kind of issue, but if $x$ and $y$ are dimensionless quantities, still Mike\'s equation is unbalanced.

It's certainly the same kind of issue, but if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi></mrow><annotation encoding="application/x-tex">y</annotation></semantics></math>$ are dimensionless quantities, still Mike's equation is unbalanced.
- CommentRowNumber39.
- CommentAuthorMike Shulman
- CommentTimeDec 16th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexAnd if $x$ and $y$ are lengths, then $x^2 \, dy = y\, (dx)^2$ is dimensionally balanced but differentially unbalanced. Toby, I wish I could sit in on one of your classes from start to end. (-: Clearly you've thought all this out very carefully, and I sort of have a sense of how you do it after all of our discussions, but not, I think, enough to replicate it myself.

And if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi></mrow><annotation encoding="application/x-tex">y</annotation></semantics></math>$ are lengths, then $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>x</mi> <mn>2</mn></msup><mspace width="0.16667em"/><mi>dy</mi><mo>=</mo><mi>y</mi><mspace width="0.16667em"/><mo stretchy="false">(</mo><mi>dx</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup></mrow><annotation encoding="application/x-tex">x^2 \, dy = y\, (dx)^2</annotation></semantics></math>$ is dimensionally balanced but differentially unbalanced.

Toby, I wish I could sit in on one of your classes from start to end. (-: Clearly you’ve thought all this out very carefully, and I sort of have a sense of how you do it after all of our discussions, but not, I think, enough to replicate it myself.
- CommentRowNumber40.
- CommentAuthorTobyBartels
- CommentTimeDec 18th 2013
- (edited Dec 18th 2013)
- PermaLink
Author: TobyBartels
Format: MarkdownItexWell, I\'ve thought it out enough to fake whatever I haven\'t thought out! One can partly sit in on my classes by reading the notes (near the bottom) for [Applied Calculus](http://tobybartels.name/MATH-1400/2013s/), [regular Calculus](http://tobybartels.name/MATH-1600/2013f/), and [multivariable Calculus](http://tobybartels.name/MATH-2080/2013s/).

Well, I've thought it out enough to fake whatever I haven't thought out!

One can partly sit in on my classes by reading the notes (near the bottom) for Applied Calculus, regular Calculus, and multivariable Calculus.
- CommentRowNumber41.
- CommentAuthorTobyBartels
- CommentTimeDec 18th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexI want to redo the discussion of differentials in the last to emphasize curves instead of vectors as the thing that differentials act on, to be less dependent on the precise nature of the unspecified domain.

I want to redo the discussion of differentials in the last to emphasize curves instead of vectors as the thing that differentials act on, to be less dependent on the precise nature of the unspecified domain.
- CommentRowNumber42.
- CommentAuthorUrs
- CommentTimeDec 19th 2013
- (edited Dec 19th 2013)
- PermaLink
Author: Urs
Format: MarkdownItexSorry for the belated reaction, now from my phone: Sorry Todd, I did not mean to imply you did not know it, but I did object to the suggestion that there is a defect of nilpotent infinitesimals which is cured by nonstandard analysis. For the sake of argument, I'll keep insisting on that. Mike may be sceptical of general claims, but empirically by looking at what happens in practice, this is what I see. Zoran means to give a counterexample above, but I doubt it: certainly with nilpotent infinitesimals it is not true that they are all proportional to each other. On the contrary. I see nilpotent differentials govern large areas of maths and deeply so. On the other hand I see nonstandard anaysis as a hack that proves that it can be done if one insists, but that does not show up naturally. One thing that would convince me of nonstandard analysis is if it could be shown to model differential cohesion, as Toby suggested recently in another thread. That would be neat. But I don't quite see it yet.

Sorry for the belated reaction, now from my phone:

Sorry Todd, I did not mean to imply you did not know it, but I did object to the suggestion that there is a defect of nilpotent infinitesimals which is cured by nonstandard analysis.

For the sake of argument, I’ll keep insisting on that. Mike may be sceptical of general claims, but empirically by looking at what happens in practice, this is what I see.

Zoran means to give a counterexample above, but I doubt it: certainly with nilpotent infinitesimals it is not true that they are all proportional to each other. On the contrary.

I see nilpotent differentials govern large areas of maths and deeply so. On the other hand I see nonstandard anaysis as a hack that proves that it can be done if one insists, but that does not show up naturally.

One thing that would convince me of nonstandard analysis is if it could be shown to model differential cohesion, as Toby suggested recently in another thread. That would be neat. But I don’t quite see it yet.
- CommentRowNumber43.
- CommentAuthorUrs
- CommentTimeDec 19th 2013
- (edited Dec 19th 2013)
- PermaLink
Author: Urs
Format: MarkdownItexConcerning integration: we once had this discussion before in another thread: in 1-categorical SDG one needs an integration axiom, but not in homotopy SDG. Here integration of (Kaehler) differential forms is given by the quotient of forms modulo homotopy given by closed forms on a disk. This is described for instance in the nlab entry on Lie integration.

Concerning integration: we once had this discussion before in another thread: in 1-categorical SDG one needs an integration axiom, but not in homotopy SDG. Here integration of (Kaehler) differential forms is given by the quotient of forms modulo homotopy given by closed forms on a disk. This is described for instance in the nlab entry on Lie integration.
- CommentRowNumber44.
- CommentAuthorTodd_Trimble
- CommentTimeDec 19th 2013
- PermaLink
Author: Todd_Trimble
Format: MarkdownItexActually, I see Zoran's stronger point as being the existence of a transfer principle for nonstandard analysis on invertible infinitesimals.

Actually, I see Zoran’s stronger point as being the existence of a transfer principle for nonstandard analysis on invertible infinitesimals.
- CommentRowNumber45.
- CommentAuthorDavid_Corfield
- CommentTimeDec 19th 2013
- PermaLink
Author: David_Corfield
Format: MarkdownItexTo repeat what I wrote elsewhere: I wonder if the difference between nilpotent and invertible infinitesimals has something to do with the not altogether straightforward relationship between category theory and model theory, that we once [discussed](http://golem.ph.utexas.edu/category/2008/07/category_theory_and_model_theo.html). I mean, you never make any use of that very model-theoretic transfer principle with nilpotent infinitesimals, do you? Perhaps this relates to the difference between geometric and logical morphisms in toposes. The transfer principle was used by Ngo in his proof of the fundamental lemma. Interesting that it's appearing in such a core area of maths. I wonder if there's something fundamental there, or merely a "hack".

To repeat what I wrote elsewhere:

I wonder if the difference between nilpotent and invertible infinitesimals has something to do with the not altogether straightforward relationship between category theory and model theory, that we once discussed. I mean, you never make any use of that very model-theoretic transfer principle with nilpotent infinitesimals, do you? Perhaps this relates to the difference between geometric and logical morphisms in toposes.

The transfer principle was used by Ngo in his proof of the fundamental lemma. Interesting that it’s appearing in such a core area of maths. I wonder if there’s something fundamental there, or merely a “hack”.
- CommentRowNumber46.
- CommentAuthorColin Tan
- CommentTimeDec 19th 2013
- PermaLink
Author: Colin Tan
Format: TextFor a historic example, the transfer principle was used by Artin in his solution of Hilbert's 17th problem. To prove that every nonnegative real polynomial is a sum of squares of rational functions, Artin used the model completeness of the theory of real closed fields to execute his transfer. Later proofs/ generalizations of this result come under the name Real Nullstellensatz and are proved by considering an object known as the real spectrum. This contrast seems to be an example of what David is alluding to above, the historic model theoretic approach and the later more flexible categorical approach. Does Cohen's proof of the independence of the continuum hypothesis and the later clarification of double negation as booleanification count as an example in this spirit? In category theory, we often say that, rather than a category of good objects, we rather work with a good category (completeness, cocompleteness, presentability) of objects. In model theory, the corresponding slogan would be, rather than a theory with good models, we rather work with a good theory (uncountable categoricity, quantifier elimination) of models.
For a historic example, the transfer principle was used by Artin in his solution of Hilbert's 17th problem. To prove that every nonnegative real polynomial is a sum of squares of rational functions, Artin used the model completeness of the theory of real closed fields to execute his transfer. Later proofs/ generalizations of this result come under the name Real Nullstellensatz and are proved by considering an object known as the real spectrum. This contrast seems to be an example of what David is alluding to above, the historic model theoretic approach and the later more flexible categorical approach.

Does Cohen's proof of the independence of the continuum hypothesis and the later clarification of double negation as booleanification count as an example in this spirit?

In category theory, we often say that, rather than a category of good objects, we rather work with a good category (completeness, cocompleteness, presentability) of objects. In model theory, the corresponding slogan would be, rather than a theory with good models, we rather work with a good theory (uncountable categoricity, quantifier elimination) of models.
- CommentRowNumber47.
- CommentAuthorMike Shulman
- CommentTimeDec 19th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexIn case anyone's antipathy to it arises from not knowing this, nonstandard analysis does have a nice category-theoretic description: it's the filterpower construction on a topos. The canonical functor from a topos to its filterpower is logical and conservative, and that is the transfer principle in a nutshell. From this perspective you can also think of the topos of nonstandard analysis as the "germ at infinity" of the topos of infinite sequences, which is certainly a natural construction and not a hack. There are [[big and little toposes|more toposes]] in heaven and earth, Horatio. (-: And I don't think I can teach Calc I students about Kaehler differential forms and homotopy SDG. I don't even understand myself what the stuff at [[Lie integration]] would look like in SDG -- it all seems to be expressed in terms of concrete models? Anyway, I suggest, Urs, that you not think of NSA as a competitor to SDG, even though they both contain things called "infinitesimals", but as another tool in the mathematician's toolbox which serves different purposes.

In case anyone’s antipathy to it arises from not knowing this, nonstandard analysis does have a nice category-theoretic description: it’s the filterpower construction on a topos. The canonical functor from a topos to its filterpower is logical and conservative, and that is the transfer principle in a nutshell. From this perspective you can also think of the topos of nonstandard analysis as the “germ at infinity” of the topos of infinite sequences, which is certainly a natural construction and not a hack. There are more toposes in heaven and earth, Horatio. (-:

And I don’t think I can teach Calc I students about Kaehler differential forms and homotopy SDG. I don’t even understand myself what the stuff at Lie integration would look like in SDG – it all seems to be expressed in terms of concrete models?

Anyway, I suggest, Urs, that you not think of NSA as a competitor to SDG, even though they both contain things called “infinitesimals”, but as another tool in the mathematician’s toolbox which serves different purposes.
- CommentRowNumber48.
- CommentAuthorUrs
- CommentTimeDec 19th 2013
- (edited Dec 19th 2013)
- PermaLink
Author: Urs
Format: MarkdownItexColin, what would be the relation of this application of the transfer principle to differential calculus eith explicit differentials? Mike, we seem to be talking past each other. Not everything that is expressed in topos theory is therefore the natural way to do something. Maybe remember the sympathies and antipathies towards Bohr toposes for another example of just this question. I still find that when I look around then sdg differentials play a thorough and foundational role in differential geometry both in its basic formulation but in particular in a bunch of powerful modern refinements. All of derived algebraic geometry, all of D-geometry and all the applications to pde theory, variational calculus etc that this has In contrast, for nonstandard analysis the main statement is that elementary calculus can be phrased this way. Is there anything that goes further?

Colin, what would be the relation of this application of the transfer principle to differential calculus eith explicit differentials?

Mike, we seem to be talking past each other. Not everything that is expressed in topos theory is therefore the natural way to do something. Maybe remember the sympathies and antipathies towards Bohr toposes for another example of just this question.

I still find that when I look around then sdg differentials play a thorough and foundational role in differential geometry both in its basic formulation but in particular in a bunch of powerful modern refinements. All of derived algebraic geometry, all of D-geometry and all the applications to pde theory, variational calculus etc that this has

In contrast, for nonstandard analysis the main statement is that elementary calculus can be phrased this way. Is there anything that goes further?
- CommentRowNumber49.
- CommentAuthorUrs
- CommentTimeDec 19th 2013
- (edited Dec 19th 2013)
- PermaLink
Author: Urs
Format: MarkdownItexBy the way, also nilpotent differentials have their transfer principle: that's the statement that for instance the Cahier topos is a model for differential cohesion. This means in particular that there is a certain geometric morphism from the standard smooth topos to that with synthetic infinitesimals. But i'd think these transfer principles are part of the notion of infinitesimals themselves. Saying that nonstandard analysis is good because it has a transfer principle is a bit like saying that the natural numbers are good because they have an element called zero

By the way, also nilpotent differentials have their transfer principle: that’s the statement that for instance the Cahier topos is a model for differential cohesion. This means in particular that there is a certain geometric morphism from the standard smooth topos to that with synthetic infinitesimals.

But i’d think these transfer principles are part of the notion of infinitesimals themselves. Saying that nonstandard analysis is good because it has a transfer principle is a bit like saying that the natural numbers are good because they have an element called zero
- CommentRowNumber50.
- CommentAuthorMike Shulman
- CommentTimeDec 19th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexIf you are really interested in applications of nonstandard analysis, you could try reading some books. Here are a few on my shelf: * Nelson, *Radically elementary probability theory* * Diener-Diener (eds.), *Nonstandard analysis in practice* * Imme van den Berg, *Nonstandard asymptotic analysis* * Leob-Wolff (eds.), *Nonstandard analysis for the working mathematician*. I haven't digested everything in these books, but I've learned a lot from them, and each of them goes *way* beyond elementary calculus. But perhaps the point of departure is that most of the applications are to *analysis*, whereas in #48 you seem to prefer applications to *geometry*. Analysis is an area of math that often doesn't seem especially amenable to elegant category-theoretic formulations, but that doesn't make it less important. Nonstandard analysis, essentially because it is a "synthetic" way to talk about orders of magnitude, does seem like it provides a more elegant way to do a lot of analysis. In other words, there's a reason we say "nonstandard *analysis*" but "synthetic differential *geometry*". (-:
If you are really interested in applications of nonstandard analysis, you could try reading some books. Here are a few on my shelf:
- Nelson, Radically elementary probability theory
- Diener-Diener (eds.), Nonstandard analysis in practice
- Imme van den Berg, Nonstandard asymptotic analysis
- Leob-Wolff (eds.), Nonstandard analysis for the working mathematician.
I haven’t digested everything in these books, but I’ve learned a lot from them, and each of them goes way beyond elementary calculus. But perhaps the point of departure is that most of the applications are to analysis, whereas in #48 you seem to prefer applications to geometry. Analysis is an area of math that often doesn’t seem especially amenable to elegant category-theoretic formulations, but that doesn’t make it less important. Nonstandard analysis, essentially because it is a “synthetic” way to talk about orders of magnitude, does seem like it provides a more elegant way to do a lot of analysis.

In other words, there’s a reason we say “nonstandard analysis” but “synthetic differential geometry”. (-:
- CommentRowNumber51.
- CommentAuthorMike Shulman
- CommentTimeDec 19th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexAlso perhaps interesting: <http://terrytao.wordpress.com/2013/12/07/ultraproducts-as-a-bridge-between-discrete-and-continuous-analysis/>

Also perhaps interesting: http://terrytao.wordpress.com/2013/12/07/ultraproducts-as-a-bridge-between-discrete-and-continuous-analysis/
- CommentRowNumber52.
- CommentAuthorUrs
- CommentTimeDec 19th 2013
- (edited Dec 19th 2013)
- PermaLink
Author: Urs
Format: MarkdownItexThanks, Mike, for the analysis/differential geometry dichotomy. I'll think about that. I have added the references that you displayed to _[nonstandard analysis -- References](http://ncatlab.org/nlab/show/nonstandard+analysis#References)_. Incidentally, that makes the list already available there a bit longer still; and my impression is that it would be useful if some expert organized these items a little and/or added some comments as to why one would want to track down which of them.

Thanks, Mike, for the analysis/differential geometry dichotomy. I’ll think about that.

I have added the references that you displayed to nonstandard analysis – References. Incidentally, that makes the list already available there a bit longer still; and my impression is that it would be useful if some expert organized these items a little and/or added some comments as to why one would want to track down which of them.
- CommentRowNumber53.
- CommentAuthorzskoda
- CommentTimeDec 19th 2013
- (edited Dec 19th 2013)
- PermaLink
Author: zskoda
Format: MarkdownItex> for nonstandard analysis the main statement is that elementary calculus can be phrased this way. Is there anything that goes further? Surely, there are advanced objects like Loeb nonstandard probability spaces, nonstandard set theory, transfer at the level of certain topoi in the picture etc. Nonstandard analysis is not _only_ about analysis.

for nonstandard analysis the main statement is that elementary calculus can be phrased this way. Is there anything that goes further?

Surely, there are advanced objects like Loeb nonstandard probability spaces, nonstandard set theory, transfer at the level of certain topoi in the picture etc. Nonstandard analysis is not only about analysis.
- CommentRowNumber54.
- CommentAuthorzskoda
- CommentTimeDec 19th 2013
- (edited Dec 19th 2013)
- PermaLink
Author: zskoda
Format: MarkdownItex* V. A. Lyubetskiĭ, _Оценки и пучки. О некоторых вопросах нестандартного анализа_, Uspekhi Mat. Nauk __44__ (1989), no. 4(268), 99--153, 256; translation _Valuations and sheaves. On some questions of non-standard analysis_, in Russian Math. Surveys __44__ (1989), no. 4, 37–112 [MR1023104](http://www.ams.org/mathscinet-getitem?mr=1023104) [doi](http://dx.doi.org/10.1070/RM1989v044n04ABEH002140) [IOP pdf](http://iopscience.iop.org/0036-0279/44/4/R03/pdf/0036-0279_44_4_R03.pdf) [rus pdf](http://www.mathnet.ru/php/getFT.phtml?jrnid=rm&paperid=1849&what=fullt&option_lang=rus) > We present some parts of a mathematical theory that is sometimes called Heyting-valued analysis (or nonstandard analysis in the broad sense). Sometimes this theory is considered as a part of general topos theory. One may surmise that this theory has some applications outside mathematical logic as well: in algebra and analysis, and even in a still wider context, for example, as in A. Robinson's well-known work on the application of nonstandard analysis in quantum field theory. > In Chapter I we present the actual method of Heyting-valued (in particular, Boolean-valued) analysis. Chapters II–IV contain specific examples of applications of the method of Heyting-valued analysis. In Chapter II we primarily consider the problem of the existence of a model companion of a locally axiomatizable class of rings. In Chapter III we consider a conjecture of P. S. Novikov [cf. Selected works (Russian), see p. 127, "Nauka'', Moscow, 1979; MR0545907 (80i:01017)]. In this chapter we discuss the transfer from classical to intuitionistic validity in an arbitrary ring. Novikov's paper established the possibility of such a transfer in the case of the ring Z. In Chapter IV, we construct, for some rings of continuous Y-valued functions (as algebras over the ring Y), a nonstandard representation Y˜ such that in a certain sense this algebra is similar to its ring of scalars Y. The appendix briefly describes examples of applications of Boolean-valued analysis in connection with problems of duality. Practically all the theorems and propositions are given complete proofs.
- V. A. Lyubetskiĭ, Оценки и пучки. О некоторых вопросах нестандартного анализа, Uspekhi Mat. Nauk 44 (1989), no. 4(268), 99–153, 256; translation Valuations and sheaves. On some questions of non-standard analysis, in Russian Math. Surveys 44 (1989), no. 4, 37–112 MR1023104 doi IOP pdf rus pdf
We present some parts of a mathematical theory that is sometimes called Heyting-valued analysis (or nonstandard analysis in the broad sense). Sometimes this theory is considered as a part of general topos theory. One may surmise that this theory has some applications outside mathematical logic as well: in algebra and analysis, and even in a still wider context, for example, as in A. Robinson’s well-known work on the application of nonstandard analysis in quantum field theory.

In Chapter I we present the actual method of Heyting-valued (in particular, Boolean-valued) analysis. Chapters II–IV contain specific examples of applications of the method of Heyting-valued analysis. In Chapter II we primarily consider the problem of the existence of a model companion of a locally axiomatizable class of rings. In Chapter III we consider a conjecture of P. S. Novikov [cf. Selected works (Russian), see p. 127, “Nauka”, Moscow, 1979; MR0545907 (80i:01017)]. In this chapter we discuss the transfer from classical to intuitionistic validity in an arbitrary ring. Novikov’s paper established the possibility of such a transfer in the case of the ring Z. In Chapter IV, we construct, for some rings of continuous Y-valued functions (as algebras over the ring Y), a nonstandard representation Y˜ such that in a certain sense this algebra is similar to its ring of scalars Y. The appendix briefly describes examples of applications of Boolean-valued analysis in connection with problems of duality. Practically all the theorems and propositions are given complete proofs.
- CommentRowNumber55.
- CommentAuthorzskoda
- CommentTimeDec 19th 2013
- (edited Dec 19th 2013)
- PermaLink
Author: zskoda
Format: MarkdownItexJust a sample set-theoretic treatise in the framework of nonstandard analysis. * В. Г. Кановей, В. А. Любецкий, _Проблемы теоретико-множественного нестандартного анализа_, [pdf](http://www.mathnet.ru/php/getFT.phtml?jrnid=rm&paperid=5588&what=fullt&option_lang=rus); transl. Vladimir G Kanovei, Vasilii A Lyubetskii, _Problems of set-theoretic non-standard analysis_, 2007 Russ. Math. Surv. 62 45 [MR2352413](http://www.ams.org/mathscinet-getitem?mr=2352413) [doi](http://dx.doi.org/10.1070/RM2007v062n01ABEH004381) [IOP pdf](http://iopscience.iop.org/0036-0279/62/1/R02/pdf/0036-0279_62_1_R02.pdf)
Just a sample set-theoretic treatise in the framework of nonstandard analysis.
- В. Г. Кановей, В. А. Любецкий, Проблемы теоретико-множественного нестандартного анализа, pdf; transl. Vladimir G Kanovei, Vasilii A Lyubetskii, Problems of set-theoretic non-standard analysis, 2007 Russ. Math. Surv. 62 45 MR2352413 doi IOP pdf
- CommentRowNumber56.
- CommentAuthorUrs
- CommentTimeDec 19th 2013
- PermaLink
Author: Urs
Format: MarkdownItexZoran, thanks for taking the time to provide more pointers. But I think your choice of examples -- e-g- nonstandard probability spaces -- confirms that the distinction between analysis and differential calculus which Mike amplified above is relevant. Probability spaces are not a topic involving differential calculus. For me that suggestion of Mike's is a good conclusion of this little debate here, and I'd tend to leave it at that for the moment, since I should be looking into other things. If I had more time I would maybe add a little paragraph to this effect to the nLab entry.

Zoran, thanks for taking the time to provide more pointers. But I think your choice of examples – e-g- nonstandard probability spaces – confirms that the distinction between analysis and differential calculus which Mike amplified above is relevant. Probability spaces are not a topic involving differential calculus.

For me that suggestion of Mike’s is a good conclusion of this little debate here, and I’d tend to leave it at that for the moment, since I should be looking into other things. If I had more time I would maybe add a little paragraph to this effect to the nLab entry.
- CommentRowNumber57.
- CommentAuthorColin Tan
- CommentTimeDec 20th 2013
- PermaLink
Author: Colin Tan
Format: TextUrs, referring to comments 45,46 and 48, I was trying to illustrate the difference within the model theoretic and the category theoretic approaches to using differentials in calculus, in response to David's comments. Using a nilpotent differential amounts to the category theoretic approach of having a good category of objects. A ring with nilpotents is not a field and does not have the same first-order theory as the real field. However, using an invertible infinitesimal follows the model theoretic approach. A hyperreal field has the same first-order theory as the real field, but it a nonstandard model where we can do computations and take the standard part to recover back calculus in the standard model.
Urs, referring to comments 45,46 and 48, I was trying to illustrate the difference within the model theoretic and the category theoretic approaches to using differentials in calculus, in response to David's comments. Using a nilpotent differential amounts to the category theoretic approach of having a good category of objects. A ring with nilpotents is not a field and does not have the same first-order theory as the real field. However, using an invertible infinitesimal follows the model theoretic approach. A hyperreal field has the same first-order theory as the real field, but it a nonstandard model where we can do computations and take the standard part to recover back calculus in the standard model.
- CommentRowNumber58.
- CommentAuthorDavid_Corfield
- CommentTimeDec 20th 2013
- PermaLink
Author: David_Corfield
Format: MarkdownItexConcerning the analysis/geometry dichotomy, in view of algebra-geometry duality, this might also be seen as the analysis/algebra dichotomy. Then we have some interesting comments by Terry Tao, which I collected [here](http://ncatlab.org/davidcorfield/show/Two+Cultures#terry_tao_6). In particular, in the section 'Tao on Buzz' he looks to characterise analysis and algebra in terms of open and closed conditions. There's also something there on NSA.

Concerning the analysis/geometry dichotomy, in view of algebra-geometry duality, this might also be seen as the analysis/algebra dichotomy. Then we have some interesting comments by Terry Tao, which I collected here. In particular, in the section ’Tao on Buzz’ he looks to characterise analysis and algebra in terms of open and closed conditions. There’s also something there on NSA.
- CommentRowNumber59.
- CommentAuthorTobyBartels
- CommentTimeJan 27th 2014
- PermaLink
Author: TobyBartels
Format: MarkdownItexRe #41, the new version of the notes on differentials for my Multivariable Calclulus class are done: [check them out](http://tobybartels.name/MATH-2080/2014w/differentials/). I now feel like the end (where I get to this bit) is a bit anticlimactic, and I wonder if I should redo the whole thing *starting* with the action of differentials on curves. Incidentally, higher differentials such as $\mathrm{d}^2 u$ (where remember we are *not* doing the exterior differential, which would just be zero, but rather something relevant to second derivatives) cannot be understood as acting on vectors (since they act on order-$2$ jets) but can be understood as acting on curves. With the emphasis on curves, I suppose that I\'m secretly doing calculus on diffeological spaces. (I remarked in class on Thursday that there are very general notions of ‘differentiable space’ even beyond the differentiable manifolds that one is likely to meet in an advanced course, but in theory everything in *this* course is done on open subspaces of $\mathbb{R}^n$ for $n = 1, 2, 3$.)

Re #41, the new version of the notes on differentials for my Multivariable Calclulus class are done: check them out. I now feel like the end (where I get to this bit) is a bit anticlimactic, and I wonder if I should redo the whole thing starting with the action of differentials on curves.

Incidentally, higher differentials such as $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>u</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}^2 u</annotation></semantics></math>$ (where remember we are not doing the exterior differential, which would just be zero, but rather something relevant to second derivatives) cannot be understood as acting on vectors (since they act on order- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>2</mn></mrow><annotation encoding="application/x-tex">2</annotation></semantics></math>$ jets) but can be understood as acting on curves.

With the emphasis on curves, I suppose that I'm secretly doing calculus on diffeological spaces. (I remarked in class on Thursday that there are very general notions of ‘differentiable space’ even beyond the differentiable manifolds that one is likely to meet in an advanced course, but in theory everything in this course is done on open subspaces of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>ℝ</mi> <mi>n</mi></msup></mrow><annotation encoding="application/x-tex">\mathbb{R}^n</annotation></semantics></math>$ for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>n</mi><mo>=</mo><mn>1</mn><mo>,</mo><mn>2</mn><mo>,</mo><mn>3</mn></mrow><annotation encoding="application/x-tex">n = 1, 2, 3</annotation></semantics></math>$ .)
- CommentRowNumber60.
- CommentAuthorTobyBartels
- CommentTimeFeb 8th 2014
- (edited Feb 8th 2014)
- PermaLink
Author: TobyBartels
Format: MarkdownItexIt seems that I went to far with the strategy of pushing everything back to curves, since $x, y \mapsto y^3/(x^2 + y^2)$ (continuously extended to the origin) is not differentiable at the origin (by the usual definition), even though its composite with any differentiable curve is differentiable (indeed continuously so if the curve is continuously differentiable). [[Boman\'s theorem]] says that you can push things back to curves for *smooth* maps, and this is what really matters, so I may just do that next term, leaving the fine print for merely differentiable maps to the textbook.

It seems that I went to far with the strategy of pushing everything back to curves, since $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>,</mo><mi>y</mi><mo>↦</mo><msup><mi>y</mi> <mn>3</mn></msup><mo stretchy="false">/</mo><mo stretchy="false">(</mo><msup><mi>x</mi> <mn>2</mn></msup><mo>+</mo><msup><mi>y</mi> <mn>2</mn></msup><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">x, y \mapsto y^3/(x^2 + y^2)</annotation></semantics></math>$ (continuously extended to the origin) is not differentiable at the origin (by the usual definition), even though its composite with any differentiable curve is differentiable (indeed continuously so if the curve is continuously differentiable).

Boman's theorem says that you can push things back to curves for smooth maps, and this is what really matters, so I may just do that next term, leaving the fine print for merely differentiable maps to the textbook.
- CommentRowNumber61.
- CommentAuthorMike Shulman
- CommentTimeFeb 8th 2014
- PermaLink
Author: Mike Shulman
Format: MarkdownItexI presume it is also not sufficient to say that $f$ is differentiable if $f\circ c$ is differentiable for all $c$ and moreover there exists a differential form $df$ such that $\langle df(p)|c\rangle = (f\circ c)'(0)$? That is, you also need to ensure that the limits defining each derivative $(f \circ c)'(0)$ happen "simultaneously in all directions"?

I presume it is also not sufficient to say that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ is differentiable if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>∘</mo><mi>c</mi></mrow><annotation encoding="application/x-tex">f\circ c</annotation></semantics></math>$ is differentiable for all $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>c</mi></mrow><annotation encoding="application/x-tex">c</annotation></semantics></math>$ and moreover there exists a differential form $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>df</mi></mrow><annotation encoding="application/x-tex">df</annotation></semantics></math>$ such that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">⟨</mo><mi>df</mi><mo stretchy="false">(</mo><mi>p</mi><mo stretchy="false">)</mo><mo stretchy="false">|</mo><mi>c</mi><mo stretchy="false">⟩</mo><mo>=</mo><mo stretchy="false">(</mo><mi>f</mi><mo>∘</mo><mi>c</mi><mo stretchy="false">)</mo><mo>′</mo><mo stretchy="false">(</mo><mn>0</mn><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">\langle df(p)|c\rangle = (f\circ c)'(0)</annotation></semantics></math>$ ? That is, you also need to ensure that the limits defining each derivative $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi>f</mi><mo>∘</mo><mi>c</mi><mo stretchy="false">)</mo><mo>′</mo><mo stretchy="false">(</mo><mn>0</mn><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">(f \circ c)'(0)</annotation></semantics></math>$ happen “simultaneously in all directions”?
- CommentRowNumber62.
- CommentAuthorTobyBartels
- CommentTimeFeb 9th 2014
- PermaLink
Author: TobyBartels
Format: MarkdownItexMy working definition of ‘differential form’ (of rank $1$) in this class is a formal linear combination of differentials with coefficients from the ring of appreciable quantities (not stated like that, of course, but given by examples). Then $\mathrm{d}f$ is automatically the differential form desired (well, assuming that it exists, since I actually refuse to define $\mathrm{d}f$ until $f$ is known to be differentiable, but then your proposal is circular). But ignoring that context, and defining a differential form abstractly as an operator on differentiable curves with appropriate properties (such as linearity), then the answer is Yes if you require $\mathrm{d}f$ to be a *continuous* differential form; and in that case, we can conclude that $f$ is *continuously* differentiable. Without that, I don\'t know. In the example of $x, y \mapsto y^3/(x^2 + y^2)$, not only is $\mathrm{d}f$ not continuous, it\'s not linear (at $(x,y) = (0,0)$); if you apply it to a line through the origin with tangent vector $[a,b]$, then the result is $b^3/(a^2 + b^2)$. So this $\mathrm{d}f$ is not a differential form. If $\mathrm{d}f$ has the properties of a differential form, does this guarantee that $f$ is differentiable in the standard sense? That would be nice! Is additivity sufficient? That would be particularly nice! I don\'t know.

My working definition of ‘differential form’ (of rank $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ ) in this class is a formal linear combination of differentials with coefficients from the ring of appreciable quantities (not stated like that, of course, but given by examples). Then $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}f</annotation></semantics></math>$ is automatically the differential form desired (well, assuming that it exists, since I actually refuse to define $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}f</annotation></semantics></math>$ until $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ is known to be differentiable, but then your proposal is circular).

But ignoring that context, and defining a differential form abstractly as an operator on differentiable curves with appropriate properties (such as linearity), then the answer is Yes if you require $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}f</annotation></semantics></math>$ to be a continuous differential form; and in that case, we can conclude that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ is continuously differentiable.

Without that, I don't know. In the example of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>,</mo><mi>y</mi><mo>↦</mo><msup><mi>y</mi> <mn>3</mn></msup><mo stretchy="false">/</mo><mo stretchy="false">(</mo><msup><mi>x</mi> <mn>2</mn></msup><mo>+</mo><msup><mi>y</mi> <mn>2</mn></msup><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">x, y \mapsto y^3/(x^2 + y^2)</annotation></semantics></math>$ , not only is $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}f</annotation></semantics></math>$ not continuous, it's not linear (at $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi>x</mi><mo>,</mo><mi>y</mi><mo stretchy="false">)</mo><mo>=</mo><mo stretchy="false">(</mo><mn>0</mn><mo>,</mo><mn>0</mn><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">(x,y) = (0,0)</annotation></semantics></math>$ ); if you apply it to a line through the origin with tangent vector $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">[</mo><mi>a</mi><mo>,</mo><mi>b</mi><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">[a,b]</annotation></semantics></math>$ , then the result is $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>b</mi> <mn>3</mn></msup><mo stretchy="false">/</mo><mo stretchy="false">(</mo><msup><mi>a</mi> <mn>2</mn></msup><mo>+</mo><msup><mi>b</mi> <mn>2</mn></msup><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">b^3/(a^2 + b^2)</annotation></semantics></math>$ . So this $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}f</annotation></semantics></math>$ is not a differential form.

If $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}f</annotation></semantics></math>$ has the properties of a differential form, does this guarantee that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ is differentiable in the standard sense? That would be nice! Is additivity sufficient? That would be particularly nice! I don't know.
- CommentRowNumber63.
- CommentAuthorTobyBartels
- CommentTimeFeb 10th 2014
- PermaLink
Author: TobyBartels
Format: MarkdownItexOK, yes, your idea does work! Specifically, define $\mathrm{d}f$ as the operation (in general partially defined) on differentiable parametrized curves (in a given Cartesian space, or more generally in a $C^1$ manifold) that takes $c$ to $(f \circ c)'(0)$ (if this exists). Also define $\mathrm{d}f(p)$ to be the restriction of that operation to curves with $c(0) = p$. Then $\mathrm{d}f(p)$ might be defined on all such curves, and (if so) it might respect the equivalence of curves that defines a tangent vector at $p$, and (if so) it might be linear and so a cotangent vector at $p$. If so, then $f$ is differentiable at $p$, as desired. The proof is that the definition of differentiability itself calls for nothing more than this cotangent vector. The next step is to make this into a definition of generalized differentiable (rather than smooth) space, by not using the previously known structure of tangent vectors.

OK, yes, your idea does work!

Specifically, define $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}f</annotation></semantics></math>$ as the operation (in general partially defined) on differentiable parametrized curves (in a given Cartesian space, or more generally in a $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>C</mi> <mn>1</mn></msup></mrow><annotation encoding="application/x-tex">C^1</annotation></semantics></math>$ manifold) that takes $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>c</mi></mrow><annotation encoding="application/x-tex">c</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi>f</mi><mo>∘</mo><mi>c</mi><mo stretchy="false">)</mo><mo>′</mo><mo stretchy="false">(</mo><mn>0</mn><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">(f \circ c)'(0)</annotation></semantics></math>$ (if this exists). Also define $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">(</mo><mi>p</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">\mathrm{d}f(p)</annotation></semantics></math>$ to be the restriction of that operation to curves with $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>c</mi><mo stretchy="false">(</mo><mn>0</mn><mo stretchy="false">)</mo><mo>=</mo><mi>p</mi></mrow><annotation encoding="application/x-tex">c(0) = p</annotation></semantics></math>$ . Then $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">(</mo><mi>p</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">\mathrm{d}f(p)</annotation></semantics></math>$ might be defined on all such curves, and (if so) it might respect the equivalence of curves that defines a tangent vector at $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi></mrow><annotation encoding="application/x-tex">p</annotation></semantics></math>$ , and (if so) it might be linear and so a cotangent vector at $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi></mrow><annotation encoding="application/x-tex">p</annotation></semantics></math>$ . If so, then $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ is differentiable at $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi></mrow><annotation encoding="application/x-tex">p</annotation></semantics></math>$ , as desired.

The proof is that the definition of differentiability itself calls for nothing more than this cotangent vector.

The next step is to make this into a definition of generalized differentiable (rather than smooth) space, by not using the previously known structure of tangent vectors.
- CommentRowNumber64.
- CommentAuthorDavidRoberts
- CommentTimeFeb 12th 2014
- PermaLink
Author: DavidRoberts
Format: MarkdownItexJust throwing this in the mix: <http://mathoverflow.net/a/7632/4177>

Just throwing this in the mix: http://mathoverflow.net/a/7632/4177
- CommentRowNumber65.
- CommentAuthorMike Shulman
- CommentTimeJul 3rd 2014
- PermaLink
Author: Mike Shulman
Format: MarkdownItexToby, you might have something to contribute here: <http://matheducators.stackexchange.com/questions/2246/practical-experience-with-teaching-differentials-in-freshman-calc>

Toby, you might have something to contribute here: http://matheducators.stackexchange.com/questions/2246/practical-experience-with-teaching-differentials-in-freshman-calc
- CommentRowNumber66.
- CommentAuthorTobyBartels
- CommentTimeJul 4th 2014
- PermaLink
Author: TobyBartels
Format: MarkdownItexThanks!

Thanks!
- CommentRowNumber67.
- CommentAuthorTobyBartels
- CommentTimeApr 20th 2015
- (edited Apr 20th 2015)
- PermaLink
Author: TobyBartels
Format: MarkdownItexThe [current term\'s Mulivariable Calculus course](http://tobybartels.name/MATH-2080/2015s/) has a revised version of the introduction to differentials and $1$-forms, split into [two](http://tobybartels.name/MATH-2080/2015s/diffforms/) [parts](http://tobybartels.name/MATH-2080/2015s/differentials/). (Although I\'m not really done rewriting the second part, it\'s acceptable, and I needed to hand it out in class today.) There will be more handouts. This year features the general statement that a differential form is *any* expression with differentials in it, said with the confidence that I know how to define it if pressed! But in the first handout, $\mathrm{d}x$, $\mathrm{d}^2x$, etc are treated as independent variables in a formal expression, which I never really liked. Fortunately, there is a real definition in the second handout, although what it really does is to define equality as an equivalence relation on such formal expressions. Edit: Also, the second handout formally defines a function $f$ on a Cartesian space to be differentiable at a point $P$ if $(f \circ C)$ is not only differentiable wherever the value of $C$ is $P$ and $C$ is differentiable there, but also depends only on the derivative of $C$ there (the velocity tangent vector) and depends on that linearly (stated as the existence of an appropriate row vector $\Del{f}(P)$). That actually came out looking simpler than I had originally anticipated!

The current term's Mulivariable Calculus course has a revised version of the introduction to differentials and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ -forms, split into two parts. (Although I'm not really done rewriting the second part, it's acceptable, and I needed to hand it out in class today.) There will be more handouts.

This year features the general statement that a differential form is any expression with differentials in it, said with the confidence that I know how to define it if pressed! But in the first handout, $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}x</annotation></semantics></math>$ , $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}^2x</annotation></semantics></math>$ , etc are treated as independent variables in a formal expression, which I never really liked. Fortunately, there is a real definition in the second handout, although what it really does is to define equality as an equivalence relation on such formal expressions.

Edit: Also, the second handout formally defines a function $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ on a Cartesian space to be differentiable at a point $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>P</mi></mrow><annotation encoding="application/x-tex">P</annotation></semantics></math>$ if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi>f</mi><mo>∘</mo><mi>C</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">(f \circ C)</annotation></semantics></math>$ is not only differentiable wherever the value of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ is $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>P</mi></mrow><annotation encoding="application/x-tex">P</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ is differentiable there, but also depends only on the derivative of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ there (the velocity tangent vector) and depends on that linearly (stated as the existence of an appropriate row vector $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo>∇</mo><mi>f</mi><mo stretchy="false">(</mo><mi>P</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">\Del{f}(P)</annotation></semantics></math>$ ). That actually came out looking simpler than I had originally anticipated!
- CommentRowNumber68.
- CommentAuthorTobyBartels
- CommentTimeMay 5th 2015
- PermaLink
Author: TobyBartels
Format: MarkdownItexI should probably add to [[differentiable function]] my proof that this definition of differentiability is correct. It\'s stronger than requiring all directional derivatives and then requiring these to depend linearly on the direction. Since the derivative of $f \circ C$ depends on $C$ only through the derivative of $C$ and yet there exist nondifferentiable functions with linear directional derivatives (example: $y^3/x$ extended as $0$, at $(0,0)$), one might think that it would be insufficient to require $(f \circ C)'$ to depend on $C$ only linearly through the derivative of $C$. However, the claim that $(f \circ C)'$ depends on $C$ only through the derivative of $C$ fails for nondifferentiable functions with linear directional derivatives! (In the example, $(f \circ C)'$ is $0$ at $(0,0)$ when $C$ is a line but not, say, when $x = y^2$ on $C$.)

I should probably add to differentiable function my proof that this definition of differentiability is correct. It's stronger than requiring all directional derivatives and then requiring these to depend linearly on the direction. Since the derivative of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>∘</mo><mi>C</mi></mrow><annotation encoding="application/x-tex">f \circ C</annotation></semantics></math>$ depends on $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ only through the derivative of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ and yet there exist nondifferentiable functions with linear directional derivatives (example: $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>y</mi> <mn>3</mn></msup><mo stretchy="false">/</mo><mi>x</mi></mrow><annotation encoding="application/x-tex">y^3/x</annotation></semantics></math>$ extended as $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn></mrow><annotation encoding="application/x-tex">0</annotation></semantics></math>$ , at $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mn>0</mn><mo>,</mo><mn>0</mn><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">(0,0)</annotation></semantics></math>$ ), one might think that it would be insufficient to require $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi>f</mi><mo>∘</mo><mi>C</mi><mo stretchy="false">)</mo><mo>′</mo></mrow><annotation encoding="application/x-tex">(f \circ C)'</annotation></semantics></math>$ to depend on $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ only linearly through the derivative of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ . However, the claim that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi>f</mi><mo>∘</mo><mi>C</mi><mo stretchy="false">)</mo><mo>′</mo></mrow><annotation encoding="application/x-tex">(f \circ C)'</annotation></semantics></math>$ depends on $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ only through the derivative of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ fails for nondifferentiable functions with linear directional derivatives! (In the example, $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi>f</mi><mo>∘</mo><mi>C</mi><mo stretchy="false">)</mo><mo>′</mo></mrow><annotation encoding="application/x-tex">(f \circ C)'</annotation></semantics></math>$ is $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn></mrow><annotation encoding="application/x-tex">0</annotation></semantics></math>$ at $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mn>0</mn><mo>,</mo><mn>0</mn><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">(0,0)</annotation></semantics></math>$ when $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ is a line but not, say, when $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>=</mo><msup><mi>y</mi> <mn>2</mn></msup></mrow><annotation encoding="application/x-tex">x = y^2</annotation></semantics></math>$ on $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ .)
- CommentRowNumber69.
- CommentAuthorMike Shulman
- CommentTimeMay 5th 2015
- PermaLink
Author: Mike Shulman
Format: MarkdownItexVery interesting!

Very interesting!
- CommentRowNumber70.
- CommentAuthorTobyBartels
- CommentTimeMay 6th 2015
- PermaLink
Author: TobyBartels
Format: MarkdownItexYes, when I saw that example on [[differentiable map]], I was at first worried that I\'d made a mistake, and I tried to put that example through my proof, which made me realize that it didn\'t apply.

Yes, when I saw that example on differentiable map, I was at first worried that I'd made a mistake, and I tried to put that example through my proof, which made me realize that it didn't apply.
- CommentRowNumber71.
- CommentAuthorTobyBartels
- CommentTimeNov 25th 2016
- PermaLink
Author: TobyBartels
Format: MarkdownItexHere is a multivariable Calculus textbook, intended for undergraduates who have had only one-variable Calculus and no more advanced mathematics, that covers the Stokes Theorems using differential forms. <http://matrixeditions.com/UnifiedApproach5thedSamples.html>

Here is a multivariable Calculus textbook, intended for undergraduates who have had only one-variable Calculus and no more advanced mathematics, that covers the Stokes Theorems using differential forms. http://matrixeditions.com/UnifiedApproach5thedSamples.html

1 to 71 of 71

nForum

Discussion Feed

Not signed in

Site Tag Cloud

Atrium > Mathematics, Physics & Philosophy: differentials