Atrium > Mathematics, Physics & Philosophy: What is a variable?

- Comment 1
- Author: Mike Shulman
- Date: Nov 6th 2013
My calculus book uses many different notations for the derivative of $y = f(x)$ with respect to $x$, such as

$$ \frac{dy}{dx} \quad y'\quad f'(x) \quad D f(x) \quad \frac{df}{dx} $$

Recently I've found that I kind of object to a couple of these. For instance, consider $\frac{df}{dx}$. One of the things I try to teach my students is that when we define a function $f$ by writing $f(x) = x^2$, say, the variable $x$ is a dummy variable; if we wrote $f(t) = t^2$ we would be defining the same function, namely the one which squares its input. But if "$f$" denotes a function of this (usual mathematical) sort, then how can we write $\frac{df}{dx}$ to mean its derivative, since $f$ doesn't know that we called its input variable $x$?

Notations like $f'(x)$ and $D f (x)$ don't have this problem, because $f'$ and $D f$ denote the derivative function of $f$, which assigns to each input value the derivative of $f$ at that value, and so $f'(x)$ and $D f (x)$ just mean evaluation of this function at that value. I suppose we could interpret $\frac{df}{dx}$ similarly if we regarded "$\frac{df}{d}$" as the derivative function of $f$, which we evaluate at something by placing it after the $d$ in the denominator, but this seems strained, and would suggest even odder notations such as writing $\frac{df}{d3}$ for $f'(3)$.

It was pointed out to me that this kind of notation is even more common in multivariable calculus, where we write things like $\frac{\partial f}{\partial x}$ and $\frac{\partial f}{\partial y}$, and in this case there aren't good alternatives available, since we have to indicate somehow whether it is the first or second input variable of $f$ with respect to which we take the derivative.

From a differential-geometric viewpoint, one answer is to say that $x$ denotes a standard coordinate function on the 1-dimensional manifold that is the domain of $f$, and so we are actually taking the derivative with respect to a vector field associated to that function. We can even regard $\frac{df}{dx}$ as a literal quotient of differential 1-forms, since 1-forms on a 1-manifold are a 1-dimensional vector space at each point, so the quotient of two of them is a real number. But while logically consistent, this seems to undercut the force of the lesson of dummy variables, since we are endowing $x$ with a special status not shared by $t$.

Using $x$ to denote the coordinate function has the other interesting consequence that it makes it okay to say "the function $x^2+1$" (since we can multiply and add functions together), instead of insisting on saying "the function $f$ defined by $f(x) = x^2+1$". Again this feels like it undercuts the lesson of what a function is, and yet I find that it's hard to teach a calculus class without eventually slipping into saying "the function $x^2+1$". With $x$ as a function we can also write "$f = x^2+1$", which again is something that I'm used to indoctrinating my students against.

I also have a problem with the notation $y'$, for a more pragmatic reason. Suppose we want to take the derivative of $y = (3x+1)^4$ using the chain rule. A nice way to do it is to make a substitution $u = 3x+1$, so that $y = u^4$, and then use differentials:

$$du = 3 dx$$

$$dy = 4u^3 du = 4(3x+1)^3(3 dx)$$

$$\frac{dy}{dx}= 12(3x+1)^3$$

The problem here is that the notation $y'$ doesn't indicate what variable we differentiate with respect to, and in this calculation we have two derivatives of $y$, namely $\frac{dy}{dx} = 12(3x+1)^3$ and $\frac{dy}{du}=4u^3$, which are not equal even after substituting the value of $u = 3x+1$. Here the solution seems to be straightforward: just don't write $y'$. But if we can write $f = x^2+1$ just like $y = x^2+1$, and if we allow the notation $f'(x) = 2x$ and hence also $f' = 2x$, then we should just as well have $y' = 2x$.

Does anyone have a good solution? I feel like at least part of the problem comes from confusing $\mathbb{R}$ as the real numbers with $\mathbb{R}$ as a 1-dimensional manifold, but I haven't exactly managed to pin down yet how to solve it from that point of view.
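As a rough illustration of the "derivative function" reading of $f'$ and $D f$, here is a minimal Python sketch. It is only a numerical stand-in: the name `D`, the step size `h`, and the central-difference approximation are assumptions made for this example, not anything from the calculus book. It shows that renaming the dummy variable does not change the function being differentiated, and that "the derivative of $y$" gives two different numbers depending on whether $y$ is treated as a function of $x$ or of $u = 3x+1$.

```python
def D(f, h=1e-6):
    """Return a function approximating f' by a central difference (illustrative only)."""
    def derivative(a):
        return (f(a + h) - f(a - h)) / (2 * h)
    return derivative

# The name of the bound variable is irrelevant: both lambdas square their input.
square_x = lambda x: x ** 2
square_t = lambda t: t ** 2

# y viewed as a function of x, y viewed as a function of u, and u as a function of x.
y_of_x = lambda x: (3 * x + 1) ** 4
y_of_u = lambda u: u ** 4
u_of_x = lambda x: 3 * x + 1

x0 = 1.0
dy_dx = D(y_of_x)(x0)          # close to 12 * (3*x0 + 1)**3
dy_du = D(y_of_u)(u_of_x(x0))  # close to 4 * u**3 at u = 3*x0 + 1
du_dx = D(u_of_x)(x0)          # close to 3

print(D(square_x)(3.0), D(square_t)(3.0))
print(dy_dx, dy_du, dy_du * du_dx)
```

In this sketch `D` never sees a variable name, only a function, which is why it has no way to express $\frac{dy}{du}$ versus $\frac{dy}{dx}$ except by being handed two different functions.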

- Comment 2
- Author: hilbertthm90
- Date: Nov 6th 2013
This doesn't answer the broader question, but there is a "better" alternative for the multivariable calculus notation. It is pretty typical to use $f_y (x,y)$ to denote the partial derivative of $f$ with respect to $y$.

- Comment 3
- Author: Zhen Lin
- Date: Nov 6th 2013
For what it's worth, Mathematica writes multivariable derivatives as $f^{(i,j,\ldots,k)}$, which means the $i$-th partial derivative with respect to the first variable, the $j$-th partial derivative with respect to the second variable, etc. Unfortunately, this notation presupposes the commutativity of partial derivatives.

On $f'$ and $\frac{d y}{d x}$: I think you are right there. In order to use notations like $f'$ consistently, it appears to be necessary to abandon notions like "change of variable" and regard $x \mapsto (3 x + 1)^4$ and $u \mapsto u^4$ as distinct functions. So it is incompatible with the setup where calculus expressions are regarded as functions on some manifold, because there are no preferred coordinates. From a syntactic point of view it does seem rather disturbing that the $x$ in the denominator of $\frac{d y}{d x}$ appears to be bound and yet is free in $\frac{d y}{d x}$ itself. It's almost as if $\frac{d}{d x}$ is some kind of variable binding operator like $\lambda$ or $\prod$ or $\sum$... except for the fact that it doesn't bind the variable at all! Compare:

$$x : \mathbb{R} \vdash y \equiv (3 x + 1)^4 : \mathbb{R}$$

$$x : \mathbb{R} \vdash \frac{d y}{d x} \equiv 12 (3 x + 1)^3 : \mathbb{R}$$

$$\vdash \lambda x . (3 x + 1)^4 : \mathbb{R} \to \mathbb{R}$$

Accordingly, we should also require the use of substitutions instead of evaluations when working with the $\frac{d y}{d x}$ notation: so $\frac{d y}{d x} |_{x = 0}$ instead of $\frac{d y}{d x} (0)$.
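To make the positional reading of $f^{(i,j,\ldots,k)}$ concrete, here is a small Python sketch. The helper names `partial` and `multi_partial`, the 0-based indexing, and the finite-difference step are hypothetical choices for this illustration; this is not how Mathematica computes derivatives. The point is only that the derivative is selected by argument position, so the names of the input variables never enter.

```python
def partial(f, index, h=1e-4):
    """Approximate the partial derivative of f in its argument at position `index` (0-based)."""
    def df(*args):
        up, down = list(args), list(args)
        up[index] += h
        down[index] -= h
        return (f(*up) - f(*down)) / (2 * h)
    return df

def multi_partial(f, orders):
    """Apply `orders[i]` partial derivatives in argument i, taking the arguments in order."""
    g = f
    for index, order in enumerate(orders):
        for _ in range(order):
            g = partial(g, index)
    return g

# Two definitions of the same function with different dummy variable names.
f = lambda x, y: x ** 2 * y ** 3
g = lambda s, t: s ** 2 * t ** 3

print(partial(f, 1)(1.0, 2.0), partial(g, 1)(1.0, 2.0))
print(multi_partial(f, (1, 1))(1.0, 2.0))
```

Note that `multi_partial` has to pick some order in which to apply the derivatives, which is one way to see why a multi-index like $(i,j,\ldots,k)$ only determines a single function when the partial derivatives commute.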

- Comment 4
- Author: David_Corfield
- Date: Nov 6th 2013
What would elementary calculus look like in cohesive homotopy type theory?
- CommentRowNumber5.
- CommentAuthorUrs
- CommentTimeNov 6th 2013
- (edited Nov 6th 2013)
- PermaLink
Author: Urs
David,

with cohesion one can essentially characterize a map $\mathbf{d} \colon \mathbb{R} \longrightarrow \mathbf{\Omega}^1_{cl}$ such that postcomposition of a function $f \colon X \longrightarrow \mathbb{R}$ with this map yields the derivative $\mathbf{d} f \in \mathbf{\Omega}^1_{cl}(X)$ of $f$. This is discussed in some detail at _geometry of physics_ in the section [4. Differentiation](http://ncatlab.org/nlab/show/geometry+of+physics#Differentiation).

A variant is a certain homotopy pullback of this construction, which yields variational calculus, as discussed there in the section _[In terms of smooth spaces](http://ncatlab.org/nlab/show/variational+calculus#InTermsOfSmoothSpaces)_.

In differential cohesion one can get hold of the infinitesimal interval $D$ and then proceed as in synthetic differential geometry. For instance, using this one can describe differential equations, as discussed there in the section _[In terms of synthetic differential equations](http://ncatlab.org/nlab/show/differential+equation#InSynthDiff)_.

Moreover, differential cohesion encodes D-geometry and hence in principle allows one to talk about differential equations in that way.
- CommentRowNumber6.
- CommentAuthorUrs
- CommentTimeNov 6th 2013
- PermaLink
Author: Urs
I have added my previous reply as a paragraph to _[differential calculus -- In cohesive homotopy theory](http://ncatlab.org/nlab/show/differential+calculus#InCohesiveHomotopyTheory)_.
- CommentRowNumber7.
- CommentAuthorDavid_Corfield
- CommentTimeNov 6th 2013
- (edited Nov 6th 2013)
- PermaLink
Author: David_Corfield
Thanks. I meant my question to ask whether, in some idealised setting where Mike has complete control over his students’ maths education and has taught them HoTT from an early age, he would ever be confronted with the issues he raised in #1 when he comes to teach calculus and adds the cohesive axioms.

He has $f: R \to R$, with $f(x) = (3 x + 1)^4$. So $f = g(h)$ for obvious $g$ and $h$. So then, using your $\mathbf{d}$,
$$ \mathbf{d} f = \mathbf{d} g(h), $$
at which point a chain rule kicks in. (I’m waiting for geometry of physics to load to read Prop. 26, but it takes about 10 minutes to typeset on this machine!)

So all of Mike’s worries are over!
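
For the concrete $f(x) = (3 x + 1)^4$, the "obvious" factors are presumably $h(x) = 3 x + 1$ and $g(u) = u^4$. The following Python sketch checks the chain rule for this example using dual numbers, i.e. numbers $a + b\,\varepsilon$ with $\varepsilon^2 = 0$. This is ordinary forward-mode differentiation, offered only as an elementary stand-in in the spirit of the infinitesimal interval $D$ mentioned in #5; it is not the cohesive $\mathbf{d}$, and the names `Dual` and `derivative` are invented for the illustration.

```python
class Dual:
    """A number val + eps * e, where e is a formal infinitesimal with e * e = 0."""

    def __init__(self, val, eps=0.0):
        self.val = val
        self.eps = eps

    @staticmethod
    def lift(other):
        # Treat a plain number as a dual number with zero infinitesimal part.
        return other if isinstance(other, Dual) else Dual(other)

    def __add__(self, other):
        other = Dual.lift(other)
        return Dual(self.val + other.val, self.eps + other.eps)

    __radd__ = __add__

    def __mul__(self, other):
        other = Dual.lift(other)
        # (a + b e)(c + d e) = ac + (ad + bc) e, because e * e = 0.
        return Dual(self.val * other.val,
                    self.val * other.eps + self.eps * other.val)

    __rmul__ = __mul__

    def __pow__(self, n):
        # Nonnegative integer powers by repeated multiplication.
        result = Dual(1.0)
        for _ in range(n):
            result = result * self
        return result


def h(x):
    return 3 * x + 1


def g(u):
    return u ** 4


def f(x):
    return g(h(x))


def derivative(func, p):
    """Read off the derivative of func at p from the infinitesimal part."""
    return func(Dual(p, 1.0)).eps


lhs = derivative(f, 0.0)                          # derivative of the composite at 0
rhs = derivative(g, h(0.0)) * derivative(h, 0.0)  # g'(h(0)) * h'(0)
```

Here `f`, `g` and `h` are honest functions with no memory of a variable name, so the sketch sits on the $D f$ side of the notational question raised in #1.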
- CommentRowNumber8.
- CommentAuthorUrs
- CommentTimeNov 6th 2013
- PermaLink
Author: Urs
> I’m waiting for geometry of physics to load to read Prop. 26, but it takes about 10 minutes to typeset on this machine!

Oh, that’s a pain. For me it’s slow, but not quite this slow.

This is with math rendered by MathJax, I suppose? I suppose that in some other browser, and/or with the requisite fonts installed, it should take no extra time?

Once there was this vision that we would have decent math on the web. But somehow there still seems to be a long, long way to go, for some reason.
- CommentRowNumber9.
- CommentAuthorUrs
- CommentTimeNov 6th 2013
- PermaLink
Author: Urs
> So all of Mike’s worries are over!

While I suppose you are joking (right? :-) this reminds me that maybe we shouldn’t hijack Mike’s thread too much.

My reply to the topic here would be: since we are humans and not proof assistants, whenever we actually do some work we’ll adopt convenient “abuse of notation”. One should alert students as to what’s really going on, but I wouldn’t worry too much about enforcing a formally consistent notation.
- CommentRowNumber10.
- CommentAuthorDavid_Corfield
- CommentTimeNov 6th 2013
- PermaLink
Author: David_Corfield
It wasn’t completely a joke. There was something like the thought that if cohesive HoTT is the God-given way to do things, then it might suggest the least bad forms of “abuse of notation”.
- CommentRowNumber11.
- CommentAuthorMike Shulman
- CommentTimeNov 6th 2013
- PermaLink
Author: Mike Shulman
@hilbertthm60: I don’t understand how $f_y (x,y)$ is better. You still have the variable $y$ occurring in the notation $f_y$ for the partial derivative function. Am I misunderstanding?

@Urs: It’s true that we always abuse notation, but I find that the correct and incorrect ways to abuse notation are one of the hardest things for beginning math students to understand. I haven’t spent a lot of time thinking about it, but I’ve generally assumed that it’s not really possible to understand how to abuse notation until you understand “the way mathematics works” at a sufficiently deep level, so that when teaching students who don’t yet understand math, it’s better to try to avoid abusing notation as much as possible.
- CommentRowNumber12.
- CommentAuthorRodMcGuire
- CommentTimeNov 6th 2013
- PermaLink
Author: RodMcGuire
From the perspective of Logic Programming (and one particular variant of it), “variables” are a notational solution that conflates two things: labeling and binding.

Traditionally, formulas are written as linear strings of symbols that are parsed into trees. One use of variables is as external labels to note that parts of a “tree” share the same substructure, so that it is really not a tree.

For example, in the formula $x + x$ the two $x$s can be seen as the same substructure, and substituting “1” for “x” involves changing just one substructure, not two. The result of the substitution, “1 + 1”, has dropped any labeling that indicates that the two “1”s are really the same and came from the same place. One can explicitly use external labels for this situation, using a name followed by “?”. Then the two formulas would be notated as $x?\top + x?\top$ and $x?1 + x?1$, where $\top$ indicates that the first $x?$ structure is “unbound”.

Binding traditionally has two states: unbound, or bound to something totally specific. The alternative perspective is that “binding” is a continuum that takes place in a lattice of structures where $\top$ means “completely unknown” and $\bot$ means “totally contradictory”.

In an expression like $x+x$, $x$ is rarely completely unknown. Usually $x$ is known to be some specific type of number, but which exact number is unknown. One could even give $x$ an intermediate binding such as $1\vee 2$. Evaluating $x?(1\vee 2) + x?$ gives the result $2 \vee 4$, while if the structure is not shared, evaluating $x?(1\vee 2) + y?(1\vee 2)$ gives $2 \vee 3 \vee 4$.

There are two notions of substitution. “Binding substitution” or “unification” is used to make structures more specific. For example, $(x?\top + x?\top) \wedge (1 + \top)$ is unified to $x?1 + x?1$, while $(x?\top + x?\top) \wedge (1 + 2)$ becomes $x?\bot + x?\bot$ because $1\wedge 2$ becomes the contradictory structure.

Actual true substitution is rare in maths. Rarely does one substitute $2$ for $1$ in an expression like $1 + 1$ to give $2 + 2$, though there can be systems where some structure holding $1$ is regarded as a default value that can be overridden. The result of substituting $2$ for $1$ in $1 + 1$ depends on whether the two $1$s are the same structure or not: $1$ is not substituted for; instead, a possibly shared structure bound to $1$ gets rebound.
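
The shared-versus-unshared evaluation and the two unification examples can be sketched in a few lines of Python. This is only one possible reading of the description above, under loud assumptions: a partial binding such as $1 \vee 2$ is modelled as the set `{1, 2}`, $\top$ as `None`, $\bot$ as the empty set, and $\wedge$ as set intersection. The names `eval_sum`, `meet` and `unify` are invented for this illustration.

```python
from itertools import product

TOP = None            # "completely unknown"
BOTTOM = frozenset()  # "totally contradictory"


def eval_sum(labels, bindings):
    """Evaluate a sum whose summands are the labelled structures in `labels`.

    Occurrences with the same label share one structure, so they always
    take the same value; differently labelled structures vary independently.
    """
    distinct = sorted(set(labels))
    results = set()
    for choice in product(*(sorted(bindings[name]) for name in distinct)):
        assignment = dict(zip(distinct, choice))
        results.add(sum(assignment[name] for name in labels))
    return results


def meet(a, b):
    """Combine two bindings into the more specific one (the lattice meet)."""
    if a is TOP:
        return b
    if b is TOP:
        return a
    return frozenset(a) & frozenset(b)


def unify(labels, pattern, bindings):
    """Unify a labelled sum with a pattern, position by position."""
    result = dict(bindings)
    for name, constraint in zip(labels, pattern):
        result[name] = meet(result.get(name, TOP), constraint)
    return result


shared = eval_sum(["x", "x"], {"x": {1, 2}})                  # x?(1 v 2) + x?
unshared = eval_sum(["x", "y"], {"x": {1, 2}, "y": {1, 2}})   # x?(1 v 2) + y?(1 v 2)

bound = unify(["x", "x"], [{1}, TOP], {"x": TOP})             # against (1 + T)
clash = unify(["x", "x"], [{1}, {2}], {"x": TOP})             # against (1 + 2)
```

Under these assumptions `shared` contains 2 and 4 only, `unshared` also contains 3, `bound["x"]` is the single value 1, and `clash["x"]` is `BOTTOM`.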
- CommentRowNumber13.
- CommentAuthorMike Shulman
- CommentTimeNov 7th 2013
- PermaLink
Author: Mike Shulman
Rod, does any of that suggest an answer to my question?
- CommentRowNumber14.
- CommentAuthorRodMcGuire
- CommentTimeNov 7th 2013
- PermaLink
Author: RodMcGuire
> Rod, does any of that suggest an answer to my question?

It does to your question “What is a variable” :)

But to your more elaborate question, it seemed like you were complaining about “variables” that played the role of something like labels, flags, or indices into structures without being “true variables” that can get bound. I just wanted to mention an alternative way of thinking about variables, binding, and substitution which might fit your problem if I understood the math well enough. However, I got a little carried away in my typing.

And for those who are becoming Hegelian, I wanted to sneak in the opposition between “totally contradictory” and “completely unknown” :), though it doesn’t seem to involve adjoints.

I’m sorry.
- CommentRowNumber15.
- CommentAuthorMike Shulman
- CommentTimeNov 7th 2013
- PermaLink
Author: Mike Shulman
My question wasn’t really “what is a variable”; that was just the best short-ish title for this kind of rambly question that I could think of. I don’t *think* I was complaining about “variables that can’t be bound”, but maybe I was; can you explain further? What does your description have to say about notations for derivatives?
- CommentRowNumber16.
- CommentAuthorMichael_Bachtold
- CommentTimeNov 9th 2013
- (edited Nov 9th 2013)
- PermaLink
Author: Michael_Bachtold
Hello Mike,

you seem to suggest that these issues are resolved if we interpret $x$ as a function instead of a variable, a point of view I sympathize with. Unfortunately I don’t quite understand the arguments against this step. You seem to mention two:

> But while logically consistent, this seems to undercut the force of the lesson of dummy variables, since we are endowing $x$ with a special status not shared by $t$.

and

> With $x$ as a function we can also write “$f = x^2+1$”, which again is something that I’m used to indoctrinating my students against.

Correct me if I’m wrong, but both of these arguments disappear if we interpret $x$ as a function: in the first case, since the function $t$ might (or might not) be the same as the function $x$. (This seems in agreement with applications of calculus where different variables like time $t$ and position $x$ are not interchangeable.) The second argument is resolved since, for a function $y=f(x)$, all of the symbols $y$, $f(x)$, $f$ and $y(x)$ now denote the same thing. The notation $f(x)$ would be a different notation for the composition $f\circ x$ (with $x$ usually denoting the identity function).
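
A small Python sketch of this convention, under the assumption that "$x$ is a function" means $x$ is the identity map on $\mathbb{R}$ and that arithmetic on functions is pointwise. The class `Fn` and the helper `lift` are invented names for the illustration, not an established API.

```python
class Fn:
    """A real-valued function supporting pointwise arithmetic."""

    def __init__(self, rule):
        self.rule = rule

    def __call__(self, arg):
        # f(x) with x an Fn is read as the composition f o x;
        # f(3) with a plain number is ordinary evaluation.
        if isinstance(arg, Fn):
            return Fn(lambda s: self.rule(arg.rule(s)))
        return self.rule(arg)

    def __add__(self, other):
        other = lift(other)
        return Fn(lambda s: self.rule(s) + other.rule(s))

    def __pow__(self, n):
        return Fn(lambda s: self.rule(s) ** n)


def lift(value):
    """Treat a plain number as the constant function with that value."""
    return value if isinstance(value, Fn) else Fn(lambda s: value)


x = Fn(lambda s: s)       # the identity function, playing the role of "x"
f = x ** 2 + 1            # the function written "f = x^2 + 1"
y = f(x)                  # the composition f o x, which agrees with f pointwise
t = Fn(lambda s: 2 * s)   # some other function; here a rescaled coordinate
```

With these definitions `f(3)` and `y(3)` both give 10, whereas `f(t)` is a genuinely different function, which is the sense in which $t$ might or might not be the same as $x$.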

So maybe there are other arguments against this convention?

Another question: if $y$ and $x$ only denote variables, should the symbol $\frac{dy}{dx}$ denote a variable or a function?

Edit (after reading your question again): you also suggest that the “evil” notation is the primed one $y'$, which I agree with. Physicists have a convention of writing the dot, but only to denote derivatives with respect to time, so that doesn’t have the same ambiguity. Also, if $Df$ denotes the differential $df$ as Todd suggests, then that’s OK since it has a different meaning from $\frac{df}{dx}$.

Cheers Michael
- CommentRowNumber17.
- CommentAuthorTodd_Trimble
- CommentTimeNov 9th 2013
- PermaLink
Author: Todd_Trimble
Format: MarkdownItex

Sorry to enter this discussion so late. But Mike: it sort of looks like you've answered your own question back in #1! Is there a problem with just using $D f$ notation?

It's probably hard to tease apart for calculus students what is really going on, but suppose we start with a function $f: \mathbb{R} \to \mathbb{R}$ and apply the tangent bundle functor $(-)^D$ from SDG, keeping in mind the Kock-Lawvere axiom asserting an isomorphism $(pos, vel): \mathbb{R}^D \to \mathbb{R} \times \mathbb{R}$, where the first map is evaluation at the point $1 \to D$. Then notationally it seems quite all right to describe $f^D$ in terms of a mapping that is customarily denoted as
$$(p, v) \mapsto (f(p), D f|_p \cdot v)$$
and this carries over just fine to the multivariate setting. Maybe it's even a really good idea to implant the thought that the derivative $D f|_p$ (or $D f(p)$ if you'd prefer) is not really to be thought of as just a "number" but as a linear function (which can be characterized by a number in the 1-dimensional setting, by setting $v = 1$ and taking $D f(p) \cdot 1$)?

If this pedagogical point is to be driven home seriously, I suppose you could dismiss notations like $\frac{d f}{d x}$ as archaisms, reflecting an earlier age when notions of functions, variables, etc. hadn't been fully worked out.

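
As a rough illustration of the mapping $(p, v) \mapsto (f(p), D f|_p \cdot v)$, here is a minimal Python sketch using dual numbers, a standard computational stand-in for first-order infinitesimals. It is not taken from the thread: the names `Dual`, `_lift`, `pushforward` and the sample function `square_plus_one` are all hypothetical, and only addition and multiplication are supported.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Dual:
    """A pair (pos, vel): a point p together with a tangent vector v at p."""
    pos: float
    vel: float

    def __add__(self, other):
        other = _lift(other)
        return Dual(self.pos + other.pos, self.vel + other.vel)

    __radd__ = __add__

    def __mul__(self, other):
        other = _lift(other)
        # Product rule: the second-order term vel * vel is discarded.
        return Dual(self.pos * other.pos,
                    self.pos * other.vel + self.vel * other.pos)

    __rmul__ = __mul__


def _lift(value):
    """Treat a plain number as a constant, i.e. a pair with zero velocity."""
    return value if isinstance(value, Dual) else Dual(float(value), 0.0)


def pushforward(f):
    """Return the map (p, v) -> (f(p), Df|_p * v) induced by f."""
    def f_D(p, v):
        out = _lift(f(Dual(p, v)))
        return out.pos, out.vel
    return f_D


def square_plus_one(x):
    # Illustrative choice of f; any polynomial built from + and * would do.
    return x * x + 1


f_D = pushforward(square_plus_one)
print(f_D(3.0, 1.0))  # (10.0, 6.0): setting v = 1 recovers the number f'(3)
print(f_D(3.0, 2.0))  # (10.0, 12.0): the second component is linear in v
```

Note that `f_D` never mentions the name of the input variable of `f`; it only needs the point `p` and the tangent vector `v`.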
- CommentRowNumber18.
- CommentAuthorMichael_Bachtold
- CommentTimeNov 9th 2013
- PermaLink
Author: Michael_Bachtold
Format: MarkdownItex

Is what Todd denotes by $Df$ the same as $df$, the differential of the function $f$?

- CommentRowNumber19.
- CommentAuthorTodd_Trimble
- CommentTimeNov 9th 2013
- PermaLink
Author: Todd_Trimble
Format: MarkdownItex

Michael_B: as far as I've always seen it used, if $f: M \to N$ is a smooth mapping and $p \in M$, then $D f(p): T_p(M) \to T_{f(p)}(N)$ is the derivative mapping at $p$, also sometimes called the Jacobian at $p$. If $f$ is real-valued, then all the tangent spaces $T_{f(p)}(\mathbb{R})$ are canonically identified with $\mathbb{R}$, and the resultant linear functional $D f(p): T_p(M) \to \mathbb{R}$ is an element of the cotangent space of $M$ at $p$, also denoted $d f(p)$ as you say.

- CommentRowNumber20.
- CommentAuthorMike Shulman
- CommentTimeNov 10th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItex

@Michael, you seem to be saying that I should just give up on indoctrinating my students not to write $f = x^2+1$, and on teaching them that $f(x) = x^2$ and $f(t) = t^2$ define the same function? I think there *is* an important distinction to draw between the *values* of a function and the function as an object in its own right. Only once you really understand that are you justified in failing to notate it.

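
The distinction between a function and its values can be made concrete in a programming language, where the name of a parameter is visibly a dummy. The following Python sketch is only an illustration under stated assumptions: `f`, `g` and `D` are hypothetical names, and `D` uses a central-difference approximation (my choice, not anything proposed in the thread) so that it can return the derivative as a new function.

```python
def f(x):
    return x ** 2


def g(t):
    # Same rule as f; only the name of the dummy variable differs.
    return t ** 2


def D(func, h=1e-6):
    """Approximate derivative operator: takes a function, returns a function."""
    def derivative(a):
        # Central difference; an approximation, not an exact derivative.
        return (func(a + h) - func(a - h)) / (2 * h)
    return derivative


samples = [-2.0, 0.0, 1.5, 3.0]

# f and g agree on every sample input, whatever their parameters are called.
print(all(f(a) == g(a) for a in samples))

# D(f) is itself a function; D(f)(3.0) is one of its values, close to 6.
print(abs(D(f)(3.0) - 6.0) < 1e-6)
```

Here `D(f)` plays the role of $f'$ and `D(f)(3.0)` the role of $f'(3)$; nothing in `D` can see whether the input was called `x` or `t`.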
- CommentRowNumber21.
- CommentAuthorMichael_Bachtold
- CommentTimeNov 10th 2013
- PermaLink
Author: Michael_Bachtold
Format: MarkdownItex

Hello Mike. I guess that's what I'm saying :) Admittedly, I'm not sure it's a valid solution to the problem, either mathematically or pedagogically. But it seems to me that it corresponds more closely to the common practice in calculus.

Already a statement like $y=f(x)$ is mostly read as "y is a function of x", and less often as "y is the value of the function f at input x" (at least among engineers and physicists). Or I have seen statements like "suppose $V(t)$ is the volume of water at time t" together with a plot where the vertical axis is labelled $V$, without introducing an additional variable to denote the value of the function $V$.

Or consider when calculus textbooks discuss Cartesian coordinates $(x,y)$ and polar coordinates $(r,\theta)$ in the plane. The easiest way for me to think about this is by interpreting $x,y,r,\theta$ as functions on the plane. Wouldn't it be cumbersome to introduce additional names to denote the functions of which $x,y,r,\theta$ are the values?

Unfortunately I don't yet see clearly what trouble such a point of view causes. (Let's restrict to mathematical problems, since the pedagogical ones are hard to predict.)

Probably we do have to distinguish between the function and its value sometimes, but can't we still do this, say by obtaining the value of $x$ at $3$ by precomposing with the constant function $3$?

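
The reading of $x, y, r, \theta$ as functions on the plane, and of "the value at a point" as precomposition with a constant function, can be sketched in Python. This is only an illustration: points of the plane are modelled as pairs of floats, and the helpers `const` and `compose` are hypothetical names introduced here.

```python
import math


# Each coordinate is a function whose input is a point of the plane.
def x(p):
    return p[0]


def y(p):
    return p[1]


def r(p):
    return math.hypot(p[0], p[1])


def theta(p):
    return math.atan2(p[1], p[0])


def const(value):
    """The constant function with the given value, defined on any input."""
    return lambda _: value


def compose(g, h):
    """The composite g after h."""
    return lambda a: g(h(a))


point = (3.0, 4.0)

# "The value of r at the point" as r precomposed with a constant function.
r_at_point = compose(r, const(point))
print(r_at_point(None))  # 5.0

# The usual relation x = r cos(theta) holds as an identity between functions,
# checked here at one sample point.
print(math.isclose(r(point) * math.cos(theta(point)), x(point)))
```

The composite `r_at_point` ignores its argument, which is one way of saying that a value is a function that has stopped varying.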
- CommentRowNumber22.
- CommentAuthorMike Shulman
- CommentTimeNov 10th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItex

I agree entirely that it doesn't cause any problems mathematically. My point is entirely a pedagogical one.

- CommentRowNumber23.
- CommentAuthorTobyBartels
- CommentTimeNov 19th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItex

Like you, Mike, I try to teach my Calculus (and Algebra) students the difference between a function and its value, even though the textbooks do what they can to undermine my efforts. Of course, it is important to see the abuses of notation that they're liable to meet in applied fields, but (as you say) they have to understand the correct way first. So I tell them that the textbook abuses notation, but I try to never abuse it myself. In particular, I tell them that $y'$ is ambiguous, so I don't use it; they are allowed to (since the book does) but I recommend against it. I use $f'(x)$ and $\mathrm{d}y/\mathrm{d}x$, as you suggest, instead. (Besides that, $\mathrm{D}f(x)$ is all right, too; but since our book doesn't use it, I only mention it in passing.)

As for $\mathrm{d}f/\mathrm{d}x$, this is a bit subtle; $\mathrm{d}f(x)/\mathrm{d}x$ is just fine; in fact, if $y = f(x)$, then $\mathrm{d}y/\mathrm{d}x = \mathrm{d}f(x)/\mathrm{d}x = f'(x)$. But that doesn't make $\mathrm{d}f/\mathrm{d}x$ a legitimate synonym of $f'$! It's a matter of logic; like Rod #12 said, the two $x$s stand for the same thing, so you can't substitute for one without substituting for the other. Even this is subtle; if you substitute, say, $3$ for $x$ in $f'(x) = \mathrm{d}f(x)/\mathrm{d}x$, then you get $f'(3) = \mathrm{d}f(3)/\mathrm{d}3$, which doesn't quite work. On the other hand, if you substitute $t$ for $x$ instead (a change of variable), then $f'(t) = \mathrm{d}f(t)/\mathrm{d}t$ is fine. This works better with differentials; $\mathrm{d}f(x) = f'(x) \,\mathrm{d}x$ becomes $\mathrm{d}f(3) = f'(3) \,\mathrm{d}(3)$, which is fairly trivial but at least correct. The problem is that, in writing $\mathrm{d}f(x)/\mathrm{d}x$, you're tacitly assuming that $\mathrm{d}x$ is nonzero, that is that $x$ is a *variable* quantity[^1]. (This is the origin of the term ‘variable’, I believe, even though we now use that also for symbols that stand for constants.) So you should only substitute something variable for it. (But in $\mathrm{d}f(x) = f'(x) \,\mathrm{d}x$, no such assumption is being made.) This is really no more mysterious than why you can't substitute $1$ for $x$ in $(x^2 - 1)/(x - 1)$.

What does it mean for $x$ to stand for a *variable* quantity? Doesn't it stand for a real number? Yes ... but it stands for a *variable* real number. This brings us to the question in the title: what is a variable? Lawvere said that a variable can be any morphism in any category; then a variable real number (aka a real-valued variable or simply a real variable) is a morphism whose target is the space[^2] of real numbers. For some reason, the only field in which this penetrates the undergraduate curriculum is statistics; there, they know that a random variable is a measurable function on a measurable space (typically valued in the measurable space of real numbers with the Borel $\sigma$-algebra), a morphism in the category of measurable spaces and measurable functions (or something like it). Even in an elementary treatment where every random variable is defined on a finite space and the word ‘measurable’ is never uttered, they still give a definition of ‘random variable’. In Calculus, we usually study smooth variables (or smoothly varying quantities), which are morphisms in the category of smooth manifolds and smooth functions (or something like it). This is actually the first lesson in my Applied Calculus course: what a smooth variable is (very roughly, of course). (In the regular Calculus course, we don't usually assume that everything is smooth, so I have to bring this in later.)

So $x$, $y$, $t$, $u$, etc. are all variables (usually smooth ones). What then is $f$? It's a function, but I mean ‘function’ in the sense used in elementary algebra, that is a partial function (a partial morphism in the category of sets) from $\mathbb{R}$ to $\mathbb{R}$, usually a smooth one (so infinitely differentiable wherever defined). Quantities like $f(x)$ are defined at the formal level by composition; we don't write this as $f \circ x$ because we conceptually distinguish variables (with arbitrary unspecified domain) from functions (with domain a specified subset of $\mathbb{R}$). So (pace Michael #21) $x$, $y$, $r$, and $\theta$ may indeed be functions on the plane, but we're not treating them in the same way as $f$, so they use different notation. (And then they might not be functions on the plane; if you're really studying the motion of a particle in the plane, they might be better thought of as functions of time.)

I won't even get into the problems with notation for second derivatives and multivariable calculus. In general, all of this stuff works better with differentials than with derivatives (right down to the terms ‘differential’ and ‘derivative’), but I've said enough for now.

[^1]: Well, a nonstationary variable quantity.

[^2]: Here, a space is simply an object of whatever category is relevant. The category of sets and functions may or may not be the most appropriate category.
Like you, Mike, I try to teach my Calculus (and Algebra) students the difference between a function and its value, even though the textbooks do what they can to undermine my efforts. Of course, it is important to see the abuses of notation that they're liable to meet in applied fields, but (as you say) they have to understand the correct way first. So I tell them that the textbook abuses notation, but I try to never abuse it myself. In particular, I tell them that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi><mo>′</mo></mrow><annotation encoding="application/x-tex">y'</annotation></semantics></math>$ is ambiguous, so I don't use it; they are allowed to (since the book does) but I recommend against it. I use $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f'(x)</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>y</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}y/\mathrm{d}x</annotation></semantics></math>$ , as you suggest, instead. (Besides that, $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">D</mi><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">\mathrm{D}f(x)</annotation></semantics></math>$ is all right, too; but since our book doesn't use it, I only mention it in passing.)

As for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}f/\mathrm{d}x</annotation></semantics></math>$ , this is a bit subtle; $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}f(x)/\mathrm{d}x</annotation></semantics></math>$ is just fine; in fact, if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi><mo>=</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">y = f(x)</annotation></semantics></math>$ , then $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi><mo>=</mo><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi><mo>=</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">y = \mathrm{d}f(x)/\mathrm{d}x = f'(x)</annotation></semantics></math>$ . 
But that doesn't make $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}f/\mathrm{d}x</annotation></semantics></math>$ a legitimate synonym of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>′</mo></mrow><annotation encoding="application/x-tex">f'</annotation></semantics></math>$ ! It's a matter of logic; like Rod #12 said, the two $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ s stand for the same thing, so you can't substitute for one without substituting for the other. Even this is subtle; if you substitute, say, $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>3</mn></mrow><annotation encoding="application/x-tex">3</annotation></semantics></math>$ for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">f'(x) = \mathrm{d}f(x)/\mathrm{d}x</annotation></semantics></math>$ , then you get $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mn>3</mn><mo stretchy="false">)</mo><mo>=</mo><mi mathvariant="normal">d</mi><mi>f</mi><mo 
stretchy="false">(</mo><mn>3</mn><mo stretchy="false">)</mo><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mn>3</mn></mrow><annotation encoding="application/x-tex">f'(3) = \mathrm{d}f(3)/\mathrm{d}3</annotation></semantics></math>$ , which doesn't quite work. On the other hand, if you substitute $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>t</mi></mrow><annotation encoding="application/x-tex">t</annotation></semantics></math>$ for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ instead (a change of variable), then $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>t</mi><mo stretchy="false">)</mo><mo>=</mo><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">(</mo><mi>t</mi><mo stretchy="false">)</mo><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>t</mi></mrow><annotation encoding="application/x-tex">f'(t) = \mathrm{d}f(t)/\mathrm{d}t</annotation></semantics></math>$ is fine. 
This works better with differentials; $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}f(x) = f'(x) \,\mathrm{d}x</annotation></semantics></math>$ becomes $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">(</mo><mn>3</mn><mo stretchy="false">)</mo><mo>=</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mn>3</mn><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mo stretchy="false">(</mo><mn>3</mn><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">\mathrm{d}f(3) = f'(3) \,\mathrm{d}(3)</annotation></semantics></math>$ , which is fairly trivial but at least correct. The problem is that, in writing $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}f(x)/\mathrm{d}x</annotation></semantics></math>$ , you're tacitly assuming that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}x</annotation></semantics></math>$ is nonzero, that is that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ is a variable quantity¹. 
(This is the origin of the term ‘variable’, I believe, even though we now use that also for symbols that stand for constants.) So you should only substitute something variable for it. (But in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}f(x) = f'(x) \,\mathrm{d}x</annotation></semantics></math>$ , no such assumption is being made.) This is really no more mysterious than why you can't substitute $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><msup><mi>x</mi> <mn>2</mn></msup><mo>−</mo><mn>1</mn><mo stretchy="false">)</mo><mo stretchy="false">/</mo><mo stretchy="false">(</mo><mi>x</mi><mo>−</mo><mn>1</mn><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">(x^2 - 1)/(x - 1)</annotation></semantics></math>$ .
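
To make the contrast concrete, here is a small worked illustration. The choice $f(x) = x^2$ and the substitution $x = t^2$ are assumptions made purely for this example and are not part of the original comment. The differential identity survives both substitutions, while the quotient only survives the variable one.

```latex
% Assumed example: f(x) = x^2, so f'(x) = 2x.
\begin{align*}
  \mathrm{d}f(x) &= 2x \,\mathrm{d}x
    && \text{(general identity)} \\
  \mathrm{d}(9) &= 6 \,\mathrm{d}(3)
    && \text{(substitute the constant $x = 3$: both sides are $0$)} \\
  \mathrm{d}(t^4) &= 2t^2 \,\mathrm{d}(t^2) = 4t^3 \,\mathrm{d}t
    && \text{(substitute the variable $x = t^2$)}
\end{align*}
% The quotient d f(x)/dx = 2x still makes sense for x = t^2 wherever
% d(t^2) = 2t dt is nonzero, but it degenerates to 0/0 for x = 3.
```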

What does it mean for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ to stand for a variable quantity? Doesn't it stand for a real number? Yes … but it stands for a variable real number. This brings us to the question in the title: what is a variable? Lawvere said that a variable can be any morphism in any category; then a variable real number (aka a real-valued variable or simply a real variable) is a morphism whose target is the space² of real numbers. For some reason, the only field in which this penetrates the undergraduate curriculum is statistics; there, they know that a random variable is a measurable function on a measurable space (typically valued in the measurable space of real numbers with its Borel σ-algebra), a morphism in the category of measurable spaces and measurable functions (or something like it). Even in an elementary treatment where every random variable is defined on a finite space and the word ‘measurable’ is never uttered, they still give a definition of ‘random variable’. In Calculus, we usually study smooth variables (or smoothly varying quantities), which are morphisms in the category of smooth manifolds and smooth functions (or something like it). This is actually the first lesson in my Applied Calculus course: what a smooth variable is (very roughly, of course). (In the regular Calculus course, we don't usually assume that everything is smooth, so I have to bring this in later.)

So $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ , $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi></mrow><annotation encoding="application/x-tex">y</annotation></semantics></math>$ , $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>t</mi></mrow><annotation encoding="application/x-tex">t</annotation></semantics></math>$ , $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi></mrow><annotation encoding="application/x-tex">u</annotation></semantics></math>$ , etc are all variables (usually smooth ones). What then is $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ ? It's a function, but I mean ‘function’ in the sense used in elementary algebra, that is a partial function (a partial morphism in the category of sets) from $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">\mathbb{R}</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">\mathbb{R}</annotation></semantics></math>$ , usually a smooth one (so infinitely differentiable wherever defined). 
Quantities like $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f(x)</annotation></semantics></math>$ are defined at the formal level by composition; we don't write this as $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>∘</mo><mi>x</mi></mrow><annotation encoding="application/x-tex">f \circ x</annotation></semantics></math>$ because we conceptually distinguish variables (with arbitrary unspecified domain) from functions (with domain a specified subset of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">\mathbb{R}</annotation></semantics></math>$ ). So (pace Michael #21) $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ , $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi></mrow><annotation encoding="application/x-tex">y</annotation></semantics></math>$ , $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>r</mi></mrow><annotation encoding="application/x-tex">r</annotation></semantics></math>$ , and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>θ</mi></mrow><annotation encoding="application/x-tex">\theta</annotation></semantics></math>$ may indeed be functions on the plane, but we're not treating them in the same way as $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ , so they use different notation. 
(And then they might not be functions on the plane; if you're really studying the motion of a particle in the plane, they might be better thought of as functions of time.)

I won't even get into the problems with notation for second derivatives and multivariable calculus. In general, all of this stuff works better with differentials than with derivatives (right down to the terms ‘differential’ and ‘derivative’), but I've said enough for now.
1. Well, a nonstationary variable quantity. ↩
2. Here, a space is simply an object of whatever category is relevant. The category of sets and functions may or may not be the most appropriate category. ↩
- CommentRowNumber24.
- CommentAuthorMichael_Bachtold
- CommentTimeNov 19th 2013
- PermaLink
Author: Michael_Bachtold
Format: MarkdownItex

@Toby: Interesting comments.

Just a few questions

1) How do you get the point across to students that sometimes $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ stands for the value of a function (or constant, as in your first paragraph) and sometimes $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ represents a variable quantity (morphism in a category) as in your paragraphs 3 and 4? Especially if we use the same symbol.

2) Why is it important to make a conceptual distinction between functions with unspecified domains (like x,t,r) and functions whose domains are subsets of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>ℝ</mi> <mi>n</mi></msup></mrow><annotation encoding="application/x-tex">\mathbb{R}^n</annotation></semantics></math>$ like $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ , and use different notation for their compositions?

Another remark: from the perspective of interpreting $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>,</mo><mi>y</mi><mo>,</mo><mi>t</mi></mrow><annotation encoding="application/x-tex">x,y,t</annotation></semantics></math>$ etc. as morphisms in a suitable category, a common abuse of notation I see is that the pullback of a function along a map often gets denoted by the same symbol. So, for example, when describing the motion of a particle in the plane by saying $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>,</mo><mi>y</mi></mrow><annotation encoding="application/x-tex">x,y</annotation></semantics></math>$ are functions of time, I’d interpret this as saying $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>,</mo><mi>y</mi></mrow><annotation encoding="application/x-tex">x,y</annotation></semantics></math>$ are actually the pullbacks of the standard coordinates $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>,</mo><mi>y</mi></mrow><annotation encoding="application/x-tex">x,y</annotation></semantics></math>$ on the plane to the 1-dimensional manifold representing time, along the map given by the motion.
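
One way to spell out the pullback reading is the sketch below. The motion is an assumed one (uniform motion around the unit circle), and the name $\gamma$ is introduced here only for illustration.

```latex
% Assumed example: \gamma is a motion along the unit circle, t is the time coordinate.
\begin{align*}
  \gamma &\colon \mathbb{R} \to \mathbb{R}^2, \qquad \gamma(t) = (\cos t, \sin t) \\
  \gamma^* x &= x \circ \gamma = \cos t, \qquad
  \gamma^* y = y \circ \gamma = \sin t \\
  \gamma^*(x^2 + y^2) &= \cos^2 t + \sin^2 t = 1
\end{align*}
% The usual abuse of notation writes x = \cos t and y = \sin t,
% using the same letters for the plane coordinates and for their pullbacks.
```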
- CommentRowNumber25.
- CommentAuthorMichael_Bachtold
- CommentTimeNov 19th 2013
- PermaLink
Author: Michael_Bachtold
Format: MarkdownItex

I think I have a partial answer to 2): functions with values in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">\mathbb{R}</annotation></semantics></math>$ but unspecified domains cannot be composed with one another, but we may always compose them with functions from $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">\mathbb{R}</annotation></semantics></math>$ to itself. Is that the reason?
- CommentRowNumber26.
- CommentAuthorMike Shulman
- CommentTimeNov 19th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItex

Thanks Toby! I think you’ve resolved my confusion by saying that the manifold in question should be an arbitrary one (the domain of generalized elements), not even necessarily 1-dimensional. I hadn’t thought about $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>d</mi><mspace width="0.16667em"/><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">/</mo><mi>dx</mi></mrow><annotation encoding="application/x-tex">d\, f(x) /dx</annotation></semantics></math>$ , but you’re right that that makes perfect sense.

My explanation of substitution would have been a bit different. I would say that when $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ is a variable, then $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dx</mi></mrow><annotation encoding="application/x-tex">dx</annotation></semantics></math>$ is a new variable, albeit one which happens to be related to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ in a certain way. (In other words, we now consider generalized elements of the tangent bundle $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>T</mi><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">T\mathbb{R}</annotation></semantics></math>$ rather than the manifold $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">\mathbb{R}</annotation></semantics></math>$ .) So substituting 3 for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ doesn’t make $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dx</mi></mrow><annotation encoding="application/x-tex">dx</annotation></semantics></math>$ into $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>d</mi><mo stretchy="false">(</mo><mn>3</mn><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">d(3)</annotation></semantics></math>$ . 
Rather, it just means that instead of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dx</mi></mrow><annotation encoding="application/x-tex">dx</annotation></semantics></math>$ being a small variation about $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ , it is a small variation about 3.

I would really like to hear what you have to say about second derivatives and multivariable calculus, if you have time sometime to share it. I have recently discovered another problem with the $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>′</mo></mrow><annotation encoding="application/x-tex">f'</annotation></semantics></math>$ notation: prime is a very small symbol and hard to see from the back of the room! (-:
- CommentRowNumber27.
- CommentAuthorZhen Lin
- CommentTimeNov 19th 2013
- PermaLink
Author: Zhen Lin
Format: MarkdownItex

The second derivative is not really a notion that goes well with differential forms. After all, $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">\mathrm{d}^2 = 0</annotation></semantics></math>$ ! In the one-variable case one can cheat and observe that the tangent bundle is generated by a global section $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d} x</annotation></semantics></math>$ , so there is a unique function $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mfrac><mrow><mi mathvariant="normal">d</mi><mi>f</mi></mrow><mrow><mi mathvariant="normal">d</mi><mi>x</mi></mrow></mfrac></mrow><annotation encoding="application/x-tex">\frac{\mathrm{d} f}{\mathrm{d} x}</annotation></semantics></math>$ such that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi><mo>=</mo><mfrac><mrow><mi mathvariant="normal">d</mi><mi>f</mi></mrow><mrow><mi mathvariant="normal">d</mi><mi>x</mi></mrow></mfrac><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d} f = \frac{\mathrm{d} f}{\mathrm{d} x} \mathrm{d} x</annotation></semantics></math>$ , and since $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mfrac><mrow><mi mathvariant="normal">d</mi><mi>f</mi></mrow><mrow><mi mathvariant="normal">d</mi><mi>x</mi></mrow></mfrac></mrow><annotation encoding="application/x-tex">\frac{\mathrm{d} f}{\mathrm{d} x}</annotation></semantics></math>$ is a function, we can talk about its derivative again. (Note, we can do this equally well on the real line or the circle!) 
This cheat can be made to work in the many-variable case provided the tangent bundle is trivial, but it’s far more obvious that one is doing some very coordinate-dependent things then. (One thing that is always very confusing is that the partial differential operator $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mfrac><mo>∂</mo><mrow><mo>∂</mo><msup><mi>x</mi> <mi>i</mi></msup></mrow></mfrac></mrow><annotation encoding="application/x-tex">\frac{\partial}{\partial x^i}</annotation></semantics></math>$ depends on the choice of the other coordinates as well, despite the notation! This is in contrast to the differential 1-form $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><msup><mi>x</mi> <mi>i</mi></msup></mrow><annotation encoding="application/x-tex">\mathrm{d} x^i</annotation></semantics></math>$ , which depends only on the coordinate function $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>x</mi> <mi>i</mi></msup></mrow><annotation encoding="application/x-tex">x^i</annotation></semantics></math>$ .)
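
A small worked case may make the parenthetical remark easier to see. The two coordinate systems below are assumptions chosen for illustration; they share their first coordinate but differ in the second.

```latex
% Assumed example: two coordinate systems on the plane with the same first coordinate.
\begin{align*}
  (u, v) &= (x,\; x + y)
    && \text{(so $x = u$ and $y = v - u$)} \\
  \mathrm{d}u &= \mathrm{d}x
    && \text{(the 1-form depends only on the function $u = x$)} \\
  \frac{\partial}{\partial u} &= \frac{\partial}{\partial x} - \frac{\partial}{\partial y}
    && \text{(the operator also depends on the choice of $v$)}
\end{align*}
% For instance, applied to the function y = v - u:
%   \partial y / \partial x = 0  (holding y fixed),
%   \partial y / \partial u = -1 (holding v fixed),
% even though u and x are the same function.
```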

I think one is supposed to think about jet bundles if one wants to do higher-order derivatives. But that seems a step too far for first-year calculus.
- CommentRowNumber28.
- CommentAuthorMike Shulman
- CommentTimeNov 19th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItex

Certainly jet bundles are themselves a step too far for first-year calculus, but then so are tangent bundles. The trick would be to find a way to have them in the background causing things to make sense, but not needing to be mentioned explicitly.

The most direct thing to see is the iterated tangent bundle: if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dx</mi></mrow><annotation encoding="application/x-tex">dx</annotation></semantics></math>$ is a variable element of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>T</mi><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">T\mathbb{R}</annotation></semantics></math>$ , then $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>d</mi><mo stretchy="false">(</mo><mi>dx</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">d(dx)</annotation></semantics></math>$ is a variable element of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>T</mi><mo stretchy="false">(</mo><mi>T</mi><mi>ℝ</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">T(T\mathbb{R})</annotation></semantics></math>$ . But unfortunately that is a bit bigger than the jet bundle…
- CommentRowNumber29.
- CommentAuthorMike Shulman
- CommentTimeNov 19th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItex

I think that the next time I teach calculus, I’m going to bring in differentials and linear approximations much earlier. This time I waited to use them as an explanation for the chain rule, but then once we had them I found that I liked using them for everything else. (Thanks Toby for stressing this point, here and elsewhere!) From that point of view, the ordinary differential is determined by a linear approximation
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo>+</mo><mi>dx</mi><mo stretchy="false">)</mo><mo>≃</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mi>d</mi><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex"> f(x+dx) \simeq f(x) + d(f(x)) </annotation></semantics></math>$
to first order in dx, so an appropriate meaning of “second differential” would be a quadratic approximation
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo>+</mo><mi>dx</mi><mo stretchy="false">)</mo><mo>≃</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mi>d</mi><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mo>+</mo><mfrac><mn>1</mn><mrow><mn>2</mn><mo>!</mo></mrow></mfrac><msup><mi>d</mi> <mn>2</mn></msup><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex"> f(x+dx) \simeq f(x) + d(f(x)) + \frac{1}{2!} d^2(f(x)) </annotation></semantics></math>$
to second order in dx. And with this meaning of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>d</mi> <mn>2</mn></msup><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">d^2(f(x))</annotation></semantics></math>$ we do have a literal quotient
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>f</mi><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mfrac><mrow><msup><mi>d</mi> <mn>2</mn></msup><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo></mrow><mrow><msup><mi>dx</mi> <mn>2</mn></msup></mrow></mfrac><mo>.</mo></mrow><annotation encoding="application/x-tex">f''(x) = \frac{d^2(f(x))}{dx^2}.</annotation></semantics></math>$
I haven’t yet worked out the best way to explain “first order in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dx</mi></mrow><annotation encoding="application/x-tex">dx</annotation></semantics></math>$ ”, though. And I don’t know what I would say to explain the $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>2</mn><mo>!</mo></mrow><annotation encoding="application/x-tex">2!</annotation></semantics></math>$ , either.
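
As a sanity check on the proposed meaning of the second differential, here is a worked instance. The choice $f(x) = x^3$ is an assumption made only for this illustration, since its expansion in $dx$ is exact and finite.

```latex
% Assumed example: f(x) = x^3, so f'(x) = 3x^2 and f''(x) = 6x.
\begin{align*}
  f(x + dx) &= x^3 + 3x^2 \,dx + 3x \,dx^2 + dx^3 \\
  d(f(x)) &= 3x^2 \,dx
    && \text{(the part linear in $dx$)} \\
  \tfrac{1}{2!} \,d^2(f(x)) &= 3x \,dx^2
    && \text{(the part quadratic in $dx$)} \\
  \frac{d^2(f(x))}{dx^2} &= 6x = f''(x)
\end{align*}
% The dx^3 term is what "to second order in dx" discards.
```

Without the factor $\frac{1}{2!}$, the quotient in this example would come out as $3x$ rather than $f''(x) = 6x$, which is one possible way to motivate it.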
- CommentRowNumber30.
- CommentAuthorTobyBartels
- CommentTimeNov 20th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexZhen Lin #27 has succinctly pointed out the problems. Here is a more explicit problem with the usual notation for second derivatives: it breaks the chain rule. If $u = f(x)$ and $y = g(u)$, then $$ \mathrm{d}y = g'(u) \,\mathrm{d}u = g'(u) \,(f'(x) \,\mathrm{d}x) = g'(f(x)) \,f'(x) \,\mathrm{d}x $$ works out fine; it gives $$ (g \circ f)'(x) = \mathrm{d}y/\mathrm{d}x = g'(f(x)) \,f'(x) $$ as it should. But (using the proposed second differential from Mike #29, which is implicitly endorsed by the notation $\mathrm{d}^2{y}/\mathrm{d}x^2$) $$ \mathrm{d}^2{y} = g''(u) \,\mathrm{d}u^2 = g''(u) \,(f'(x) \,\mathrm{d}x)^2 = g''(f(x)) \,f'(x)^2 \,\mathrm{d}x^2 $$ is no good; it gives $$ (g \circ f)''(x) = \mathrm{d}^2{y}/\mathrm{d}x^2 = g''(f(x)) \,f'(x)^2 ,$$ which is incorrect. The correct formula is $$ (g \circ f)''(x) = g''(f(x)) \,f'(x)^2 + g'(f(x)) \,f''(x) .$$ For this reason I never write $\mathrm{d}^2{y}/\mathrm{d}x^2$ in class (except once, about the same time that I write $y'$, to warn against it); I write $(\mathrm{d}/\mathrm{d}x)^2{y}$ (or even $\mathrm{d}(\mathrm{d}y/\mathrm{d}x)/\mathrm{d}x$, when it comes naturally) instead. More simply (but you might not believe it if I make it so simple right away), $\mathrm{d}^2{u}/\mathrm{d}u^2 = 0$ implies $\mathrm{d}^2{u} = 0$, which implies $\mathrm{d}^2{u}/\mathrm{d}x^2 = 0$, and that's not what we want at all.

Zhen Lin #27 has succinctly pointed out the problems.

Here is a more explicit problem with the usual notation for second derivatives: it breaks the chain rule. If $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi><mo>=</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">u = f(x)</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi><mo>=</mo><mi>g</mi><mo stretchy="false">(</mo><mi>u</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">y = g(u)</annotation></semantics></math>$ , then
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi mathvariant="normal">d</mi><mi>y</mi><mo>=</mo><mi>g</mi><mo>′</mo><mo stretchy="false">(</mo><mi>u</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>u</mi><mo>=</mo><mi>g</mi><mo>′</mo><mo stretchy="false">(</mo><mi>u</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mo stretchy="false">(</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>g</mi><mo>′</mo><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex"> \mathrm{d}y = g'(u) \,\mathrm{d}u = g'(u) \,(f'(x) \,\mathrm{d}x) = g'(f(x)) \,f'(x) \,\mathrm{d}x </annotation></semantics></math>$
works out fine; it gives
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mo stretchy="false">(</mo><mi>g</mi><mo>∘</mo><mi>f</mi><mo stretchy="false">)</mo><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi mathvariant="normal">d</mi><mi>y</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi><mo>=</mo><mi>g</mi><mo>′</mo><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex"> (g \circ f)'(x) = \mathrm{d}y/\mathrm{d}x = g'(f(x)) \,f'(x) </annotation></semantics></math>$
as it should. But (using the proposed second differential from Mike #29, which is implicitly endorsed by the notation $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>y</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><msup><mi>x</mi> <mn>2</mn></msup></mrow><annotation encoding="application/x-tex">\mathrm{d}^2{y}/\mathrm{d}x^2</annotation></semantics></math>$ )
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>y</mi><mo>=</mo><mi>g</mi><mo>″</mo><mo stretchy="false">(</mo><mi>u</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><msup><mi>u</mi> <mn>2</mn></msup><mo>=</mo><mi>g</mi><mo>″</mo><mo stretchy="false">(</mo><mi>u</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mo stretchy="false">(</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup><mo>=</mo><mi>g</mi><mo>″</mo><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><msup><mi>x</mi> <mn>2</mn></msup></mrow><annotation encoding="application/x-tex"> \mathrm{d}^2{y} = g''(u) \,\mathrm{d}u^2 = g''(u) \,(f'(x) \,\mathrm{d}x)^2 = g''(f(x)) \,f'(x)^2 \,\mathrm{d}x^2 </annotation></semantics></math>$
is no good; it gives
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mo stretchy="false">(</mo><mi>g</mi><mo>∘</mo><mi>f</mi><mo stretchy="false">)</mo><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>y</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><msup><mi>x</mi> <mn>2</mn></msup><mo>=</mo><mi>g</mi><mo>″</mo><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup><mo>,</mo></mrow><annotation encoding="application/x-tex"> (g \circ f)''(x) = \mathrm{d}^2{y}/\mathrm{d}x^2 = g''(f(x)) \,f'(x)^2 ,</annotation></semantics></math>$
which is incorrect. The correct formula is
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mo stretchy="false">(</mo><mi>g</mi><mo>∘</mo><mi>f</mi><mo stretchy="false">)</mo><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>g</mi><mo>″</mo><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup><mo>+</mo><mi>g</mi><mo>′</mo><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi>f</mi><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>.</mo></mrow><annotation encoding="application/x-tex"> (g \circ f)''(x) = g''(f(x)) \,f'(x)^2 + g'(f(x)) \,f''(x) .</annotation></semantics></math>$
For this reason I never write $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>y</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><msup><mi>x</mi> <mn>2</mn></msup></mrow><annotation encoding="application/x-tex">\mathrm{d}^2{y}/\mathrm{d}x^2</annotation></semantics></math>$ in class (except once, about the same time that I write $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi><mo>′</mo></mrow><annotation encoding="application/x-tex">y'</annotation></semantics></math>$ , to warn against it); I write $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi mathvariant="normal">d</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup><mi>y</mi></mrow><annotation encoding="application/x-tex">(\mathrm{d}/\mathrm{d}x)^2{y}</annotation></semantics></math>$ (or even $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mo stretchy="false">(</mo><mi mathvariant="normal">d</mi><mi>y</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}(\mathrm{d}y/\mathrm{d}x)/\mathrm{d}x</annotation></semantics></math>$ , when it comes naturally) instead.

More simply (but you might not believe it if I make it so simple right away), $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>u</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><msup><mi>u</mi> <mn>2</mn></msup><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">\mathrm{d}^2{u}/\mathrm{d}u^2 = 0</annotation></semantics></math>$ implies $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>u</mi><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">\mathrm{d}^2{u} = 0</annotation></semantics></math>$ , which implies $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>u</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><msup><mi>x</mi> <mn>2</mn></msup><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">\mathrm{d}^2{u}/\mathrm{d}x^2 = 0</annotation></semantics></math>$ , and that's not what we want at all.
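A quick numerical sanity check of the two formulas above can be sketched as follows. The particular functions ($f(x) = x^2$, $g(u) = \sin u$), the evaluation point, and the helper name `second_derivative` are illustrative assumptions, not taken from the thread; the second derivative of $g \circ f$ is estimated by a central finite difference and compared with the naive and the correct expressions.

```python
import math


def second_derivative(func, x, step=1e-4):
    """Central finite-difference estimate of func''(x) (hypothetical helper)."""
    return (func(x + step) - 2.0 * func(x) + func(x - step)) / step ** 2


# Illustrative choices (assumptions, not from the thread):
# u = f(x) = x^2 and y = g(u) = sin(u), with derivatives written out by hand.
def f(x):
    return x * x


def f1(x):  # f'
    return 2.0 * x


def f2(x):  # f''
    return 2.0


def g(u):
    return math.sin(u)


def g1(u):  # g'
    return math.cos(u)


def g2(u):  # g''
    return -math.sin(u)


def composite(x):
    return g(f(x))


x0 = 1.0
numeric = second_derivative(composite, x0)

# Naive result of treating d^2y = g''(u) du^2 as a chain rule.
naive = g2(f(x0)) * f1(x0) ** 2

# Correct second-derivative chain rule: the naive term plus g'(f(x)) f''(x).
correct = naive + g1(f(x0)) * f2(x0)

print("finite difference:", numeric)
print("naive formula:    ", naive)
print("correct formula:  ", correct)
```

With these choices the finite-difference estimate should agree with the correct formula and differ from the naive one by the missing term $g'(f(x))\,f''(x)$.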
- CommentRowNumber31.
- CommentAuthorTobyBartels
- CommentTimeNov 20th 2013
- PermaLink

@ Michael #24:

> Why is it important to make a conceptual distinction between functions with unspecified domains (like $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>,</mo><mi>t</mi><mo>,</mo><mi>r</mi></mrow><annotation encoding="application/x-tex">x,t,r</annotation></semantics></math>$) and functions with domains subset of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>ℝ</mi> <mi>n</mi></msup></mrow><annotation encoding="application/x-tex">\mathbb{R}^n</annotation></semantics></math>$ like $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$, and use different notation for their compositions?

Partly because function notation is so convenient, yet it requires a domain, and sometimes we don't want to specify it (or even to imply that it's a subset of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>ℝ</mi> <mi>n</mi></msup></mrow><annotation encoding="application/x-tex">\mathbb{R}^n</annotation></semantics></math>$ , much less a subset of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">\mathbb{R}</annotation></semantics></math>$ in first-term Calculus). So I want to write $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mn>3</mn><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f(3)</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mn>3</mn><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f'(3)</annotation></semantics></math>$ ; of course, I can also write $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mrow><mi>y</mi><msub><mo stretchy="false">|</mo> <mrow><mi>x</mi><mo>=</mo><mn>3</mn></mrow></msub></mrow></mrow><annotation encoding="application/x-tex">{y|_{x=3}}</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mrow><mo stretchy="false">(</mo><mi mathvariant="normal">d</mi><mi>y</mi><mo stretchy="false">/</mo><mi mathvariant="normal">d</mi><mi>x</mi><mo stretchy="false">)</mo><msub><mo stretchy="false">|</mo> <mrow><mi>x</mi><mo>=</mo><mn>3</mn></mrow></msub></mrow></mrow><annotation encoding="application/x-tex">{(\mathrm{d}y/\mathrm{d}x)|_{x=3}}</annotation></semantics></math>$ … but I can also write $<math xmlns="http://www.w3.org/1998/Math/MathML" 
display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo>+</mo><mn>1</mn><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f(x+1)</annotation></semantics></math>$ , while I can't write $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mrow><mi>y</mi><msub><mo stretchy="false">|</mo> <mrow><mi>x</mi><mo>=</mo><mi>x</mi><mo>+</mo><mn>1</mn></mrow></msub></mrow></mrow><annotation encoding="application/x-tex">{y|_{x=x+1}}</annotation></semantics></math>$ . I guess that this is basically what you said, the ability to compose functions.

The distinction is especially relevant in applications. Here, the domain of the variables is a vaguely unspecified space of states of the world. The textbooks sometimes encourage us to identify this space (often it can be identified with the time line, for example), so that there is a single independent variable (or a few in the multivariable case) of which every other variable is a function. But the whole point of the Chain Rule is that the independent variable is irrelevant! It's sufficient that some choice of independent variables is possible (that is that some space of states can be assumed to exist), but it's completely unnecessary to actually make this choice. So I don't actually want to say that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>,</mo><mi>y</mi><mo>,</mo><mi>t</mi></mrow><annotation encoding="application/x-tex">x, y, t</annotation></semantics></math>$ etc are functions at all (to the students, for whom a function is defined on a subset of some $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>ℝ</mi> <mi>n</mi></msup></mrow><annotation encoding="application/x-tex">\mathbb{R}^n</annotation></semantics></math>$ ). Yet functions like $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi>x</mi><mo>↦</mo><msup><mi mathvariant="normal">e</mi> <mi>x</mi></msup><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">(x \mapsto \mathrm{e}^x)</annotation></semantics></math>$ are also around, and I want to refer to them from time to time too.

> How do you bring the point across to students that sometimes $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ stands for the value of a function (or constant, as in your first paragraph) and sometimes $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ represents a variable quantity (morphism in a category) as in your paragraphs 3 and 4? Especially if we use the same symbol.

What's happening at the most basic level is a change of context (in the technical sense as in type theory). There is the general context where the variables are allowed to vary as much as they may, and then there is the more specific context where $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ is set to (say) $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>3</mn></mrow><annotation encoding="application/x-tex">3</annotation></semantics></math>$ . (There are other intermediate contexts, especially in multivariable Calculus, such as that given by a constraint as in optimization problems.) Assuming for the sake of argument (but this is hardly necessary) that there is exactly one possible state of the world in which $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>=</mo><mn>3</mn></mrow><annotation encoding="application/x-tex">x = 3</annotation></semantics></math>$ , then we have a morphism from the point to the space of all world-states; as you said (but I didn't quote), we are taking a pullback along this morphism and abusing notation by keeping the same symbol $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ .

I tell my students, particularly when working out word problems, to keep careful track of the context. (I use the word ‘context’ but don't let them suspect that it's a technical term in logic!) In a typical problem, they have an equation that holds always, which they may differentiate; but then later they use equations that are only true for an instant. (Related rates and optimization are two broad categories of problems like this.) I tell them that any result from the equations that hold always is also true for an instant, but not conversely (which I think makes intuitive sense); and you cannot differentiate equations that only hold for an instant, because nothing is changing in that instant! (In multivariable Calculus you can differentiate equations that hold under a constraint, and this leads for example to Lagrange multipliers, but you still have to remember that you are working relative to a constraint.) So basically, I'm allowing them to pull back results along a morphism but not push them forward; but I try to make it sound like common sense instead of a theorem of categorial logic!
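To make the "differentiate first, then specialize" rule concrete, here is a small worked example. The ladder setup and its numbers are illustrative assumptions, not taken from the thread: a ladder of length $5$ leans against a wall, with its foot at distance $x$ from the wall and its top at height $y$.

```latex
% Illustrative related-rates setup (an assumption, not from the thread):
% ladder of length 5, foot at distance x from the wall, top at height y.
\begin{align*}
x^2 + y^2 &= 25
  && \text{(holds always, so it may be differentiated)} \\
2x\,\mathrm{d}x + 2y\,\mathrm{d}y &= 0
  && \text{(also holds always)} \\
6\,\mathrm{d}x + 8\,\mathrm{d}y &= 0
  && \text{(pulled back to the instant } x = 3,\ y = 4\text{)} \\
\mathrm{d}y &= -\tfrac{3}{4}\,\mathrm{d}x
\end{align*}
% Wrong order: 9 + y^2 = 25 holds only at the instant x = 3. Differentiating it
% would give 2y\,\mathrm{d}y = 0, i.e. \mathrm{d}y = 0, which contradicts the
% last line above whenever \mathrm{d}x \neq 0.
```

The equation $9 + y^2 = 25$ is a true statement in the context $x = 3$, but it says nothing about how $y$ changes, which is why differentiating it gives a wrong answer.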
- CommentRowNumber32.
- CommentAuthorTobyBartels
- CommentTimeNov 20th 2013
- PermaLink

> the partial differential operator $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mfrac><mo>∂</mo><mrow><mo>∂</mo><msup><mi>x</mi> <mi>i</mi></msup></mrow></mfrac></mrow><annotation encoding="application/x-tex">\frac{\partial}{\partial x^i}</annotation></semantics></math>$ depends on the choice of the other coordinates as well, despite the notation!

In a thermodynamics course, I learnt the notation $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mo>∂</mo><mi>U</mi><mo stretchy="false">/</mo><mo>∂</mo><mi>S</mi><msub><mo stretchy="false">)</mo> <mi>T</mi></msub></mrow><annotation encoding="application/x-tex">(\partial{U}/\partial{S})_T</annotation></semantics></math>$ for the partial derivative of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>U</mi></mrow><annotation encoding="application/x-tex">U</annotation></semantics></math>$ with respect to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>S</mi></mrow><annotation encoding="application/x-tex">S</annotation></semantics></math>$ when $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>T</mi></mrow><annotation encoding="application/x-tex">T</annotation></semantics></math>$ is held constant. (In general, there are $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>n</mi><mo>−</mo><mn>1</mn></mrow><annotation encoding="application/x-tex">n - 1</annotation></semantics></math>$ subscripts.) In my multivariable class, I introduce this notation first, then say that we can drop the subscripts as an abuse of notation when it's obvious what they're going to be. Of course, all of the abuses of notation can be justified in this way (that it's obvious what it's supposed to mean), but only this one is really necessary, since otherwise it gets very tedious.

By the way, Mike, the corresponding notation for the partial derivatives of a function is $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>f</mi> <mi>i</mi></msub></mrow><annotation encoding="application/x-tex">f_i</annotation></semantics></math>$ (where $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>i</mi><mo>=</mo><mn>1</mn><mo>,</mo><mn>2</mn><mo>,</mo><mo>.</mo><mo>.</mo><mo>.</mo></mrow><annotation encoding="application/x-tex">i = 1, 2, ...</annotation></semantics></math>$ ); it works just like $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>f</mi> <mi>y</mi></msub></mrow><annotation encoding="application/x-tex">f_y</annotation></semantics></math>$ in hilbertthm90 #2, only it's legitimate. An alternative is $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi mathvariant="normal">D</mi> <mi>i</mi></msub><mi>f</mi></mrow><annotation encoding="application/x-tex">\mathrm{D}_i{f}</annotation></semantics></math>$ (especially if you want to leave subscripts free for a sequence or other family of functions). This extends to higher-order derivatives just fine, without having to assume commutativity (the claim that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi mathvariant="normal">D</mi> <mrow><mi>i</mi><mo>,</mo><mi>j</mi></mrow></msub><mo>=</mo><msub><mi mathvariant="normal">D</mi> <mrow><mi>j</mi><mo>,</mo><mi>i</mi></mrow></msub></mrow><annotation encoding="application/x-tex">\mathrm{D}_{i,j} = \mathrm{D}_{j,i}</annotation></semantics></math>$ ).
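The dependence on the other coordinates can be illustrated numerically. In the sketch below, everything is an illustrative assumption rather than something from the thread: the quantity $U$, the second coordinate $s = x + y$, and the helper `partial`, which estimates $\mathrm{D}_i f$ by a central difference (with a 0-based index, unlike the 1-based $f_i$ above). The same quantity $U$ is written once in coordinates $(x, y)$ and once in $(x, s)$, and its derivative with respect to $x$ comes out differently depending on what is held constant.

```python
def partial(func, index, point, step=1e-6):
    """Central-difference estimate of the partial derivative of func
    with respect to its argument number `index` (0-based) at `point`.
    Hypothetical helper for illustration only."""
    up = list(point)
    down = list(point)
    up[index] += step
    down[index] -= step
    return (func(*up) - func(*down)) / (2.0 * step)


# The same variable quantity U, expressed in two coordinate systems.
# In (x, y): U = y.  With s = x + y, in (x, s): U = s - x.
def U_of_xy(x, y):
    return y


def U_of_xs(x, s):
    return s - x


x0, y0 = 2.0, 5.0
s0 = x0 + y0  # the same state of the world in the other coordinates

dU_dx_holding_y = partial(U_of_xy, 0, (x0, y0))  # (dU/dx)_y
dU_dx_holding_s = partial(U_of_xs, 0, (x0, s0))  # (dU/dx)_s

print("(dU/dx)_y =", dU_dx_holding_y)
print("(dU/dx)_s =", dU_dx_holding_s)
```

Both calls differentiate with respect to the first slot, which is legitimate for the functions `U_of_xy` and `U_of_xs`; it is only the variable quantity $U$ that needs the subscript to say which other coordinate is held fixed.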
- CommentRowNumber33.
- CommentAuthorMike Shulman
- CommentTimeNov 20th 2013
- PermaLink

@Toby #30: very interesting, thanks! How about this for a second try at “second differentials”?

The general form of a second-order approximation should include not only a first-order change in the variable but also an independent second-order change. That is, let $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}x</annotation></semantics></math>$ be a first-order infinitesimal and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}^2x</annotation></semantics></math>$ a second-order one, and work up to second-order; thus $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi mathvariant="normal">d</mi><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup></mrow><annotation encoding="application/x-tex">(\mathrm{d}x)^2</annotation></semantics></math>$ is relevant but $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi mathvariant="normal">d</mi><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>3</mn></msup></mrow><annotation encoding="application/x-tex">(\mathrm{d}x)^3</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup></mrow><annotation encoding="application/x-tex">(\mathrm{d}^2x)^2</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi mathvariant="normal">d</mi><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">(</mo><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation 
encoding="application/x-tex">(\mathrm{d}x)(\mathrm{d}^2x)</annotation></semantics></math>$ can be neglected as being third- or fourth-order (or are equal to zero, depending on your preferred flavor of infinitesimal). Thus while it is true that
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo>+</mo><mi mathvariant="normal">d</mi><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mo>+</mo><mfrac><mn>1</mn><mn>2</mn></mfrac><mi>f</mi><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mo stretchy="false">(</mo><mi mathvariant="normal">d</mi><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup></mrow><annotation encoding="application/x-tex"> f(x+\mathrm{d}x) = f(x) + f'(x)\, \mathrm{d}x + \frac{1}{2} f''(x)\, (\mathrm{d}x)^2 </annotation></semantics></math>$
we also have (by the same reasoning)
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo>+</mo><mi mathvariant="normal">d</mi><mi>x</mi><mo>+</mo><mfrac><mn>1</mn><mn>2</mn></mfrac><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mo>+</mo><mfrac><mn>1</mn><mn>2</mn></mfrac><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>x</mi><mo>+</mo><mfrac><mn>1</mn><mn>2</mn></mfrac><mi>f</mi><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mo stretchy="false">(</mo><mi mathvariant="normal">d</mi><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup></mrow><annotation encoding="application/x-tex"> f(x+\mathrm{d}x + \frac{1}{2} \mathrm{d}^2x) = f(x) + f'(x)\,\, \mathrm{d}x + \frac{1}{2} f'(x)\, \mathrm{d}^2x + \frac{1}{2} f''(x) \, (\mathrm{d}x)^2 </annotation></semantics></math>$
and it is both the third and the fourth terms here that should be called $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">\mathrm{d}^2(f(x))</annotation></semantics></math>$ , since they are both second-order. That is, we write
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo>+</mo><mi mathvariant="normal">d</mi><mi>x</mi><mo>+</mo><mfrac><mn>1</mn><mn>2</mn></mfrac><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mi mathvariant="normal">d</mi><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mo>+</mo><mfrac><mn>1</mn><mn>2</mn></mfrac><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex"> f(x+\mathrm{d}x + \frac{1}{2} \mathrm{d}^2x) = f(x) + \mathrm{d}(f(x)) + \frac{1}{2} \mathrm{d}^2(f(x)) </annotation></semantics></math>$
and therefore
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mo>=</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>x</mi><mo>+</mo><mi>f</mi><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mo stretchy="false">(</mo><mi mathvariant="normal">d</mi><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup><mo>.</mo></mrow><annotation encoding="application/x-tex"> \mathrm{d}^2(f(x)) = f'(x)\, \mathrm{d}^2 x + f''(x)\, (\mathrm{d}x)^2. </annotation></semantics></math>$
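As a rough numerical check of this expansion, one can model the infinitesimals by small real numbers. This is only a sketch under stated assumptions: $\mathrm{d}x$ is replaced by a small step `h`, $\mathrm{d}^2x$ by `c * h**2` for an arbitrary constant `c`, and $f = \sin$ at $x = 0.7$; none of these choices come from the thread. The error of the expansion should then be third-order in `h`, while dropping the $f'(x)\,\mathrm{d}^2x$ term leaves a second-order error.

```python
import math

# Illustrative assumptions (not from the thread): f = sin, evaluated at x = 0.7.
f = math.sin
f1 = math.cos                      # f'


def f2(x):                         # f''
    return -math.sin(x)


x = 0.7
h = 1e-2          # stands in for the first-order change dx
c = 3.0           # arbitrary constant; d^2x is modelled as c * h**2
dx = h
d2x = c * h ** 2

exact = f(x + dx + 0.5 * d2x)

# First and second differentials as defined in the comment above.
d_f = f1(x) * dx
d2_f = f1(x) * d2x + f2(x) * dx ** 2
approx_full = f(x) + d_f + 0.5 * d2_f

# Same expansion but omitting the f'(x) d^2x term.
approx_without_d2x = f(x) + d_f + 0.5 * f2(x) * dx ** 2

err_full = abs(exact - approx_full)
err_without = abs(exact - approx_without_d2x)

print("error with f'(x) d^2x term:   ", err_full)
print("error without f'(x) d^2x term:", err_without)
```

Treating $\mathrm{d}^2x$ as a real number proportional to $h^2$ is just one way to model "second-order"; it does not depend on any particular flavor of infinitesimal.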
Now if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi><mo>=</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">u = f(x)</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi><mo>=</mo><mi>g</mi><mo stretchy="false">(</mo><mi>u</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">y = g(u)</annotation></semantics></math>$ , we have
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mrow><mtable displaystyle="true" columnalign="right left right left right left right left right left" columnspacing="0em"><mtr><mtd><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>y</mi></mtd> <mtd><mo>=</mo><mi>g</mi><mo>′</mo><mo stretchy="false">(</mo><mi>u</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>u</mi><mo>+</mo><mi>g</mi><mo>″</mo><mo stretchy="false">(</mo><mi>u</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mo stretchy="false">(</mo><mi mathvariant="normal">d</mi><mi>u</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup></mtd></mtr> <mtr><mtd/> <mtd><mo>=</mo><mi>g</mi><mo>′</mo><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mo maxsize="1.8em" minsize="1.8em">(</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>x</mi><mo>+</mo><mi>f</mi><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mo stretchy="false">(</mo><mi mathvariant="normal">d</mi><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup><mo maxsize="1.8em" minsize="1.8em">)</mo><mo>+</mo><mi>g</mi><mo>″</mo><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mo stretchy="false">(</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup></mtd></mtr> <mtr><mtd/> <mtd><mo>=</mo><mi>g</mi><mo>′</mo><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo 
stretchy="false">)</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>x</mi><mo>+</mo><mo maxsize="1.8em" minsize="1.8em">(</mo><mi>g</mi><mo>′</mo><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mi>f</mi><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mi>g</mi><mo>″</mo><mo stretchy="false">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mo stretchy="false">(</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><msup><mo stretchy="false">)</mo> <mn>2</mn></msup><mo maxsize="1.8em" minsize="1.8em">)</mo><mo stretchy="false">(</mo><mi mathvariant="normal">d</mi><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup></mtd></mtr></mtable></mrow></mrow><annotation encoding="application/x-tex"> \begin{aligned} \mathrm{d}^2 y &amp;= g'(u) \, \mathrm{d}^2 u + g''(u) \, (\mathrm{d}u)^2 \\ &amp;= g'(f(x)) \Big(f'(x) \, \mathrm{d}^2 x + f''(x)\, (\mathrm{d}x)^2\Big) + g''(f(x)) (f'(x)\, \mathrm{d}x)^2\\ &amp;= g'(f(x)) f'(x) \, \mathrm{d}^2 x + \Big(g'(f(x)) f''(x) + g''(f(x)) (f'(x))^2\Big) (\mathrm{d}x)^2 \end{aligned} </annotation></semantics></math>$
And matching this with
$$ \mathrm{d}^2((g\circ f)(x)) = (g\circ f)'(x)\, \mathrm{d}^2 x + (g\circ f) ''(x)\, (\mathrm{d}x)^2 $$
we recover the correct chain rules for both first and second derivatives.
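
Spelled out, the matching goes coefficient by coefficient: the factor on $\mathrm{d}^2 x$ gives the first-derivative rule and the factor on $(\mathrm{d}x)^2$ gives the second-derivative rule. This only restates the step above, using the same $f$ and $g$:

$$ \begin{aligned} (g\circ f)'(x) &= g'(f(x))\, f'(x), \\ (g\circ f)''(x) &= g'(f(x))\, f''(x) + g''(f(x))\, (f'(x))^2. \end{aligned} $$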

I suspect that this could be written in terms of jet bundles.

Of course, now we’ve lost the notation $\frac{\mathrm{d}^2y}{(\mathrm{d}x)^2}$ for the second derivative. Unless there’s some reason why we can assume $\mathrm{d}^2x=0$.
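
One way to sanity-check the second-order formula numerically is sketched below. It rests on an assumption that is not part of the thread: $\mathrm{d}x$ and $\mathrm{d}^2x$ are modelled as ordinary real numbers $a t$ and $b t^2$ for a small parameter $t$, so that "second-order" means "order $t^2$". The function names, the test function $\sin$, and the constants `x`, `a`, `b` are all hypothetical choices for illustration. If the formula is right, the gap between the two sides should shrink like $t^3$.

```python
import math

def second_order_expansion(f, df, d2f, x, dx, d2x):
    """Right-hand side f(x) + d(f(x)) + (1/2) d^2(f(x)), using
    d(f(x))   = f'(x) dx  and
    d^2(f(x)) = f'(x) d^2x + f''(x) (dx)^2."""
    d_f = df(x) * dx
    d2_f = df(x) * d2x + d2f(x) * dx**2
    return f(x) + d_f + 0.5 * d2_f

def residual(t, x=0.7, a=1.3, b=-0.4):
    # Illustrative assumption: dx = a*t and d^2x = b*t**2 for a small real t.
    dx, d2x = a * t, b * t**2
    exact = math.sin(x + dx + 0.5 * d2x)  # left-hand side
    approx = second_order_expansion(
        math.sin, math.cos, lambda s: -math.sin(s), x, dx, d2x
    )
    return abs(exact - approx)

# The ratio residual / t**3 should stay bounded as t shrinks.
for t in (1e-1, 1e-2, 1e-3):
    print(t, residual(t), residual(t) / t**3)
```

Setting `b = 0` in this sketch corresponds to the case $\mathrm{d}^2x = 0$, in which only the $f''(x)\,(\mathrm{d}x)^2$ term survives at second order.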
- CommentRowNumber34.
- CommentAuthorMichael_Bachtold
- CommentTimeNov 20th 2013
- PermaLink
Author: Michael_Bachtold
@Toby: thanks for the detailed answer!

Concerning Mike's nice reasoning in #33:

> Of course, now we’ve lost the notation $\frac{d^2y}{(dx)^2}$ for the second derivative.

If I understood this correctly, you might still “save” it by using the notation $\frac{\partial^2y}{(dx)^2}$, or should it be $\frac{\partial^2y}{(\partial x)^2}$?
- CommentRowNumber35.
- CommentAuthorMike Shulman
- CommentTimeNov 20th 2013
- PermaLink
Author: Mike Shulman
@Michael: I didn’t use the notation $\partial^2 y$; what are you thinking that it would mean? The problem that I saw is that $\mathrm{d}^2 y$ is not a linear function of $(\mathrm{d}x)^2$ alone, but also of $\mathrm{d}^2x$.
- CommentRowNumber36.
- CommentAuthorMike Shulman
- CommentTimeNov 20th 2013
- PermaLink
Author: Mike Shulman
@Toby, I have a couple of terminological questions for you.

1) What do you call “$x^2+1$”? In Lawvere’s parlance it is still a “variable quantity”, but as it is not syntactically a variable I wouldn’t want to call it that. But neither is it a “function” in your setup, if I understood correctly.

2) What do you call an object like “$(x^2+1)\;\mathrm{d}x$” which you can take the integral of?
- CommentRowNumber37.
- CommentAuthorTobyBartels
- CommentTimeNov 20th 2013
- (edited Nov 20th 2013)
- PermaLink
Author: TobyBartels
> $\mathrm{d}^2(f(x)) = f'(x)\, \mathrm{d}^2 x + f''(x)\, (\mathrm{d}x)^2$.

I agree; I first got this by writing $\mathrm{d}f(x) = f'(x) \,\mathrm{d}x$ and applying the product rule. Notice that the $\mathrm{d}$ here is *not* the exterior derivative but instead a commutative (rather than supercommutative) operator.

> Of course, now we’ve lost the notation $\frac{\mathrm{d}^2y}{(\mathrm{d}x)^2}$ for the second derivative.

Right, and I don't know how to rehabilitate that notation, because of my last remark in #30.

However, you *can* write $\frac{\partial^2{y}}{(\partial{x})^2}$ instead! This is because, just as $\frac{\partial{y}}{\partial{x}}$ is the coefficient on $\mathrm{d}x$ in an expansion of $\mathrm{d}y$, so $\frac{\partial^2{y}}{(\partial{x})^2}$ is the coefficient on $(\mathrm{d}x)^2$ in an expansion of $\mathrm{d}^2y$. (ETA: Michael already noticed this in #34, but hopefully my explanation of it helps.)

By the way, I also tell my students that $\mathrm{d}$ binds more tightly than any non-differential operation like squaring, so I can write $\mathrm{d}x^2$ instead of $(\mathrm{d}x)^2$. (Then I always use parentheses in something like $\mathrm{d}(x^2) = 2x \,\mathrm{d}x$.)
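
For a concrete instance of reading off these coefficients, take $y = x^3$ (an illustrative choice, not from the thread) and apply the product rule to $\mathrm{d}y$, writing $(\mathrm{d}x)^2$ with explicit parentheses:

$$ \begin{aligned} \mathrm{d}y &= 3x^2 \,\mathrm{d}x, \\ \mathrm{d}^2 y &= \mathrm{d}(3x^2) \,\mathrm{d}x + 3x^2 \,\mathrm{d}(\mathrm{d}x) = 6x \,(\mathrm{d}x)^2 + 3x^2 \,\mathrm{d}^2 x. \end{aligned} $$

Here the coefficient on $\mathrm{d}x$ in $\mathrm{d}y$ is $\frac{\partial y}{\partial x} = 3x^2$, and the coefficient on $(\mathrm{d}x)^2$ in $\mathrm{d}^2 y$ is $\frac{\partial^2 y}{(\partial x)^2} = 6x$, which agrees with the ordinary second derivative of $x^3$.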
- CommentRowNumber38.
- CommentAuthorTobyBartels
- CommentTimeNov 20th 2013
- (edited Nov 20th 2013)
- PermaLink
Author: TobyBartels
Answering Mike's questions in #36:

1. I call $x^2 + 1$ simply a quantity. I might even call it a real number (but not a constant one) or even simply a number, but ‘quantity’ usually works well (except in applications to supply and demand, where ‘quantity’ has a more specific meaning). It *is* a variable quantity, of course, and I might even point out that it varies or may even say ‘$x^2 + 1$ is variable.’, but not ‘$x^2 + 1$ is _a_ variable.’; that would be confusing. (In other words, when describing the quantity as a whole, I would use ‘variable’ only as an adjective.)

   Also, I *will* say that $x^2 + 1$ is a function <ins>of $x$</ins>,[^ins] which simply means that there exists a function $f$ such that $x^2 + 1 = f(x)$. If I'm emphasizing the logical form, then I may also call it an algebraic expression; technically, the expression ‘$x^2 + 1$’ _represents_ or _stands for_ the quantity $x^2 + 1$.
2. Sometimes I call $(x^2 + 1) \,\mathrm{d}x$ an infinitesimal quantity, if I want to emphasize its interpretation as something infinitely small (and similarly I might call $x^2 + 1$ a finitesimal quantity). But if I'm talking about things that one can integrate, then I usually call it a differential form. I actually introduce that term fairly early, when I remark that the differential of any (finitesimal) expression (in any number of variables!) will be a differential form; I point out that every term has a differential as one factor, note that this makes every term (and hence the sum) an infinitesimal quantity, and then I introduce the name for expressions of this form. (I also remark that the differential form has rank $1$ because only $1$ factor of each term is a differential, but we don't have to say that since differential forms of higher rank are only used in multivariable Calculus. And then in my multivariable class, I use them!)

[^ins]: In case your browser fails to render <ins> in combination with MathML (as mine does), the ‘$x$’ should also be underlined (assuming that your browser renders <ins> as underlining, which they nearly all do). This is just for emphasis.
- CommentRowNumber39.
- CommentAuthorMichael_Bachtold
- CommentTimeNov 20th 2013
- (edited Nov 20th 2013)
- PermaLink
Author: Michael_Bachtold
Toby in #37 wrote:

> I first got this by writing $\mathrm{d}f(x) = f'(x) \,\mathrm{d}x$ and applying the product rule. Notice that the $\mathrm{d}$ here is *not* the exterior derivative but instead a commutative (rather than supercommutative) operator.

Interesting! Would you mind telling a bit more about this $d$ as a commutative operator and how the equation follows from the chain rule. Studying synthetic differential geometry is still on my TODO list, so apologies if this is standard knowledge among experts.

I also still need to understand Mike's computation in #33. Intuitively I would have thought that a first order infinitesimal is also an infinitesimal of second order, so I find it confusing that we need to include the first order change separately when looking at the effect of a second order change. (And I also have no intuition for what it means that the two changes are independent, from a geometric or physical perspective.)

Edit: I’m also still curious if Mike's suggestion from #26 can be made consistent with the different interpretations of $d$ suggested above:

> So substituting 3 for $x$ doesn’t make $dx$ into $d(3)$. Rather, it just means that instead of $dx$ being a small variation about $x$, it is a small variation about 3.
- CommentRowNumber40.
- CommentAuthorMike Shulman
- CommentTimeNov 21st 2013
- PermaLink
Author: Mike Shulman
So would it then be correct to also write $f'(x) = \frac{\partial^2 y}{\partial^2 x}$?

> Intuitively I would have thought that a first order infinitesimal is also an infinitesimal of second order

Actually, it’s the other way around: a *second* order infinitesimal is also a *first* order one (although to first order, it’s zero). Higher order means a smaller number.
- CommentRowNumber41.
- CommentAuthorTobyBartels
- CommentTimeNov 21st 2013
- PermaLink
Author: TobyBartels
>> I first got this by writing $\mathrm{d}f(x) = f'(x) \,\mathrm{d}x$ and applying the product rule. Notice that the $\mathrm{d}$ here is *not* the exterior derivative but instead a commutative (rather than supercommutative) operator.

> Interesting! Would you mind telling a bit more about this $d$ as a commutative operator and how the equation follows from the chain rule.

I don't know very much about that operator; [I asked about this stuff once on Math Overflow](http://mathoverflow.net/questions/60474/is-there-a-convenient-differential-calculus-for-cojets) and got no clear answer, although I did get a reference that I haven't followed up yet. But if I just assume that it continues to obey the usual rules, then I can calculate with it just fine. In this case:
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi mathvariant="normal">d</mi><mo stretchy="false">(</mo><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mo>=</mo><mi mathvariant="normal">d</mi><mo stretchy="false">(</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi mathvariant="normal">d</mi><mo stretchy="false">(</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mo>+</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mo stretchy="false">(</mo><mi mathvariant="normal">d</mi><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mo stretchy="false">(</mo><mi>f</mi><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mo>+</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><msup><mi mathvariant="normal">d</mi> <mn>2</mn></msup><mi>x</mi><mo>=</mo><mi>f</mi><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><msup><mi>x</mi> <mn>2</mn></msup><mo>+</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><msup><mi mathvariant="normal">d</mi> 
<mn>2</mn></msup><mi>x</mi><mo>.</mo></mrow><annotation encoding="application/x-tex"> \mathrm{d}^2f(x) = \mathrm{d}(\mathrm{d}f(x)) = \mathrm{d}(f'(x) \,\mathrm{d}x) = \mathrm{d}(f'(x)) \,\mathrm{d}x + f'(x) \,\mathrm{d}(\mathrm{d}x) = (f''(x) \,\mathrm{d}x) \,\mathrm{d}x + f'(x) \,\mathrm{d}^2x = f''(x) \,\mathrm{d}x^2 + f'(x) \,\mathrm{d}^2x .</annotation></semantics></math>$

So would it then be correct to also write $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mfrac><mrow><msup><mo>∂</mo> <mn>2</mn></msup><mi>y</mi></mrow><mrow><msup><mo>∂</mo> <mn>2</mn></msup><mi>x</mi></mrow></mfrac></mrow><annotation encoding="application/x-tex">f'(x) = \frac{\partial^2 y}{\partial^2 x}</annotation></semantics></math>$ ?

Apparently so! But of course $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mfrac><mrow><mo>∂</mo><mi>y</mi></mrow><mrow><mo>∂</mo><mi>x</mi></mrow></mfrac></mrow><annotation encoding="application/x-tex">\frac{\partial y}{\partial x}</annotation></semantics></math>$ is simpler.
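One way to sanity-check the formula for $\mathrm{d}^2f(x)$ is to compare it with the ordinary chain rule for second derivatives. This is only a sketch under added assumptions: $x = g(t)$ for some twice-differentiable $g$, and $t$ is the independent variable, so that $\mathrm{d}^2t = 0$. The names $g$ and $t$ are introduced here purely for illustration.

```latex
% Assumption: x = g(t), with t independent, so d^2 t = 0.
\begin{aligned}
\mathrm{d}x &= g'(t)\,\mathrm{d}t, \qquad
\mathrm{d}^2x = g''(t)\,\mathrm{d}t^2 + g'(t)\,\mathrm{d}^2t = g''(t)\,\mathrm{d}t^2, \\
\mathrm{d}^2f(x) &= f''(x)\,\mathrm{d}x^2 + f'(x)\,\mathrm{d}^2x \\
  &= \bigl(f''(g(t))\,g'(t)^2 + f'(g(t))\,g''(t)\bigr)\,\mathrm{d}t^2 .
\end{aligned}
```

The coefficient of $\mathrm{d}t^2$ is the usual expression for $(f \circ g)''(t)$. If $x$ itself is taken as the independent variable, so that $\mathrm{d}^2x = 0$, the formula reduces to $\mathrm{d}^2f(x) = f''(x)\,\mathrm{d}x^2$.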
- CommentRowNumber42.
- CommentAuthorMichael_Bachtold
- CommentTimeNov 21st 2013
Author: Michael_Bachtold
Format: MarkdownItex

@Mike #40:

> Actually, it's the other way around: a *second* order infinitesimal is also a *first* order one (although to first order, it's zero). Higher order means a smaller number.

Here's how I was thinking: a first order infinitesimal number is one with $\epsilon^2=0$, a second order one is one with $\epsilon^3=0$. So a first order infinitesimal is also of second order. Actually, when I picture infinitesimal neighborhoods of a point or subset in a manifold or whatever space, I have always thought that they grow as the order increases. Did I get it wrong, or are we maybe talking about dual things?

@Toby #41: I recall that question on Math Overflow (one of the comments was mine). Unfortunately I have not had the time to follow up on the references either. But I do find the infinitesimals and differentials approach advocated by Dray and Manogue to be worthwhile for teaching calculus. And since Mike arrived at the same equation as you for slightly different reasons, it is even more compelling to believe that there is still something to be understood in the interpretation of $d$ or $d^2$.
- CommentRowNumber43.
- CommentAuthorMike Shulman
- CommentTimeNov 21st 2013
Author: Mike Shulman
Format: MarkdownItex

Yes, I think we are using language in dual ways. I'm thinking of nonstandard-analysis-style infinitesimals, whose square or cube is never actually equal to zero. Instead I'm saying, let's fix some particular "scale" infinitesimal $\epsilon$; then a first-order infinitesimal is one $\eta$ such that $\eta/\epsilon$ is finite ("limited"), a second-order one is such that $\eta/\epsilon^2$ is finite, etc.

Then when we work "up to first order", which we could formalize as being in the quotient ring of limited numbers modulo $\epsilon^2$, the square of a first-order infinitesimal can be neglected. And when we work "up to second order", i.e. in the limited numbers modulo $\epsilon^3$, the cube of a first-order infinitesimal and the square of a second-order one can be neglected. So in the latter quotient ring, a first-order $\eta$ has $\eta^3=0$ while a second-order one has $\eta^2=0$.

I think that this matches the use of phrases like "to first order" and "a first order change" in ordinary (non-infinitesimal) language better. A second order change is negligible if we are working to first order, but not if we are working to second order, yet the amount of the change itself is the same in both cases; what changes is our attitude towards it. But I guess it doesn't apply as well to SDG-style nilpotent infinitesimals, so with those it may be better to avoid terms "first order" and "second order" and talk instead about "nilsquare" and "nilcube" etc.
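A small worked instance of this bookkeeping may help. The elements $\eta_1 = 3\epsilon$ and $\eta_2 = 5\epsilon^2$ are chosen here purely as illustrations (they do not come from the discussion), and "$= 0$" means "lies in the ideal generated by the stated power of $\epsilon$ in the limited numbers":

```latex
% Illustrative elements (assumed for this example):
%   \eta_1 = 3\epsilon    is first order:  \eta_1/\epsilon   = 3 is limited
%   \eta_2 = 5\epsilon^2  is second order: \eta_2/\epsilon^2 = 5 is limited
\begin{aligned}
\text{modulo } \epsilon^2:&\quad
  \eta_1 \neq 0, \qquad \eta_1^2 = 9\epsilon^2 = 0, \qquad \eta_2 = 5\epsilon^2 = 0, \\
\text{modulo } \epsilon^3:&\quad
  \eta_1^2 = 9\epsilon^2 \neq 0, \qquad \eta_1^3 = 27\epsilon^3 = 0, \qquad
  \eta_2 \neq 0, \qquad \eta_2^2 = 25\epsilon^4 = 0 .
\end{aligned}
```

The same element $\eta_2$ vanishes when working to first order and survives when working to second order, which is the sense in which only the attitude towards it changes.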
- CommentRowNumber44.
- CommentAuthorMichael_Bachtold
- CommentTimeNov 23rd 2013
Author: Michael_Bachtold
Format: MarkdownItex

Mike, thanks for the explanation! I'll have to think about it some more to resolve the conflicting views in my head.

In the meantime, here is something slightly related to the original question. In my calculus class last week I asked the students to answer the following questions.

Compute the derivative of:

1. $\int_2^x \ln(t^2+1)dt$ with respect to $x$
2. $\int_2^x \ln(t^2+1)dt$ with respect to $t$
3. $\int \ln(t^2+1)dt$ with respect to $t$

(The last one is an indefinite integral; I'm using the notation of my calculus book here (Hughes-Hallett).)

That caused a lot of confusion for my students. My preliminary reaction is to think of the notation for the indefinite integral as the bad guy.
- CommentRowNumber45.
- CommentAuthorMike Shulman
- CommentTimeNov 23rd 2013
Author: Mike Shulman
Format: MarkdownItex

I wouldn't ask my students (2) or (3). In fact, I'm not sure what you were expecting.

In (2), are you assuming that $x$ is a function of $t$? Or constant with respect to $t$? Were you hoping that they would write $\ln(x^2+1) \frac{dx}{dt}$? While there's technically no contradiction in using the same variable both free and bound, it's bad style even in published mathematical papers, so I wouldn't want to inflict it on calculus students.

As for (3), the way I'm used to thinking of it, $\int \ln(t^2+1)dt$ is not a function but a class of functions (differing by local constants), hence not something you can take the derivative of.

However, you do raise an important point, which is that $t$ is bound in $\int_a^b f(t) dt$ but (sort of) free in $\int f(t) dt$. I'm curious to hear Toby's take. I introduced indefinite integrals to my class last week by saying that the indefinite integral of a thing (I didn't say "differential form", but I might have as Toby suggested) is the most general expression whose differential is that thing. That made perfect sense to me.

I haven't done definite integrals yet, but from that point of view, maybe the problem is with the definite integral notation, since we have the limits $a$ and $b$ specified but without indicating in the notation which variable is supposed to take on those values. For instance, in a chain rule / substitution problem, say we have $\int_1^2 2t \cos(t^2) dt$, which we can solve by letting $u = t^2$ so that $du = 2 t dt$ and

$$2t \cos(t^2) dt = \cos(u) du.$$

But this equality (of differential forms) is not something to which we can apply the "operation" $\int_1^2$ and get

$$\int_1^2 2t \cos(t^2) dt = \int_1^2 \cos(u) du.$$

Instead we have to put $t=1$ and $t=2$ into $u=t^2$ and get

$$\int_1^2 2t \cos(t^2) dt = \int_1^4 \cos(u) du.$$

So maybe it would be better to write $\int_{t=1}^2 2t \cos(t^2) dt$ (as we do with summation notation, $\sum_{t=1}^4$) so that we could have

$$\int_{t=1}^2 2t \cos(t^2) dt = \int_{t=1}^2 \cos(u) du = \int_{u=1}^4 \cos(u) du.$$
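Carrying the substitution example through to numbers shows how much the choice of limits matters. The sketch below uses only the antiderivative $\sin(u)$ of $\cos(u)$; the decimal values are rounded approximations.

```latex
\begin{aligned}
\int_{t=1}^{2} 2t\cos(t^2)\,dt
  &= \int_{u=1}^{4} \cos(u)\,du
   = \sin(4) - \sin(1) \approx -1.598, \\
\int_{u=1}^{2} \cos(u)\,du
  &= \sin(2) - \sin(1) \approx 0.068 .
\end{aligned}
```

The second line is what a naive reading of $\int_1^2 \cos(u)\,du$ would produce. It differs from the first even in sign, so labelling the limits with the variable they refer to removes a real ambiguity.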
- CommentRowNumber46.
- CommentAuthorMichael_Bachtold
- CommentTimeNov 24th 2013
Author: Michael_Bachtold
Format: MarkdownItex

>In (2), are you assuming that $x$ is a function of $t$? Or constant with respect to $t$? Good point. But before I answer (and let me know if you see this differently): the discussion so far has shown that there are at least two popular interpretations of "variables" in calculus: one in the sense of "dummy variables" or placeholders for numbers, and one in the sense of "variable quantity" or maybe morphism in a suitable category. It also seems that these two interpretations can lead to conflicts. But I'd be glad to understand this better still. Having said that: For 2) I was expecting that they answer $0$. From the "dummy variable" perspective $x$ is a placeholder for a number (representing the upper boundary and otherwise not related to $t$), and since the variable $t$ is bound the whole integral does not "change" when we plug in different values for $t$, so the derivative is zero. From the "variable quantity" perspective $x$ might depend on $t$, so the correct answer would be, as you suggest, $\ln(x^2+1) \frac{dx}{dt}$. So to be consistent with the previous answer we need to assume $x$ is constant with respect to $t$. But some things are not yet clear to me about this last answer. I'll come to it in a moment. As for 3) I was expecting that they answer $\ln(t^2+1)$. I also think of the indefinite integral as a family of functions (depending on the same variable $t$ as the differential form). I guess I'm using the convention here that taking the derivative of a family of functions means taking the derivative of each member of the family. Of course in principle the additive constant could still depend on $t$ in some context, which makes things more subtle. >...maybe the problem is with the definite integral notation, since we have the limits $a$ and $b$ specified but without indicating in the notation which variable is supposed to take on those values. 
I think the standard convention here is that the boundaries $a,b$ of the definite integral always refer to the variable appearing in $dx$ (or $dt$ etc.) so there is seldom ambiguity there. But as you suggest I also emphasize this by writing $\int_{u=1}^4 \cos(u)du$ instead of $\int_{1}^4 \cos(u)du$. In fact I sometimes overemphasize by writing $\int_{u=1}^{u=4} \cos(u)du$, which brings me back to 2). If I had written $\int_{t=2}^{t=x} \ln(t^2+1)dt$ and interpret variables as "variable quantities", then how should I interpret the equality $t=x$ appearing in the upper boundary? Does it mean that $t$ and $x$ are the same variable quantities? In that case the answer to 2) would be the same as the answer to 3) and it wouldn't be possible to ask if $x$ is constant with respect to $t$ (also a student might object that it is unnecessary to introduce a new name $x$ to denote the same thing as $t$, which nevertheless is considered bad style as you mention). But I suspect that the thing going on here and elsewhere in the "variable quantity" perspective is that an equality like $t=x$ is interpreted in a way more commonly seen in probability/statistics as in $\{x=t \}$ denoting "the set of all states where the random variables $x$ and $t$ assume the same value." This raises some (maybe sidetracking) questions for me: 1. if the "variable quantity" perspective can be formalized via arrows in a suitable category, then how does one formalize categorically the notion of two quantities (arrows) being "independent" or "constant" with respect to each other? 1. What does the "set of states of the world" (also mentioned by Toby) correspond to categorically? Some classifying object? These questions are not directly addressed at Mike or Toby, but if you happen to know some answers I won't complain. :) Apologies if I can't respond in the next few days.
In (2), are you assuming that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ is a function of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>t</mi></mrow><annotation encoding="application/x-tex">t</annotation></semantics></math>$ ? Or constant with respect to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>t</mi></mrow><annotation encoding="application/x-tex">t</annotation></semantics></math>$ ?

Good point. But before I answers (and let me know me if you see this differently): the discussion so far showed that there are at least two popular interpretations for “variables” in calculus: one in the sense of “dummy variables” or placeholders for numbers, and one in the sense of “variable quantity” or maybe morphism in a suitable category. It also seems that these two interpretations can lead to conflicts. But I’d be glad to understand this better still.

Having said that:

For 2) I was expecting that they answer $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn></mrow><annotation encoding="application/x-tex">0</annotation></semantics></math>$ . From the “dummy variable” perspective $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ is a placeholder for a number (representing the upper boundary and otherwise not related to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>t</mi></mrow><annotation encoding="application/x-tex">t</annotation></semantics></math>$ ), and since the variable $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>t</mi></mrow><annotation encoding="application/x-tex">t</annotation></semantics></math>$ is bound the whole integral does not “change” when we plug in different values for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>t</mi></mrow><annotation encoding="application/x-tex">t</annotation></semantics></math>$ , so the derivative is zero.

From the “variable quantity” perspective $x$ might depend on $t$, so the correct answer would be, as you suggest, $\ln(x^2+1) \frac{dx}{dt}$. So to be consistent with the previous answer we need to assume $x$ is constant with respect to $t$.
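
To make the two readings concrete, here is a small worked sketch. The name $F$ and the bound variable $s$ are introduced only for illustration and do not appear in the original question. Write $F(x) = \int_2^x \ln(s^2+1)\,ds$, so that $F'(x) = \ln(x^2+1)$ by the Fundamental Theorem of Calculus. If $x$ is assumed to be a differentiable function of $t$, the chain rule gives

$$ \frac{d}{dt} F(x) = F'(x)\,\frac{dx}{dt} = \ln(x^2+1)\,\frac{dx}{dt} $$

and this reduces to $0$ when $\frac{dx}{dt} = 0$, that is, when $x$ is constant with respect to $t$, which recovers the “dummy variable” answer.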

But some things are not yet clear to me about this last answer. I’ll come to it in a moment.

As for 3) I was expecting that they answer $\ln(t^2+1)$. I also think of the indefinite integral as a family of functions (depending on the same variable $t$ as the differential form). I guess I’m using the convention here that taking the derivative of a family of functions means taking the derivative of each member of the family. Of course in principle the additive constant could still depend on $t$ in some context, which makes things more subtle.

>…maybe the problem is with the definite integral notation, since we have the limits $a$ and $b$ specified but without indicating in the notation which variable is supposed to take on those values.

I think the standard convention here is that the boundaries $a,b$ of the definite integral always refer to the variable appearing in $dx$ (or $dt$ etc.), so there is seldom ambiguity there. But as you suggest, I also emphasize this by writing $\int_{u=1}^4 \cos(u)du$ instead of $\int_{1}^4 \cos(u)du$. In fact I sometimes overemphasize by writing $\int_{u=1}^{u=4} \cos(u)du$, which brings me back to 2).

If I had written $\int_{t=2}^{t=x} \ln(t^2+1)dt$ and interpreted variables as “variable quantities”, then how should I interpret the equality $t=x$ appearing in the upper boundary? Does it mean that $t$ and $x$ are the same variable quantities? In that case the answer to 2) would be the same as the answer to 3) and it wouldn’t be possible to ask if $x$ is constant with respect to $t$ (also a student might object that it is unnecessary to introduce a new name $x$ to denote the same thing as $t$, which nevertheless is considered bad style as you mention). But I suspect that the thing going on here and elsewhere in the “variable quantity” perspective is that an equality like $t=x$ is interpreted in a way more commonly seen in probability/statistics, as in $\{x=t \}$ denoting “the set of all states where the random variables $x$ and $t$ assume the same value.”

This raises some (maybe sidetracking) questions for me:
1. If the “variable quantity” perspective can be formalized via arrows in a suitable category, then how does one formalize categorically the notion of two quantities (arrows) being “independent” or “constant” with respect to each other?
2. What does the “set of states of the world” (also mentioned by Toby) correspond to categorically? Some classifying object?
These questions are not directly addressed at Mike or Toby, but if you happen to know some answers I won’t complain. :) Apologies if I can’t respond in the next few days.
- CommentRowNumber47.
- CommentAuthorMike Shulman
- CommentTimeNov 24th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItex
Yes, I'm perfectly aware of the standard convention, and I agree that in practice there is no ambiguity in the meaning of a particular definite integral expression, but I described a situation (integration by substitution) in which the lack of notation could be problematic for a student when manipulating several such expressions.

As for the meaning of $t=x$, I think more generally one of the things we can do with a "variable quantity" is to let it be equal to a particular other quantity. If the other quantity is constant, then it "stops varying" and becomes constant, while if the other quantity is also variable then their variation becomes dependent. For instance, when $x$ and $y$ are variable quantities, we write $\left.\frac{dy}{dx}\right|_{x=2}$ for what, if $y=f(x)$, we might also write as $f'(2)$.

Categorically, variable quantities are morphisms from some domain object, say $\Gamma$ (which I think is what Toby meant by the space of "states of the world"), and setting two such variable quantities equal would correspond to restricting the domain to the equalizer of those two morphisms. That's probably the same as what you mean by $\{x=t\}$?

I think this "fixing the value of a variable quantity" is the same thing that's happening in a definite integral. Given a differential form like $\ln(t^2+1)dt$ involving a variable quantity $t$ and its differential $dt$, we can integrate this form from one particular value of $t$ to another. These particular values might be constant quantities or other variable quantities (such as variables), and in the latter case the result is again going to be variable.

I need to think a bit about your first question.
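
For the equalizer description above, a hedged illustration in the category of sets may help. The choice $\Gamma = \mathbb{R}^2$, with $x$ and $t$ taken to be the two coordinate projections $\Gamma \to \mathbb{R}$, is an assumption made only for this example. The equalizer of $x$ and $t$ is then the diagonal:

$$ \{x = t\} = \{(a,b) \in \mathbb{R}^2 : a = b\} \hookrightarrow \mathbb{R}^2 $$

Restricted to this subobject, $x$ and $t$ become the same morphism, so their variation is dependent. Likewise, equalizing $x$ with the constant quantity $2$ restricts $\Gamma$ to $\{(2,b) : b \in \mathbb{R}\}$, on which $x$ factors through $1$ and so has "stopped varying".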
- CommentRowNumber48.
- CommentAuthorMike Shulman
- CommentTimeNov 24th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItex
I know what it means for a variable quantity $\Gamma \to R$ to be constant (a constant morphism): it means that it factors through $1$. I'm not sure about "constant with respect to" some other quantity, though. Maybe that is one of those things which only makes sense if the quantity "with respect to" is part of a given basis, so that we can say that the corresponding partial derivative vanishes?
- CommentRowNumber49.
- CommentAuthorTobyBartels
- CommentTimeNov 25th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexIn the context of indefinite integrals, I read ‘$\int$’ as ‘antidifferential’, since $\omega$ is the differential of $\int \omega$; that is, $\int \omega$ is an antidifferential of $\omega$. (Of course, when $\omega = y \,\mathrm{d}x$, the derivative of $\int y \,\mathrm{d}x$ with respect to $x$ is $y$, so $\int y \,\mathrm{d}x$ is also an antiderivative of $y$ with respect to $x$, like they say in the book.) I tend to avoid the term ‘indefinite integral’; it's bad enough that (almost) the same notation is used for two different concepts (definite and indefinite integrals), and I\'d just as soon not use (almost) the same terminology as well. I\'ve never liked the idea that the antidifferential of a differential form (or whatever you want to call that) is a _set_ of quantities; I try to say ‘an’ instead of ‘the’ as much as possible. If you just want one antidifferential, then (for example) $\int x^2 \,\mathrm{d}x = x^3/3$ is OK; but if you want all of them, then you need $\int x^2 \,\mathrm{d}x = x^3/3 + C$. (I enforce the book\'s answers to its problems by saying that it\'s asking for all of them. And I say that it\'s only interested in quantities defined on a connected domain, so I don\'t have to deal with _local_ constants.) For definite integrals, I introduce the notation first as $\int_p^q \omega$, where $p$ and $q$ are equations (preferably with unique solutions). Then $\int_{x = a}^b \omega$ is an abbreviation for $\int_{x = a}^{x = b} \omega$ (as Michael wrote) when the left-hand sides are the same; finally, $\int_a^b y \,\mathrm{d}x$ is an abbreviation for $\int_{x=a}^b y \,\mathrm{d}x$ when only one variable\'s differential appears in the expression for $\omega$. That\'s mostly how they look, but I encourage them to use a longer form when doing integration by substitution, for the reasons that Mike gives. (This can violate the requirement that $p$ and $q$ have *unique* solutions. 
It\'s sufficient that the result of the integral be the same for any choice of solution, or at least for any choice where the solutions of $p$ and $q$ are connected.) The Fundamental Theorem of Calculus has two parts, which are inconsistently numbered. By the numbering in our textbook (which is the way that I learnt it): 1. $\mathrm{d}(\int_p^q \omega) = {\omega|_p^q}$, 2. $\int_p^q \mathrm{d}u = {u|_p^q}$. Since this a theorem and needs fine print (about things being continuous and the like), I state and prove these first in function notation like the book does, but I bring up these forms eventually. The variable $t$ is definitely free in both $\mathrm{d}f(t)/\mathrm{d}t$ and $\int f(t) \,\mathrm{d}t$, no ‘sort of’ about it. It\'s bound in ${f(t)|_{t=a}} = f(a)$, in ${f(t)|_{t=a}^b} = f(b) - f(a)$, and in $\int_{t=a}^b f(t) \,\mathrm{d}t$ (which has a more complicated $t$-free definition). I agree with Mike that $t$ is bound in $\int_2^x \ln(t^2+1)dt$, so you can\'t really differentiate it with respect to $t$, although you could naïvely say that it\'s $\ln(x^2+1)dx/dt$ as Mike suggested. On the other hand, $\int \ln(t^2+1)dt$ is fine; by definition, its differential is $\ln(t^2+1)dt$, so its derivative with respect to $t$ is $\ln(t^2+1)$. (You don\'t even need the <small>FTC</small> for this one.)
In the context of indefinite integrals, I read ‘ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo>∫</mo></mrow><annotation encoding="application/x-tex">\int</annotation></semantics></math>$ ’ as ‘antidifferential’, since $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ω</mi></mrow><annotation encoding="application/x-tex">\omega</annotation></semantics></math>$ is the differential of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo>∫</mo><mi>ω</mi></mrow><annotation encoding="application/x-tex">\int \omega</annotation></semantics></math>$ ; that is, $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo>∫</mo><mi>ω</mi></mrow><annotation encoding="application/x-tex">\int \omega</annotation></semantics></math>$ is an antidifferential of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ω</mi></mrow><annotation encoding="application/x-tex">\omega</annotation></semantics></math>$ . 
(Of course, when $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ω</mi><mo>=</mo><mi>y</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\omega = y \,\mathrm{d}x</annotation></semantics></math>$ , the derivative of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo>∫</mo><mi>y</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\int y \,\mathrm{d}x</annotation></semantics></math>$ with respect to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ is $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi></mrow><annotation encoding="application/x-tex">y</annotation></semantics></math>$ , so $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo>∫</mo><mi>y</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\int y \,\mathrm{d}x</annotation></semantics></math>$ is also an antiderivative of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi></mrow><annotation encoding="application/x-tex">y</annotation></semantics></math>$ with respect to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ , like they say in the book.) I tend to avoid the term ‘indefinite integral’; it’s bad enough that (almost) the same notation is used for two different concepts (definite and indefinite integrals), and I'd just as soon not use (almost) the same terminology as well.

I've never liked the idea that the antidifferential of a differential form (or whatever you want to call that) is a set of quantities; I try to say ‘an’ instead of ‘the’ as much as possible. If you just want one antidifferential, then (for example) $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo>∫</mo><msup><mi>x</mi> <mn>2</mn></msup><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mo>=</mo><msup><mi>x</mi> <mn>3</mn></msup><mo stretchy="false">/</mo><mn>3</mn></mrow><annotation encoding="application/x-tex">\int x^2 \,\mathrm{d}x = x^3/3</annotation></semantics></math>$ is OK; but if you want all of them, then you need $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo>∫</mo><msup><mi>x</mi> <mn>2</mn></msup><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mo>=</mo><msup><mi>x</mi> <mn>3</mn></msup><mo stretchy="false">/</mo><mn>3</mn><mo>+</mo><mi>C</mi></mrow><annotation encoding="application/x-tex">\int x^2 \,\mathrm{d}x = x^3/3 + C</annotation></semantics></math>$ . (I enforce the book's answers to its problems by saying that it's asking for all of them. And I say that it's only interested in quantities defined on a connected domain, so I don't have to deal with local constants.)

For definite integrals, I introduce the notation first as $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msubsup><mo>∫</mo> <mi>p</mi> <mi>q</mi></msubsup><mi>ω</mi></mrow><annotation encoding="application/x-tex">\int_p^q \omega</annotation></semantics></math>$ , where $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi></mrow><annotation encoding="application/x-tex">p</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>q</mi></mrow><annotation encoding="application/x-tex">q</annotation></semantics></math>$ are equations (preferably with unique solutions). Then $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msubsup><mo>∫</mo> <mrow><mi>x</mi><mo>=</mo><mi>a</mi></mrow> <mi>b</mi></msubsup><mi>ω</mi></mrow><annotation encoding="application/x-tex">\int_{x = a}^b \omega</annotation></semantics></math>$ is an abbreviation for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msubsup><mo>∫</mo> <mrow><mi>x</mi><mo>=</mo><mi>a</mi></mrow> <mrow><mi>x</mi><mo>=</mo><mi>b</mi></mrow></msubsup><mi>ω</mi></mrow><annotation encoding="application/x-tex">\int_{x = a}^{x = b} \omega</annotation></semantics></math>$ (as Michael wrote) when the left-hand sides are the same; finally, $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msubsup><mo>∫</mo> <mi>a</mi> <mi>b</mi></msubsup><mi>y</mi><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\int_a^b y \,\mathrm{d}x</annotation></semantics></math>$ is an abbreviation for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msubsup><mo>∫</mo> <mrow><mi>x</mi><mo>=</mo><mi>a</mi></mrow> <mi>b</mi></msubsup><mi>y</mi><mspace width="0.16667em"/><mi 
mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\int_{x=a}^b y \,\mathrm{d}x</annotation></semantics></math>$ when only one variable's differential appears in the expression for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ω</mi></mrow><annotation encoding="application/x-tex">\omega</annotation></semantics></math>$ . That's mostly how they look, but I encourage them to use a longer form when doing integration by substitution, for the reasons that Mike gives. (This can violate the requirement that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi></mrow><annotation encoding="application/x-tex">p</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>q</mi></mrow><annotation encoding="application/x-tex">q</annotation></semantics></math>$ have unique solutions. It's sufficient that the result of the integral be the same for any choice of solution, or at least for any choice where the solutions of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>p</mi></mrow><annotation encoding="application/x-tex">p</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>q</mi></mrow><annotation encoding="application/x-tex">q</annotation></semantics></math>$ are connected.)

The Fundamental Theorem of Calculus has two parts, which are inconsistently numbered. By the numbering in our textbook (which is the way that I learnt it):
1. $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mo stretchy="false">(</mo><msubsup><mo>∫</mo> <mi>p</mi> <mi>q</mi></msubsup><mi>ω</mi><mo stretchy="false">)</mo><mo>=</mo><mrow><mi>ω</mi><msubsup><mo stretchy="false">|</mo> <mi>p</mi> <mi>q</mi></msubsup></mrow></mrow><annotation encoding="application/x-tex">\mathrm{d}(\int_p^q \omega) = {\omega|_p^q}</annotation></semantics></math>$ ,
2. $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msubsup><mo>∫</mo> <mi>p</mi> <mi>q</mi></msubsup><mi mathvariant="normal">d</mi><mi>u</mi><mo>=</mo><mrow><mi>u</mi><msubsup><mo stretchy="false">|</mo> <mi>p</mi> <mi>q</mi></msubsup></mrow></mrow><annotation encoding="application/x-tex">\int_p^q \mathrm{d}u = {u|_p^q}</annotation></semantics></math>$ .
Since this a theorem and needs fine print (about things being continuous and the like), I state and prove these first in function notation like the book does, but I bring up these forms eventually.

The variable $t$ is definitely free in both $\mathrm{d}f(t)/\mathrm{d}t$ and $\int f(t) \,\mathrm{d}t$, no ‘sort of’ about it. It's bound in ${f(t)|_{t=a}} = f(a)$, in ${f(t)|_{t=a}^b} = f(b) - f(a)$, and in $\int_{t=a}^b f(t) \,\mathrm{d}t$ (which has a more complicated $t$-free definition). I agree with Mike that $t$ is bound in $\int_2^x \ln(t^2+1)dt$, so you can't really differentiate it with respect to $t$, although you could naïvely say that it's $\ln(x^2+1)dx/dt$ as Mike suggested. On the other hand, $\int \ln(t^2+1)dt$ is fine; by definition, its differential is $\ln(t^2+1)dt$, so its derivative with respect to $t$ is $\ln(t^2+1)$. (You don't even need the FTC for this one.)
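
To spell out the naïve reading, here is a sketch under two assumptions that are not in the thread: the bound variable is renamed to $s$ to avoid the clash, and $x$ is taken to be a differentiable quantity that may vary with $t$. Then the FTC and the chain rule give

$$ \frac{\mathrm{d}}{\mathrm{d}t} \int_2^x \ln(s^2+1) \,\mathrm{d}s = \ln(x^2+1) \,\frac{\mathrm{d}x}{\mathrm{d}t} $$

With the renaming, the only remaining dependence on $t$ is through $x$, which is where the factor $\mathrm{d}x/\mathrm{d}t$ comes from.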
- CommentRowNumber50.
- CommentAuthorTobyBartels
- CommentTimeNov 25th 2013
- PermaLink
Author: TobyBartels
> there are at least two popular interpretations for “variables” in calculus: one in the sense of “dummy variables” or placeholders for numbers, and one in the sense of “variable quantity” or maybe morphism in a suitable category

These two senses can both be incorporated into categorial logic. In the case of an expression like ${x^2|_{x=1}^2}$ (which is an abbreviation of ${x^2|_{x=1}^{x=2}}$ and is usually further abbreviated as ${x^2|_1^2}$), we start with a real-valued quantity $x$ in some context $\Gamma$ (formally a morphism $x\colon \Gamma \to \mathbb{R}$). The equations $x = 1$ and $x = 2$ specify certain extensions of $\Gamma$, categorially constructed as equalizers (as Mike suggested). Call these extensions ${\Gamma|_{x=1}}$ and ${\Gamma|_{x=2}}$ respectively; then if $u$ is any real-valued quantity in the context $\Gamma$, ${u|_{x=1}^2}$ is a real-valued quantity whose context is the product ${\Gamma|_{x=1}} \times {\Gamma|_{x=2}}$. (You should be able to draw this using arrow-theoretic diagrams, making use of the subtraction operation $\mathbb{R} \times \mathbb{R} \to \mathbb{R}$.) If it should so happen that ${\Gamma|_{x=1}}$ and ${\Gamma|_{x=2}}$ are points (terminal objects), then ${u|_{x=1}^2}$ is simply a real number.
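
For anyone who wants to poke at this concretely, here is a minimal Python sketch of the construction in the category of finite sets. Everything in it is an illustrative assumption rather than part of the discussion: the context `GAMMA` is replaced by a short list of sample states, and the names `restrict` and `eval_bar` are made up for the occasion.

```python
from itertools import product

# Hypothetical finite stand-in for the context Gamma: a few sample states.
# Each state is a real number here, and x is the identity quantity on it.
GAMMA = [0.0, 1.0, 2.0, 3.0]


def x(state):
    """The real-valued quantity x: Gamma -> R."""
    return state


def restrict(context, quantity, value):
    """States of `context` where `quantity` equals `value` (an equalizer in Set)."""
    return [s for s in context if quantity(s) == value]


def eval_bar(u, context, quantity, a, b):
    """Model u|_{quantity=a}^{b} as a function on the product of the two restrictions."""
    lower = restrict(context, quantity, a)
    upper = restrict(context, quantity, b)
    # Subtraction R x R -> R applied to (u at the upper state, u at the lower state).
    return {(lo, hi): u(hi) - u(lo) for lo, hi in product(lower, upper)}


def x_squared(state):
    """The quantity x^2 in the same context."""
    return x(state) ** 2


result = eval_bar(x_squared, GAMMA, x, 1.0, 2.0)
print(result)
```

Both restrictions are single points in this toy context, so `result` has exactly one entry, and its value is the real number $3$. With a context in which several states satisfy $x = 1$, the same function would return one value per pair of states, which is the sense in which the result lives over the product.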

In the case of ${x^2|_{x=1}^2}$, if this appears as a problem in a textbook without any further context, the default interpretation is supposed to be that $\Gamma$ is the largest subset of $\mathbb{R}$ on which $(x \mapsto x^2)$ is defined, in this case the entire real line $\mathbb{R}$; then ${\Gamma|_{x=1}}$ and ${\Gamma|_{x=2}}$ are indeed points, and so ${x^2|_{x=1}^2}$ is indeed a real number (as it happens, $3$). In the context of a word problem where $x$ stands for an inherently positive quantity, it would be more appropriate to take $\Gamma$ to be ${]0,\infty[}$ instead.¹ But in such problems, I think it even more natural to take $\Gamma$ to be an abstract space, which I think of as the space of possible states of the situation described in the problem. While $\Gamma$ might never be fully defined, various properties of it may be justified as needed on the basis of the intuition behind the problem. The textbooks, by encouraging us to put everything in the problem in terms of a single variable (such as $x$), effectively ask us to find that this variable mediates an isomorphism between $\Gamma$ and some subspace of $\mathbb{R}$ (such as ${]0,\infty[}$); this specifies $\Gamma$ up to specified isomorphism, so no further intuition is needed. But many problems are easier to solve without expressing everything in terms of one variable, and I encourage my students to take a more flexible approach (especially to things like related rates and optimization problems). This just requires them to be a little more careful about keeping track of the context.

> I suspect that the thing going on here and elsewhere in the “variable quantity” perspective is that an equality like $t=x$ is interpreted in a way more commonly seen in probability/statistics as in $\{x=t \}$ denoting “the set of all states where the random variables $x$ and $t$ assume the same value.”

Yes, precisely, and this is an equalizer. In general, I'd say that the probability/statistics people have a good handle on this stuff; they know what a random variable really is, after all, and the rest of us just need to learn that all of our variables are much the same sort of thing.

> how does one formalize categorically the notion of two quantities (arrows) being “independent” or “constant” with respect to each other?

Like Mike, I don't think that this is really a sensible notion without specifying what the other independent variables are supposed to be. Rather, what should be formalized is the idea that one quantity is *determined* by another. Working in the context $\Gamma$, a $T$-valued quantity $x$ is __determined__ by a $U$-valued quantity $y$ if there exists a morphism $f\colon U \to T$ such that $x = f \circ y$. (This definition appears as one of the fundamental concepts in Lawvere & Schanuel's _Conceptual Mathematics_.)
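
As a rough illustration of this definition, here is a Python sketch for a finite context in the category of sets, where a suitable $f$ exists exactly when $x$ takes a single value on each set of states on which $y$ is constant. The function name `determining_map` and the sample quantities are assumptions made up for this example, and the table it returns only covers the values that $y$ actually takes (extending it to the rest of $U$ is arbitrary).

```python
def determining_map(context, x, y):
    """Return a dict f with x(s) == f[y(s)] for every state s, or None if x is not determined by y."""
    f = {}
    for s in context:
        key, val = y(s), x(s)
        # setdefault stores val on first sight of key, otherwise returns the stored value.
        if f.setdefault(key, val) != val:
            return None  # two states agree on y but disagree on x
    return f


# Hypothetical toy context: three states, labelled by a signed position.
STATES = [-1, 0, 1]


def position(s):
    return s


def squared(s):
    return s * s


# squared is determined by position (f is the squaring map on the values seen).
squared_from_position = determining_map(STATES, squared, position)
# position is not determined by squared: the states -1 and 1 have the same square.
position_from_squared = determining_map(STATES, position, squared)
print(squared_from_position, position_from_squared)
```

The asymmetry between the two calls is the point: being determined is a relation with a direction, unlike the vaguer idea of two quantities being "independent" of each other.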

> What does the “set of states of the world” (also mentioned by Toby) correspond to categorically? Some classifying object?

Sure, although actually it's a coclassifying object. So, while a principal $G$-bundle on $S$ (for $G$ some topological group and $S$ some topological space) is the same as a continuous map *from* $S$ to the classifying space $B G$, so an $S$-valued smooth quantity (for $S$ some smooth space) in a given context $\Gamma$ is the same as a smooth map *to* $S$ from a coclassifying space (which I've been calling simply $\Gamma$ again). So $\Gamma$ is the coclassifying space for the quantities in the problem.
1. Of course, I write this as $(0,\infty)$ in class, in deference to the textbooks, but I prefer the less overloaded notation ${]0,\infty[}$.
- CommentRowNumber51.
- CommentAuthorTobyBartels
- CommentTimeNov 25th 2013
- PermaLink
Author: TobyBartels

> Lawvere & Schanuel's _Conceptual Mathematics_

One of my Calculus students came upon this very thread the other day and asked for reading material that would give him some idea of what we were talking about, and I recommended Lawvere & Schanuel. In my opinion, a course using this book should be the first college-level math course that every student takes. Algebra is a prerequisite for it, but not Calculus, so it should come before Calculus. (A bonus is that the practice of requiring Calculus as a prerequisite for unrelated courses such as linear algebra or discrete mathematics, intended to guarantee a level of mathematical maturity, would be served by requiring the course in conceptual mathematics, which is more important to know anyway.)

Of course, first the math *teachers* have to learn this stuff!
- CommentRowNumber52.
- CommentAuthorMike Shulman
- CommentTimeNov 25th 2013
- PermaLink
Author: Mike Shulman

Thanks Toby! How do you define the general form $\int_a^b \omega$ with $a$ and $b$ equations?

I’m also curious whether you’ve ever tried teaching a course out of Lawvere & Schanuel?
- CommentRowNumber53.
- CommentAuthorMike Shulman
- CommentTimeNov 26th 2013
- PermaLink
Author: Mike Shulman

Another question for you, Toby, though not closely related to the subject of this thread. In emphasizing differentials more this semester than before, I’ve found that a lot of my students mix up derivatives and differentials. E.g. they will write things like $f'(x) = 2x \, dx$. Do you have any tricks for alleviating or preventing this confusion?
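
For reference, assuming the function behind that example is $f(x) = x^2$ with $y = f(x)$ (the example itself does not say), the two objects being conflated are

$$ f'(x) = \frac{\mathrm{d}y}{\mathrm{d}x} = 2x \qquad \text{versus} \qquad \mathrm{d}y = f'(x) \,\mathrm{d}x = 2x \,\mathrm{d}x $$

so the derivative is the coefficient of $\mathrm{d}x$ in the differential, and the student's equation puts a differential on one side and a derivative on the other.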
- CommentRowNumber54.
- CommentAuthorMike Shulman
- CommentTimeNov 26th 2013
- PermaLink
Author: Mike Shulman
I just noticed that [Sage](http://www.sagemath.org)’s calculus functions use a notion of [symbolic variable](http://www.sagemath.org/doc/reference/calculus/sage/calculus/calculus.html) which seems quite similar to the “variable quantities” under discussion here. The documentation’s description of them as “elements of the symbolic expression ring” suggests that they have a different mathematical formalization in mind, although I haven’t figured out exactly what that means. But their behavior seems quite similar to what we’ve been talking about, e.g. once you declare a symbolic variable $x$, you can then write $y = x^2+1$ and differentiate $y$ with respect to $x$:
```
var('x')
y = x^2+1
y.derivative(x)
```
gives $2x$. Although it will also try to guess the variable to differentiate with respect to if you don’t give it one:
```
y.derivative()
```
also gives $2x$. Sage also seems to assume that all variables are constant with respect to each other:
```
var('t')
y = x^2 + t^2
y.derivative(x)
```
also gives $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>2</mn><mi>x</mi></mrow><annotation encoding="application/x-tex">2x</annotation></semantics></math>$ . Although you can declare one “variable” to be instead a function of the other:
```
t = function('t',x)
w = x^2 + t^2
w.derivative(x)
```
uses the chain rule to give `2*t(x)*D[0](t)(x) + 2*x`. Finally, a symbolic expression like these $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>y</mi></mrow><annotation encoding="application/x-tex">y</annotation></semantics></math>$ s can’t be evaluated like a function, or at least trying to do so
```
y(3)
```
gives a `DeprecationWarning`. But you can make it into a “callable symbolic expression” by designating an order of the variables occurring in it:
```
z = y.function(x,t)
z(3,8)
```
I wonder if this would be a good sort of convention to adopt in a calculus class as well, especially one that involves learning to use Sage.
- CommentRowNumber55.
- CommentAuthorTobyBartels
- CommentTimeNov 27th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItex>How do you define the general form $\int_a^b \omega$ with $a$ and $b$ equations? Now I feel like I ought to think about pulling $\omega$ back to the solution subspace of those equations, but I really only define it for equations with unique solutions on a simply-connected $1$-dimensional domain, that is expressions that can be reduced to $\int_{x=a}^b f(x) \,\mathrm{d}x$, which I define (following the textbook) as a Riemann integral (although sometimes I feel like I ought to do a Henstock integral). This is an approach that already does not generalize to complex variables, of course; in the multivariable class, I talk about oriented curves and all that.

> How do you define the general form $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msubsup><mo>∫</mo> <mi>a</mi> <mi>b</mi></msubsup><mi>ω</mi></mrow><annotation encoding="application/x-tex">\int_a^b \omega</annotation></semantics></math>$ with $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>a</mi></mrow><annotation encoding="application/x-tex">a</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>b</mi></mrow><annotation encoding="application/x-tex">b</annotation></semantics></math>$ equations?

Now I feel like I ought to think about pulling $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ω</mi></mrow><annotation encoding="application/x-tex">\omega</annotation></semantics></math>$ back to the solution subspace of those equations, but I really only define it for equations with unique solutions on a simply-connected $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ -dimensional domain, that is expressions that can be reduced to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msubsup><mo>∫</mo> <mrow><mi>x</mi><mo>=</mo><mi>a</mi></mrow> <mi>b</mi></msubsup><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">\int_{x=a}^b f(x) \,\mathrm{d}x</annotation></semantics></math>$ , which I define (following the textbook) as a Riemann integral (although sometimes I feel like I ought to do a Henstock integral). This is an approach that already does not generalize to complex variables, of course; in the multivariable class, I talk about oriented curves and all that.
- CommentRowNumber56.
- CommentAuthorTobyBartels
- CommentTimeNov 27th 2013
- (edited Nov 27th 2013)
- PermaLink
Author: TobyBartels
Format: MarkdownItex>Do you have any tricks for alleviating or preventing this confusion? Not ones that work! Mind you, there are plenty of analogous mistakes *without* differentials. My goal is that they only make mistakes like this that don't make their final answer wrong. ETA: So for example, if they put in too many differentials, then they might write this: $$ f(x) = \ln(3x+1) $$ $$ f'(x) = \frac{\mathrm{d}(3x+1)}{3x+1} $$ $$ f'(x) = \frac{3\,\mathrm{d}x+0}{3x+1} $$ $$ f'(x) = \frac3{3x+1} ;$$ the middle lines are wrong, but the last is correct (given the first). But if they put in too few differentials, then they might write this: $$ x^5 + y^5 = x + y $$ $$ 5x^4 + 5y^4 = 1 + y' $$ $$ y' = 5x^4 + 5y^4 - 1 ;$$ now everything is completely wrong (after the first line). The latter is a fairly standard Calculus-class error, which using differentials helps to avoid; I much prefer the former error.

> Do you have any tricks for alleviating or preventing this confusion?

Not ones that work!

Mind you, there are plenty of analogous mistakes without differentials. My goal is that they only make mistakes like this that don't make their final answer wrong.

ETA: So for example, if they put in too many differentials, then they might write this:
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>ln</mi><mo stretchy="false">(</mo><mn>3</mn><mi>x</mi><mo>+</mo><mn>1</mn><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex"> f(x) = \ln(3x+1) </annotation></semantics></math>$ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mfrac><mrow><mi mathvariant="normal">d</mi><mo stretchy="false">(</mo><mn>3</mn><mi>x</mi><mo>+</mo><mn>1</mn><mo stretchy="false">)</mo></mrow><mrow><mn>3</mn><mi>x</mi><mo>+</mo><mn>1</mn></mrow></mfrac></mrow><annotation encoding="application/x-tex"> f'(x) = \frac{\mathrm{d}(3x+1)}{3x+1} </annotation></semantics></math>$ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mfrac><mrow><mn>3</mn><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi><mo>+</mo><mn>0</mn></mrow><mrow><mn>3</mn><mi>x</mi><mo>+</mo><mn>1</mn></mrow></mfrac></mrow><annotation encoding="application/x-tex"> f'(x) = \frac{3\,\mathrm{d}x+0}{3x+1} </annotation></semantics></math>$ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mfrac><mn>3</mn><mrow><mn>3</mn><mi>x</mi><mo>+</mo><mn>1</mn></mrow></mfrac><mo>;</mo></mrow><annotation encoding="application/x-tex"> f'(x) = \frac3{3x+1} ;</annotation></semantics></math>$
the middle lines are wrong, but the last is correct (given the first).

But if they put in too few differentials, then they might write this:
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><msup><mi>x</mi> <mn>5</mn></msup><mo>+</mo><msup><mi>y</mi> <mn>5</mn></msup><mo>=</mo><mi>x</mi><mo>+</mo><mi>y</mi></mrow><annotation encoding="application/x-tex"> x^5 + y^5 = x + y </annotation></semantics></math>$ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mn>5</mn><msup><mi>x</mi> <mn>4</mn></msup><mo>+</mo><mn>5</mn><msup><mi>y</mi> <mn>4</mn></msup><mo>=</mo><mn>1</mn><mo>+</mo><mi>y</mi><mo>′</mo></mrow><annotation encoding="application/x-tex"> 5x^4 + 5y^4 = 1 + y' </annotation></semantics></math>$ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>y</mi><mo>′</mo><mo>=</mo><mn>5</mn><msup><mi>x</mi> <mn>4</mn></msup><mo>+</mo><mn>5</mn><msup><mi>y</mi> <mn>4</mn></msup><mo>−</mo><mn>1</mn><mo>;</mo></mrow><annotation encoding="application/x-tex"> y' = 5x^4 + 5y^4 - 1 ;</annotation></semantics></math>$
now everything is completely wrong (after the first line).

The latter is a fairly standard Calculus-class error, which using differentials helps to avoid; I much prefer the former error.
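
For comparison, here is a sketch of how the second computation might go with the differentials kept in. It is just the standard implicit differentiation, written with $\mathrm{d}$ as elsewhere in this thread, and the last step assumes $5y^4 \ne 1$:

$$ x^5 + y^5 = x + y $$
$$ 5x^4\,\mathrm{d}x + 5y^4\,\mathrm{d}y = \mathrm{d}x + \mathrm{d}y $$
$$ (5y^4 - 1)\,\mathrm{d}y = (1 - 5x^4)\,\mathrm{d}x $$
$$ \frac{\mathrm{d}y}{\mathrm{d}x} = \frac{1 - 5x^4}{5y^4 - 1} $$

so the correct $y'$ is a quotient, not the sum $5x^4 + 5y^4 - 1$ obtained above.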
- CommentRowNumber57.
- CommentAuthorMichael_Bachtold
- CommentTimeNov 27th 2013
- PermaLink
Author: Michael_Bachtold
Format: MarkdownItex@Mike 54: nice idea to look at how people have implemented these things in software. Just a quick question for clarification: >I wonder if this would be a good sort of convention to adopt in a calculus class as well, especially one that involves learning to use Sage. Do you mean the convention of distinguishing between "symbolic variables" and "callable symbolic expressions"? If yes, it looks to me (at first sight) that these two notions correspond to our distinction between "variable quantities" (maps with unspecified domain) and "functions" whose domains are subsets of $\mathbb{R}^n$. In the classical notation it might be the difference between writing $f=x^2+1$ and $f(x)=x^2+1$. In the first case $f$ would be a variable quantity, in the second case $f$ is a function from $\mathbb{R}$ to itself. Would you agree?

@Mike 54: nice idea to look at how people have implemented these things in software. Just a quick question for clarification:

> I wonder if this would be a good sort of convention to adopt in a calculus class as well, especially one that involves learning to use Sage.

Do you mean the convention of distinguishing between “symbolic variables” and “callable symbolic expressions”?

If yes, it looks to me (at first sight) that these two notions correspond to our distinction between “variable quantities” (maps with unspecified domain) and “functions” whose domains are subsets of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>ℝ</mi> <mi>n</mi></msup></mrow><annotation encoding="application/x-tex">\mathbb{R}^n</annotation></semantics></math>$ . In the classical notation it might be the difference between writing $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>=</mo><msup><mi>x</mi> <mn>2</mn></msup><mo>+</mo><mn>1</mn></mrow><annotation encoding="application/x-tex">f=x^2+1</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><msup><mi>x</mi> <mn>2</mn></msup><mo>+</mo><mn>1</mn></mrow><annotation encoding="application/x-tex">f(x)=x^2+1</annotation></semantics></math>$ . In the first case $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ would be a variable quantity, in the second case $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ is a function from $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℝ</mi></mrow><annotation encoding="application/x-tex">\mathbb{R}</annotation></semantics></math>$ to itself. Would you agree?
- CommentRowNumber58.
- CommentAuthorMike Shulman
- CommentTimeNov 27th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItex> Do you mean the convention of distinguishing between “symbolic variables” and “callable symbolic expressions”? I guess that's mostly what I meant. As I said in #54, Sage's "symbolic variables" do seem to correspond to our "variable quantities", but I think its "callable symbolic expressions" are not quite the mathematician's functions, because they still remember the names of their variables. E.g. f(x) = x^2 (another way to define a callable symbolic expression) f(3) ===> 9 f(x=3) ===> 9 f(y=3) ===> x^2 So I guess I was wondering whether it would be worth discussing with calculus students the idea of a "function that knows the name of its arguments".
> Do you mean the convention of distinguishing between “symbolic variables” and “callable symbolic expressions”?

I guess that’s mostly what I meant. As I said in #54, Sage’s “symbolic variables” do seem to correspond to our “variable quantities”, but I think its “callable symbolic expressions” are not quite the mathematician’s functions, because they still remember the names of their variables. E.g.
```
f(x) = x^2
```
(another way to define a callable symbolic expression)
```
f(3)      ===>  9
f(x=3)    ===>  9
f(y=3)    ===>  x^2
```
So I guess I was wondering whether it would be worth discussing with calculus students the idea of a “function that knows the name of its arguments”.
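
To make the idea concrete outside Sage, here is a minimal sketch in plain Python of a "function that knows the name of its argument". The class `NamedFunction` and its fields are hypothetical names invented for this illustration; it only imitates the three behaviours shown above and is not how Sage implements callable symbolic expressions.

```python
# Hypothetical illustration (not Sage's implementation): a one-argument
# function object that remembers the name of its input variable.

class NamedFunction:
    def __init__(self, arg_name, body, text):
        self.arg_name = arg_name  # remembered variable name, e.g. "x"
        self.body = body          # ordinary Python callable doing the work
        self.text = text          # printable form of the expression

    def __call__(self, *args, **kwargs):
        if args:
            # positional call: the name plays no role
            return self.body(args[0])
        if self.arg_name in kwargs:
            # keyword call using the remembered name
            return self.body(kwargs[self.arg_name])
        # substituting for a name that does not occur changes nothing
        return self

    def __repr__(self):
        return self.text


f = NamedFunction("x", lambda x: x**2, "x^2")
print(f(3))    # 9
print(f(x=3))  # 9
print(f(y=3))  # x^2
```

A mathematician's function would only support the first call; the other two work (or quietly do nothing) precisely because the object carries the name `x` around with it.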
- CommentRowNumber59.
- CommentAuthorzskoda
- CommentTimeNov 27th 2013
- (edited Nov 27th 2013)
- PermaLink
Author: zskoda
Format: MarkdownItexMike 43 > Instead I'm saying, let's fix some particular "scale" infinitesimal $\epsilon$; then a first-order infinitesimal is one $\eta$ such that $\eta/\epsilon$ is finite ("limited"), a second-order one is such that $\eta/\epsilon^2$ is finite, etc. Well, even more: in the ultrafilter model, one looks at sequences with some limiting behaviour, and the integer power law in comparing asymptotic infinitesimals is not the only possibility. You can have exponentially small ones, e.g. an $\eta$ such that, say, $\frac{\eta}{\epsilon^{3/2} \exp(-1/\epsilon^2)}$ is finite. I hope you agree. (Sorry for bringing up an issue that is already old in the thread.)

Mike 43

> Instead I’m saying, let’s fix some particular “scale” infinitesimal $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ε</mi></mrow><annotation encoding="application/x-tex">\epsilon</annotation></semantics></math>$ ; then a first-order infinitesimal is one $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>η</mi></mrow><annotation encoding="application/x-tex">\eta</annotation></semantics></math>$ such that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>η</mi><mo stretchy="false">/</mo><mi>ε</mi></mrow><annotation encoding="application/x-tex">\eta/\epsilon</annotation></semantics></math>$ is finite (“limited”), a second-order one is such that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>η</mi><mo stretchy="false">/</mo><msup><mi>ε</mi> <mn>2</mn></msup></mrow><annotation encoding="application/x-tex">\eta/\epsilon^2</annotation></semantics></math>$ is finite, etc.

Well, even more: in the ultrafilter model, one looks at sequences with some limiting behaviour, and the integer power law in comparing asymptotic infinitesimals is not the only possibility. You can have exponentially small ones, e.g. an $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>η</mi></mrow><annotation encoding="application/x-tex">\eta</annotation></semantics></math>$ such that, say, $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mfrac><mi>η</mi><mrow><msup><mi>ε</mi> <mrow><mn>3</mn><mo stretchy="false">/</mo><mn>2</mn></mrow></msup><mi>exp</mi><mo stretchy="false">(</mo><mo lspace="0.11111em" rspace="0em">−</mo><mn>1</mn><mo stretchy="false">/</mo><msup><mi>ε</mi> <mn>2</mn></msup><mo stretchy="false">)</mo></mrow></mfrac></mrow><annotation encoding="application/x-tex">\frac{\eta}{\epsilon^{3/2} \exp(-1/\epsilon^2)}</annotation></semantics></math>$ is finite. I hope you agree. (Sorry for bringing up an issue that is already old in the thread.)
- CommentRowNumber60.
- CommentAuthorMike Shulman
- CommentTimeNov 27th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItex@Zoran: Yes, of course. That's not even particular to an ultrafilter model, e.g. $\sqrt{\epsilon}$ is still infinitesimal, but "less than first order". But the integer power law is the relevant one for defining derivatives and higher derivatives.

@Zoran: Yes, of course. That’s not even particular to an ultrafilter model, e.g. $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msqrt><mi>ε</mi></msqrt></mrow><annotation encoding="application/x-tex">\sqrt{\epsilon}</annotation></semantics></math>$ is still infinitesimal, but “less than first order”. But the integer power law is the relevant one for defining derivatives and higher derivatives.
- CommentRowNumber61.
- CommentAuthorzskoda
- CommentTimeNov 27th 2013
- (edited Nov 27th 2013)
- PermaLink
Author: zskoda
Format: MarkdownItexSurely, Mike; I was not considering the issue critical for your calculus discussion, but rather for the intuition/image that people who know other approaches, primarily SDG, form about [[nonstandard analysis]].

Surely, Mike; I was not considering the issue critical for your calculus discussion, but rather for the intuition/image that people who know other approaches, primarily SDG, form about nonstandard analysis.
- CommentRowNumber62.
- CommentAuthorMichael_Bachtold
- CommentTimeNov 28th 2013
- PermaLink
Author: Michael_Bachtold
Format: MarkdownItex@Toby #56: why would you say that there are too many differentials in the first computation? If we add one more $dx$ (for example multiplying on the left) it seems correct. To understand the confusion of students it would be interesting to understand what the student was thinking when doing that computation.

@Toby #56: why would you say that there are too many differentials in the first computation? If we add one more $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>dx</mi></mrow><annotation encoding="application/x-tex">dx</annotation></semantics></math>$ (for example multiplying on the left) it seems correct. To understand the confusion of students it would be interesting to understand what the student was thinking when doing that computation.
- CommentRowNumber63.
- CommentAuthorTobyBartels
- CommentTimeNov 29th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItexSure, too many on the right or too few on the left. I'm basically taking the left-hand side (the simpler one and the one first written down) as indicating what the student meant to do and judging correctness or incorrectness based on that. (But when correcting a paper, I might well amend the left-hand side instead, if that's the simpler fix.)

Sure, too many on the right or too few on the left. I'm basically taking the left-hand side (the simpler one and the one first written down) as indicating what the student meant to do and judging correctness or incorrectness based on that. (But when correcting a paper, I might well amend the left-hand side instead, if that's the simpler fix.)
- CommentRowNumber64.
- CommentAuthorMike Shulman
- CommentTimeNov 29th 2013
- PermaLink
Author: Mike Shulman
Format: MarkdownItexIt's not clear to me that when students make mistakes like this they are thinking *anything*, in the sense that we would mean the word. Rather, they just don't seem to have the same understanding we do that mathematical words and symbols have precise meanings and have to be used correctly.

It’s not clear to me that when students make mistakes like this they are thinking anything, in the sense that we would mean the word. Rather, they just don’t seem to have the same understanding we do that mathematical words and symbols have precise meanings and have to be used correctly.
- CommentRowNumber65.
- CommentAuthorTobyBartels
- CommentTimeNov 30th 2013
- (edited Dec 10th 2013)
- PermaLink
Author: TobyBartels
Format: MarkdownItexYeah, I wouldn't want to defend the thesis that the left-hand side indicates what the student intended in any seriously discriminatory way; I mean, I wouldn't want to assume that the student is thinking clearly enough to discriminate between intending $f'(x)$, intending $f'(x) \,\mathrm{d}x$, or intending $\mathrm{d}f(x)$ (the latter two being equal, of course, but maybe not trivially so even to a student who is thinking clearly). I just mean that if I have to pick some way to classify the error (as too many differentials or as too few, in this case), then that's the criterion that I'll use.

Yeah, I wouldn't want to defend the thesis that the left-hand side indicates what the student intended in any seriously discriminatory way; I mean, I wouldn't want to assume that the student is thinking clearly enough to discriminate between intending $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f'(x)</annotation></semantics></math>$ , intending $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mspace width="0.16667em"/><mi mathvariant="normal">d</mi><mi>x</mi></mrow><annotation encoding="application/x-tex">f'(x) \,\mathrm{d}x</annotation></semantics></math>$ , or intending $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">\mathrm{d}f(x)</annotation></semantics></math>$ (the latter two being equal, of course, but maybe not trivially so even to a student who is thinking clearly). I just mean that if I have to pick some way to classify the error (as too many differentials or as too few, in this case), then that's the criterion that I'll use.
- CommentRowNumber66.
- CommentAuthorTobyBartels
- CommentTimeDec 10th 2013
- PermaLink
Author: TobyBartels
Format: MarkdownItex>I’m also curious whether you’ve ever tried teaching a course out of Lawvere & Schanuel? No. It might not work very well for the students that we get either; it would need a massive illustrated, hand-holding, problem-filled expansion.

> I’m also curious whether you’ve ever tried teaching a course out of Lawvere & Schanuel?

No. It might not work very well for the students that we get either; it would need a massive illustrated, hand-holding, problem-filled expansion.
- CommentRowNumber67.
- CommentAuthorTobyBartels
- CommentTimeFeb 4th 2014
- PermaLink
Author: TobyBartels
Format: MarkdownItexOn second derivatives and second differentials … John Armstrong was considering them in 2009 in [two](http://unapologetic.wordpress.com/2009/10/16/higher-order-differentials/) [posts](http://unapologetic.wordpress.com/2009/10/19/higher-differentials-and-composite-functions/) that unfortunately attracted no comments.

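On second derivatives and second differentials … John Armstrong was considering them in 2009 in [two](http://unapologetic.wordpress.com/2009/10/16/higher-order-differentials/) [posts](http://unapologetic.wordpress.com/2009/10/19/higher-differentials-and-composite-functions/) that unfortunately attracted no comments.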
- CommentRowNumber68.
- CommentAuthorMike Shulman
- CommentTimeFeb 8th 2014
- PermaLink
Author: Mike Shulman
Format: MarkdownItexRegarding antidifferentials (#44-49), what about introducing a new notation for "equality up to a local constant"? Since an equation like $\int x^2 dx = \frac{1}{3} x^3 + C$ is not an "equation involving a variable $x$" in the same sense as $(x+1)^2 = x^2+2x+1$ anyway (you can't substitute $x=3$ into it to get anything meaningful), it has to be regarded as an "equation between variable quantities", and then we can change the sense of "equal" as well. Say that if $u$ and $v$ are variable quantities, then $u\equiv v$ means that $u$ and $v$ have the same domain, and on every connected subset of that domain there is a constant $C$ such that $u=v+C$ on that subset (or some simpler version of this statement that would be easier to understand). Then we could write $$ \int x^2 dx \equiv \frac{1}{3} x^3 $$ and even $$ \int \frac{1}{x} dx \equiv \ln |x|.$$

Regarding antidifferentials (#44-49), what about introducing a new notation for “equality up to a local constant”? Since an equation like $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo>∫</mo><msup><mi>x</mi> <mn>2</mn></msup><mi>dx</mi><mo>=</mo><mfrac><mn>1</mn><mn>3</mn></mfrac><msup><mi>x</mi> <mn>3</mn></msup><mo>+</mo><mi>C</mi></mrow><annotation encoding="application/x-tex">\int x^2 dx = \frac{1}{3} x^3 + C</annotation></semantics></math>$ is not an “equation involving a variable $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ ” in the same sense as $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi>x</mi><mo>+</mo><mn>1</mn><msup><mo stretchy="false">)</mo> <mn>2</mn></msup><mo>=</mo><msup><mi>x</mi> <mn>2</mn></msup><mo>+</mo><mn>2</mn><mi>x</mi><mo>+</mo><mn>1</mn></mrow><annotation encoding="application/x-tex">(x+1)^2 = x^2+2x+1</annotation></semantics></math>$ anyway (you can’t substitute $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>=</mo><mn>3</mn></mrow><annotation encoding="application/x-tex">x=3</annotation></semantics></math>$ into it to get anything meaningful), it has to be regarded as an “equation between variable quantities”, and then we can change the sense of “equal” as well.
Say that if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi></mrow><annotation encoding="application/x-tex">u</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>v</mi></mrow><annotation encoding="application/x-tex">v</annotation></semantics></math>$ are variable quantities, then $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi><mo>≡</mo><mi>v</mi></mrow><annotation encoding="application/x-tex">u\equiv v</annotation></semantics></math>$ means that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi></mrow><annotation encoding="application/x-tex">u</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>v</mi></mrow><annotation encoding="application/x-tex">v</annotation></semantics></math>$ have the same domain, and on every connected subset of that domain there is a constant $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ such that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi><mo>=</mo><mi>v</mi><mo>+</mo><mi>C</mi></mrow><annotation encoding="application/x-tex">u=v+C</annotation></semantics></math>$ on that subset (or some simpler version of this statement that would be easier to understand). Then we could write
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mo>∫</mo><msup><mi>x</mi> <mn>2</mn></msup><mi>dx</mi><mo>≡</mo><mfrac><mn>1</mn><mn>3</mn></mfrac><msup><mi>x</mi> <mn>3</mn></msup></mrow><annotation encoding="application/x-tex"> \int x^2 dx \equiv \frac{1}{3} x^3 </annotation></semantics></math>$
and even
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mo>∫</mo><mfrac><mn>1</mn><mi>x</mi></mfrac><mi>dx</mi><mo>≡</mo><mi>ln</mi><mo stretchy="false">|</mo><mi>x</mi><mo stretchy="false">|</mo><mo>.</mo></mrow><annotation encoding="application/x-tex"> \int \frac{1}{x} dx \equiv \ln |x|.</annotation></semantics></math>$
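
As a small numerical illustration of why the constant has to be *local*, here is a sketch in plain Python. The quantity `v` is a made-up example (it is `ln|x|` shifted by a different constant on each side of 0, with the shifts 2 and -5 chosen arbitrarily); under the proposed definition it should satisfy $v \equiv \ln|x|$ even though no single constant $C$ works on the whole domain.

```python
import math

def u(x):
    # the quantity ln|x|, defined for x != 0
    return math.log(abs(x))

def v(x):
    # hypothetical example: ln|x| shifted by a different constant
    # on each connected component of the domain
    return math.log(abs(x)) + (2.0 if x > 0 else -5.0)

positive = [0.5, 1.0, 3.0]     # sample points in the component x > 0
negative = [-0.5, -1.0, -3.0]  # sample points in the component x < 0

# collect the values of v - u on each component (rounded to absorb float noise)
diffs_pos = {round(v(x) - u(x), 9) for x in positive}
diffs_neg = {round(v(x) - u(x), 9) for x in negative}

print(diffs_pos)  # {2.0}
print(diffs_neg)  # {-5.0}
```

The difference is constant on each component but not on the domain as a whole, which is the situation the relation $\equiv$ is meant to capture.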
- CommentRowNumber69.
- CommentAuthorMike Shulman
- CommentTimeFeb 9th 2014
- PermaLink
Author: Mike Shulman
Format: MarkdownItexRe #67: I remember stumbling over that issue sometime as an undergrad, or maybe even a grad student. I think I spent days, or at least hours, trying to figure out why some computation wasn't working, before I realized that I was implicitly assuming a version of "Cauchy's invariant rule" for second derivatives (though I didn't know the name of it), and that it might not be true. From the perspective of #33 above, the problem arises from neglecting the $d^2 x$ terms that ought to be there in the second differential. I certainly didn't understand that at the time, but I might have if someone had taught me calculus using differentials to start with!

Re #67: I remember stumbling over that issue sometime as an undergrad, or maybe even a grad student. I think I spent days, or at least hours, trying to figure out why some computation wasn’t working, before I realized that I was implicitly assuming a version of “Cauchy’s invariant rule” for second derivatives (though I didn’t know the name of it), and that it might not be true.

From the perspective of #33 above, the problem arises from neglecting the $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>d</mi> <mn>2</mn></msup><mi>x</mi></mrow><annotation encoding="application/x-tex">d^2 x</annotation></semantics></math>$ terms that ought to be there in the second differential. I certainly didn’t understand that at the time, but I might have if someone had taught me calculus using differentials to start with!
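
Here is a quick numerical sanity check of that claim, as a sketch under stated assumptions: I read the second differential as $\mathrm{d}^2(f(x)) = f''(x)\,\mathrm{d}x^2 + f'(x)\,\mathrm{d}^2x$ (my reading of #33, not a quotation), evaluate it along a curve $c$ with $x = c(0)$, $\mathrm{d}x = c'(0)$, $\mathrm{d}^2x = c''(0)$, and compare with a finite-difference estimate of $(f\circ c)''(0)$. The function `f`, the curve `c` and the step `h` are arbitrary choices for the example.

```python
def f(x):
    return x**3

def c(t):
    # a curve with c(0) = 1, c'(0) = 2, c''(0) = 6
    return 1 + 2*t + 3*t**2

def g(t):
    # the composite f o c
    return f(c(t))

# central second difference approximating (f o c)''(0)
h = 1e-3
numeric = (g(h) - 2*g(0) + g(-h)) / h**2

x, dx, d2x = 1.0, 2.0, 6.0
fp = 3 * x**2   # f'(x)
fpp = 6 * x     # f''(x)

with_d2x = fpp * dx**2 + fp * d2x   # includes the d^2 x term
without_d2x = fpp * dx**2           # neglects the d^2 x term

print(round(numeric, 2), with_d2x, without_d2x)
```

The finite-difference value agrees with the formula that keeps the $\mathrm{d}^2x$ term and differs visibly from the one that drops it, unless the curve happens to have $c''(0) = 0$.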
- CommentRowNumber70.
- CommentAuthorMike Shulman
- CommentTimeFeb 9th 2014
- PermaLink
Author: Mike Shulman
Format: MarkdownItex@Toby, did either of the two answers on your [MO question](http://mathoverflow.net/questions/60474/is-there-a-convenient-differential-calculus-for-cojets) ever pan out? The Hasse-Schmidt one seems promising, as you said, but as stated it seems to be purely algebraic and so only applies to polynomials. Also, if I understood it correctly, there isn't an operator $d$ that could be applied to anything already containing $d$s -- instead there is a separate $d^2$ operator which is just asserted to satisfy the Leibniz rule that you would expect if it were actually "$d$-of-$d$".

@Toby, did either of the two answers on your MO question ever pan out? The Hasse-Schmidt one seems promising, as you said, but as stated it seems to be purely algebraic and so only applies to polynomials. Also, if I understood it correctly, there isn’t an operator $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>d</mi></mrow><annotation encoding="application/x-tex">d</annotation></semantics></math>$ that could be applied to anything already containing $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>d</mi></mrow><annotation encoding="application/x-tex">d</annotation></semantics></math>$ s – instead there is a separate $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>d</mi> <mn>2</mn></msup></mrow><annotation encoding="application/x-tex">d^2</annotation></semantics></math>$ operator which is just asserted to satisfy the Leibniz rule that you would expect if it were actually “ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>d</mi></mrow><annotation encoding="application/x-tex">d</annotation></semantics></math>$ -of- $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>d</mi></mrow><annotation encoding="application/x-tex">d</annotation></semantics></math>$ ”.
- CommentRowNumber71.
- CommentAuthorTobyBartels
- CommentTimeFeb 9th 2014
- (edited Feb 9th 2014)
- PermaLink
Author: TobyBartels
Format: MarkdownItexRe #68: Then an important basic result (an easy corollary of the Mean Value Theorem) is that (for differentiable quantities) $u \equiv v$ is equivalent to $\mathrm{d}u = \mathrm{d}v$. Actually, I've considered formally *defining* $\mathrm{d}$ to be the operation taking $u$ to its ${\equiv}$-equivalence class. Then all of the hard work goes into defining multiplication of such an equivalence class by an ordinary quantity (or more precisely into defining the equality relation on formal linear combinations of differentials with coefficients from the ring of quantities). Note that naïvely, every quantity has a differential in this sense, but we'll find that things are better behaved when we restrict to differentiable quantities.

Re #68: Then an important basic result (an easy corollary of the Mean Value Theorem) is that (for differentiable quantities) $u \equiv v$ is equivalent to $\mathrm{d}u = \mathrm{d}v$.

Actually, I've considered formally *defining* $\mathrm{d}$ to be the operation taking $u$ to its ${\equiv}$-equivalence class. Then all of the hard work goes into defining multiplication of such an equivalence class by an ordinary quantity (or more precisely into defining the equality relation on formal linear combinations of differentials with coefficients from the ring of quantities). Note that naïvely, every quantity has a differential in this sense, but we'll find that things are better behaved when we restrict to differentiable quantities.
- CommentRowNumber72.
- CommentAuthorTobyBartels
- CommentTimeFeb 9th 2014
- PermaLink
Author: TobyBartels

Re #69: I dare say that I spent *years* on this, off and on, struggling to figure out what the heck was going on. It may actually have only been when I was first assigned to teach Calculus that I forced myself to come to some resolution (and shortly thereafter started writing M.O questions about it). I remember struggling with the minus sign in $\mathrm{d}y/\mathrm{d}x = -(\partial{F}/\partial{x})/(\partial{F}/\partial{y})$ around the same time (although I resolved that one much earlier).
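
A small numerical check of that sign, as a sketch rather than anything taken from the thread: the example $F(x, y) = x^2 + y^2 - 25$, the point $(3, 4)$, and every function name below are assumptions chosen for illustration. Along the level curve $F = 0$ the differential $\mathrm{d}F = (\partial F/\partial x)\,\mathrm{d}x + (\partial F/\partial y)\,\mathrm{d}y$ vanishes, and solving that equation for $\mathrm{d}y/\mathrm{d}x$ is what produces the minus sign.

```python
import math

# Hypothetical example: the circle F(x, y) = x^2 + y^2 - 25 near the point (3, 4).
def F_x(x, y):
    return 2.0 * x          # partial derivative of F with respect to x

def F_y(x, y):
    return 2.0 * y          # partial derivative of F with respect to y

def y_on_curve(x):
    # Upper branch of the level curve F(x, y) = 0.
    return math.sqrt(25.0 - x * x)

def slope_by_difference(x, step=1e-6):
    # Central difference estimate of dy/dx along the level curve.
    return (y_on_curve(x + step) - y_on_curve(x - step)) / (2.0 * step)

x0, y0 = 3.0, 4.0
implicit_slope = -F_x(x0, y0) / F_y(x0, y0)   # the formula, minus sign included
measured_slope = slope_by_difference(x0)
print(implicit_slope, measured_slope)
```

The two printed numbers should agree closely; dropping the minus sign would give a slope of the wrong sign for a circle in the first quadrant.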

Re #70: No, I never really slogged through the linked articles. I've really just these past few months settled on my own answer. To wit: $\mathrm{d}f$ is the operation that maps a smooth curve $c$ to $(f \circ c)'(0)$; $\mathrm{d}^2f$ maps $c$ to $(f \circ c)''(0)$, and so on. Of course, $f$ itself maps $c$ to $(f \circ c)(0)$. Then we just take the subring generated by the above, within the ring of all operations that map a curve to a number (which is commutative). At least for smooth functions, that's all that there is to it.
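
To make the definition concrete, here is a minimal numerical sketch, assuming curves in $\mathbb{R}^2$ and central finite differences in place of exact derivatives. The helper names `derivative_at_zero` and `d_power`, the sample curve, and the step size are all hypothetical choices, not part of the proposal itself. It evaluates $\mathrm{d}(xy)$ and $\mathrm{d}^2(xy)$ on one curve and compares them with $x\,\mathrm{d}y + y\,\mathrm{d}x$ and $x\,\mathrm{d}^2y + 2\,\mathrm{d}x\,\mathrm{d}y + y\,\mathrm{d}^2x$, computed in the (pointwise, hence commutative) ring of operations.

```python
import math

def derivative_at_zero(g, order, step=1e-3):
    """Central finite-difference estimate of the order-th derivative of g at 0."""
    total = 0.0
    for k in range(order + 1):
        total += (-1) ** k * math.comb(order, k) * g((order / 2 - k) * step)
    return total / step ** order

def d_power(f, order):
    """The operation d^order f, sending a curve c to (f o c)^(order)(0)."""
    return lambda c: derivative_at_zero(lambda t: f(*c(t)), order)

# Hypothetical curve in R^2 and the functions x, y and xy on R^2.
def curve(t):
    return (1.0 + 2.0 * t, 3.0 + t + t * t)

x = lambda a, b: a
y = lambda a, b: b
xy = lambda a, b: a * b

# Values of the generators on the curve; order 0 is evaluation at c(0).
x0, y0 = d_power(x, 0)(curve), d_power(y, 0)(curve)
dx, dy = d_power(x, 1)(curve), d_power(y, 1)(curve)
d2x, d2y = d_power(x, 2)(curve), d_power(y, 2)(curve)

# Products are taken pointwise on the values, so multiplication commutes.
first_order = x0 * dy + y0 * dx                      # x dy + y dx
second_order = x0 * d2y + 2.0 * dx * dy + y0 * d2x   # x d^2y + 2 dx dy + y d^2x

print(d_power(xy, 1)(curve), first_order)
print(d_power(xy, 2)(curve), second_order)
```

Each printed pair should agree up to finite-difference error, which is the sense in which $\mathrm{d}(xy)$ and $\mathrm{d}^2(xy)$ already lie in the subring generated by $x$, $y$ and their differentials.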
- CommentRowNumber73.
- CommentAuthorMike Shulman
- CommentTimeFeb 9th 2014
- PermaLink
Author: Mike Shulman

Is there a derivation $d$ that maps that entire subring to itself? It’s clear what it should do on the generators, of course, but it’s not immediately obvious to me that that yields a well-defined operation.

Anyway, it sounds like a reasonable answer, but I find it a bit unsatisfying not to have a more intrinsic characterization of the subring in question, and also to have to assume in advance the notion of smooth.
- CommentRowNumber74.
- CommentAuthorTobyBartels
- CommentTimeFeb 9th 2014
- PermaLink
Author: TobyBartels

I'm not sure what you mean by

> have to assume in advance the notion of smooth

As far as the M.O question is concerned, we're working on a smooth manifold (in fact a Cartesian space, without loss of generality), so we have this notion. Even if then we try to make it work more generally for diffeological spaces or the like, then all of these still start out with *some* notion of smooth. (It's the other thread where we're trying to define everything in terms of curves in very general spaces; here we're still trying to understand $\mathbb{R}^n$.)

But if instead you mean that it's unsatisfying to only define this for smooth maps (so not to extend to the case where, say, $\mathrm{d}^2 f$ exists but $\mathrm{d}^3 f$ does not), then I think that it should still work, just with extra effort to keep track of when things might be undefined. (Again, we know ahead of time what's $C^k$ and what's not, so we already know when $\mathrm{d}$ should be defined.)

> It’s clear what it should do on the generators, of course, but it’s not immediately obvious to me that that yields a well-defined operation.

Ah, good point! Actually, I think that I can extend $\mathrm{d}$ (partially defined) to every operation whatsoever taking a smooth parametrized curve to a real number. Given the curve $c$ and a real number $h$, let $c_h$ be the reparametrization of $c$ given by $t \mapsto c(t + h)$. Then given the operation $\eta$ (so $\langle{\eta{|}c}\rangle$ is a number), define $\mathrm{d}\eta$ so that
$$ \langle{\mathrm{d}\eta{|}c}\rangle \coloneqq \lim_{h \to 0} \frac{\langle{\eta{|}c_h}\rangle - \langle{\eta{|}c}\rangle} h $$
if this exists. (You can leave $\mathrm{d}\eta$ as a partially defined operation, or declare that $\mathrm{d}\eta$ exists only if this limit exists for all $c$.)

This manifestly depends only on the underlying operation, and it does the right thing, recursively, to smooth maps.
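
The construction above can be sketched numerically, with the caveat that the code below replaces the limit by a difference quotient at one fixed small $h$, so it only approximates $\mathrm{d}\eta$. The names `shift` and `d`, the one-dimensional sample curve, and the step size are hypothetical. Operations are plain functions from curves to numbers, and `d` never inspects how an operation was built, only what it returns on $c$ and on $c_h$.

```python
def shift(c, h):
    """The reparametrization c_h : t -> c(t + h)."""
    return lambda t: c(t + h)

def d(eta, h=1e-4):
    """Approximate d(eta): the difference quotient at a fixed small h, not the limit."""
    return lambda c: (eta(shift(c, h)) - eta(c)) / h

# Hypothetical operations on curves in R: the coordinate x and its square.
x = lambda c: c(0.0)
x_squared = lambda c: c(0.0) ** 2

curve = lambda t: 1.0 + 3.0 * t + t * t   # x = 1, x' = 3, x'' = 2 at t = 0

dx = d(x)(curve)                    # close to x'(0) = 3
d2x = d(d(x))(curve)                # close to x''(0) = 2
d_x_squared = d(x_squared)(curve)   # close to 2 x dx = 6
print(dx, d2x, d_x_squared)
```

Applying `d` twice to `x` recovers the second derivative of the curve, which is the recursive behaviour on smooth maps described above; the printed values differ from 3, 2 and 6 by terms of order $h$.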
- CommentRowNumber75.
- CommentAuthorMike Shulman
- CommentTimeFeb 10th 2014
- PermaLink
Author: Mike Shulman

Very nice! You can exclude some uninteresting things by restricting to germs of curves, and I think you can even omit the *a priori* restriction to smooth curves: consider partial real-valued functions from the set of germs (at 0) of all curves, and say a curve $c$ is smooth if $d^n x$ is defined at $c$ for all coordinate functions $x$. (I’m not sure exactly what I was complaining about re: “smooth”, but whatever it was, this makes me happier.) That feels kind of Froelicher: given the relation $\langle \eta | c \rangle$ between partial operations and curves, we consider the fixed point of the resulting Galois connection generated by the coordinate functions. The point in the [other thread](http://nforum.mathforge.org/discussion/5518/differentials/?Focus=45089#Comment_45089) is that this doesn’t correctly isolate the differentiable functions on the other side: even if $d f$ is defined, as an operation, on all smooth $c$, then $f$ may not be differentiable in the usual sense unless $d f$ additionally depends only on the tangent vector of a curve and is a linear function thereof. Right?

Interestingly, I think this context also allows operations like $e^{dx}$: it’s the operation that takes $c$ to $e^{(x\circ c)'(0)}$. And presumably its differential is $d(e^{dx}) = e^{dx}\, d^2x$. I’m not sure whether this is a good thing or not. I’m currently playing around with a different idea for defining higher differentials; if it works I may post up somewhere.

Can you think of a good name for these things that include differentials and also higher ones? We can’t really call them “differential forms” once they have $d^2x\neq 0$ and $dx\,dy = dy\,dx$.
- CommentRowNumber76.
- CommentAuthorMike Shulman
- CommentTimeFeb 10th 2014
- PermaLink
Author: Mike Shulman

(I guess I’m having trouble separating the threads, sorry – in my mind it’s all one discussion. (-: )
- CommentRowNumber77.
- CommentAuthorTobyBartels
- CommentTimeFeb 10th 2014
- PermaLink
Author: TobyBartels

We certainly *can* call them differential forms even when $\mathrm{d}x \,\mathrm{d}y = \mathrm{d}y \,\mathrm{d}x$; they're just not *exterior* differential forms. The term ‘form’ is quite general and has a venerable history. (Compare ‘quadratic form’, ‘symmetric bilinear form’, etc.) In M.O, I said ‘cojet differential form’, which is not quite as nice a term as ‘exterior differential form’ (since ‘cojet’ is a noun rather than an adjective like ‘exterior’), but it does get at the right idea: that they act on spaces of jets (the limit of which is the space of germs, as you noted).
- CommentRowNumber78.
- CommentAuthorTobyBartels
- CommentTimeFeb 11th 2014
- PermaLink
Author: TobyBartels

I like your $\mathrm{e}^{\mathrm{d}x}$; I have successfully calculated $\mathrm{d}(\mathrm{e}^{\mathrm{d}x}) = \mathrm{e}^{\mathrm{d}x} \,\mathrm{d}^2x$ (using Taylor's Theorem with Peano's remainder); actually, the calculation works for $\mathrm{e}^\omega$ generally.

Generalizing still further, I conclude that
$$ \mathrm{d}(f(\omega_1, \ldots, \omega_n)) = D_1{f}(\omega_1, \ldots, \omega_n) \,\mathrm{d}\omega_1 + \cdots + D_n{f}(\omega_1, \ldots, \omega_n) \,\mathrm{d}\omega_n $$
for any differentiable function $f$ of $n$ variables, by pushing everything through the definition, applying Taylor's Theorem to $f$, and observing that the unwanted terms drop out in the limit. What more could one possibly want? (In particular, $\mathrm{d}$ is a derivation.)
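
As a rough numerical illustration of this rule (a sketch under stated assumptions, not a proof): the code below approximates $\mathrm{d}$ by a difference quotient at a fixed small $h$, using hypothetical helpers `shift` and `d` and a made-up one-dimensional curve. It checks the case $f = \exp$, $\omega = \mathrm{d}x$, and the case $f(\omega_1, \omega_2) = \omega_1 \omega_2$ with $\omega_1 = x$, $\omega_2 = \mathrm{d}x$, where the rule reads $\mathrm{d}(x\,\mathrm{d}x) = \mathrm{d}x\,\mathrm{d}x + x\,\mathrm{d}^2x$.

```python
import math

def shift(c, h):
    """The reparametrization c_h : t -> c(t + h)."""
    return lambda t: c(t + h)

def d(eta, h=1e-4):
    """Approximate d(eta) by a difference quotient at a fixed small h."""
    return lambda c: (eta(shift(c, h)) - eta(c)) / h

x = lambda c: c(0.0)   # coordinate operation on curves in R
dx = d(x)
d2x = d(dx)

curve = lambda t: 1.0 + 0.5 * t + t * t   # x = 1, dx = 0.5, d^2x = 2 at t = 0

# Case f = exp, omega = dx: compare d(e^{dx}) with e^{dx} d^2x.
exp_dx = lambda c: math.exp(dx(c))
lhs_exp = d(exp_dx)(curve)
rhs_exp = exp_dx(curve) * d2x(curve)

# Case f(a, b) = a * b with omega_1 = x, omega_2 = dx (the Leibniz rule).
x_dx = lambda c: x(c) * dx(c)
lhs_product = d(x_dx)(curve)
rhs_product = dx(curve) * dx(curve) + x(curve) * d2x(curve)

print(lhs_exp, rhs_exp)
print(lhs_product, rhs_product)
```

In each pair the two numbers should differ only by terms of order $h$, consistent with the unwanted terms dropping out in the limit.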

Technicality: You wrote in part

> say a curve $c$ is smooth if $d^n x$ is defined at $c$ for all coordinate functions $x$

You mean that $c$ is smooth *at $0$*, or else you mean that $\mathrm{d}^n x$ must be defined at *$c_h$* for all $x$ *and all real numbers $h$*.

> in my mind it’s all one discussion

Certainly you borrowed notation from an off-site file linked only in the other thread!
- CommentRowNumber79.
- CommentAuthorMike Shulman
- CommentTimeFeb 11th 2014
- PermaLink
Author: Mike Shulman

You mean that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>c</mi></mrow><annotation encoding="application/x-tex">c</annotation></semantics></math>$ is smooth at $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn></mrow><annotation encoding="application/x-tex">0</annotation></semantics></math>$

Yes, thanks.

Certainly you borrowed notation from an off-site file linked only in the other thread!

Really? What notation? You used $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">⟨</mo><mrow><mi>η</mi><mo stretchy="false">|</mo><mi>c</mi></mrow><mo stretchy="false">⟩</mo></mrow><annotation encoding="application/x-tex">\langle{\eta{|}c}\rangle</annotation></semantics></math>$ up in #74 here…

spaces of jets (the limit of which is the space of germs

Technicality again, but that doesn’t seem quite right to me; at least, I can’t see a sense in which it’s true. In particular, a germ is not determined by its $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>k</mi></mrow><annotation encoding="application/x-tex">k</annotation></semantics></math>$ -jets for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>k</mi><mo>&lt;</mo><mn>∞</mn></mrow><annotation encoding="application/x-tex">k\lt\infty</annotation></semantics></math>$ , is it?

We certainly can call them differential forms

Okay, I see the point that it’s historically fine, but my experience is that nowadays mathematicians pretty universally say “differential form” to mean “exterior differential form”. I guess “cojet differential form” would suffice to clarify, which might get abbreviated to “cojet form”.

I think my main worry is using the same symbol $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>d</mi></mrow><annotation encoding="application/x-tex">d</annotation></semantics></math>$ for the cojet differential and the exterior differential. For instance, pedagogically speaking, if I teach my calc 1 or calc 2 students to calculate with cojet differentials, aren’t they going to be confused when they get to multivariable and I tell them that now $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>d</mi> <mn>2</mn></msup><mo>=</mo><mn>0</mn></mrow><annotation encoding="application/x-tex">d^2=0</annotation></semantics></math>$ ?
- CommentRowNumber80.
- CommentAuthorMike Shulman
- CommentTimeFeb 11th 2014
- PermaLink
Author: Mike Shulman
Format: MarkdownItex

I wonder whether cojet forms and exterior forms could be unified in a larger framework? In some sense, all these cojet forms are still only 1-forms: even though they involve higher derivatives, they only act on curves. But we could consider instead real-valued operators on germs of parametrized surfaces or hypersurfaces as well. For instance, if $\omega$ is an operator on germs of curves, we could define its exterior differential $\hat{d}\omega$ as an operator on germs of surfaces by

$$ \langle \hat{d}\omega {|} c \rangle = \lim_{t\to 0} \frac{ \langle \omega {|} \lambda s.c(s,0) \rangle + \langle \omega {|} \lambda s.c(t,s) \rangle - \langle \omega {|} \lambda s.c(s,t) \rangle - \langle \omega {|} \lambda s.c(0,s) \rangle }{t} $$

or perhaps in the case when $\omega$ might be nonlinear it would be better to say

$$ \langle \hat{d}\omega {|} c \rangle = \lim_{t\to 0} \frac{ \langle \omega {|} \lambda s.c(s,0) \rangle + \langle \omega {|} \lambda s.c(t,s) \rangle + \langle \omega {|} \lambda s.c(-s,t) \rangle + \langle \omega {|} \lambda s.c(0,-s) \rangle }{t} $$

I haven't checked that this is at all sensible. But it also starts (unsurprisingly) to make me think of the Weil algebras that define the infinitesimal objects in SDG.
- CommentRowNumber81.
- CommentAuthorMike Shulman
- CommentTimeFeb 11th 2014
- PermaLink
Author: Mike Shulman
Format: MarkdownItex

Here's another thought: can we integrate an arbitrary cojet form? Suppose $\omega$ is a real-valued operator on germs of curves, and let $c$ be a curve defined on $(a-\epsilon,b+\epsilon)$. Then we have a function $f:[a,b]\to\mathbb{R}$ defined by

$$ f(x) = \langle \omega {|} c_{x} \rangle $$

and we could define

$$ \oint_c \omega = \int_{a}^b f(x) dx $$

if the RHS exists. It seems like it ought to follow that

$$ \oint_c d\omega = \langle \omega {|} c_b \rangle - \langle \omega {|} c_a \rangle $$

(where $d$ is the commutative cojet differential). But it's late at night, so I could be spewing nonsense...
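The conjectured identity can be checked numerically. The following is only a sketch under stated assumptions: it takes $c_x$ to mean the re-based curve $t \mapsto c(x+t)$ and takes the cojet differential to be $\langle d\omega {|} c \rangle = \frac{d}{dh}\langle \omega {|} c_h \rangle$ at $h=0$, which is a reading of the earlier comments and not something restated in this one. The helper names (`shift`, `cojet_d`, `integrate_along`, `dx_squared`) and the sample curve are made up for illustration, and all derivatives are approximated by central differences. Under that reading, the identity is the fundamental theorem of calculus applied to $x \mapsto \langle \omega {|} c_x \rangle$.

```python
import math

def shift(c, x):
    """Return the re-based curve c_x, i.e. t -> c(x + t) (assumed meaning of c_x)."""
    return lambda t: c(x + t)

def cojet_d(omega, h=1e-4):
    """Assumed definition: <d omega | c> = d/dh <omega | c_h> at h = 0,
    approximated here by a central difference."""
    return lambda c: (omega(shift(c, h)) - omega(shift(c, -h))) / (2 * h)

def integrate_along(omega, c, a, b, n=2000):
    """Midpoint-rule approximation of the integral of <omega | c_x> over [a, b]."""
    step = (b - a) / n
    return step * sum(omega(shift(c, a + (i + 0.5) * step)) for i in range(n))

def dx_squared(c, h=1e-4):
    """A nonlinear cojet form: <dx^2 | c> = c'(0)^2, with c'(0) by central difference."""
    return ((c(h) - c(-h)) / (2 * h)) ** 2

def curve(t):
    """Hypothetical sample curve in the real line."""
    return math.sin(t) + t * t

a, b = 0.0, 1.0
lhs = integrate_along(cojet_d(dx_squared), curve, a, b)          # integral of d(omega) along c
rhs = dx_squared(shift(curve, b)) - dx_squared(shift(curve, a))  # <omega|c_b> - <omega|c_a>
print(lhs, rhs)
```

The two printed numbers should agree up to discretization error. The example deliberately uses the nonlinear form $dx^2$, since nothing in the argument needs linearity.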
- CommentRowNumber82.
- CommentAuthorTobyBartels
- CommentTimeFeb 11th 2014
- PermaLink
Author: TobyBartels
Format: MarkdownItex

> You used $\langle{\eta{|}c}\rangle$ up in #74 here...

Oops, never mind, that was me, not you!

> a germ is not determined by its $k$-jets for $k\lt\infty$, is it?

Ah, no, I must have been implicitly assuming that every function (or at least every smooth function) is analytic, and we wouldn't want to restrict to analytic curves. Still, these operations do depend only on the jets, even when the germs differ. But germs are a simpler concept.

> if I teach my calc 1 or calc 2 students to calculate with cojet differentials, aren't they going to be confused when they get to multivariable and I tell them that now $d^2=0$?

In my Calculus classes, I've been using $\mathrm{d} \wedge \eta$ for the exterior differential of $\eta$. They've already seen $\eta \wedge \zeta$ by this point, and this gives the right idea regarding skew-commutativity. (In particular, the signs in the product rule

$$ \mathrm{d} \wedge (\eta \wedge \zeta) = (\mathrm{d} \wedge \eta) \wedge \zeta + (-1)^{|\eta|} \eta \wedge (\mathrm{d} \wedge \zeta) = (-1)^{(1 + {|\eta|}){|\zeta|}} \zeta \wedge \mathrm{d} \wedge \eta + (-1)^{|\eta|} \eta \wedge \mathrm{d} \wedge \zeta $$

come out right that way. Not that I ever write down anything like this in that class.) So $\mathrm{d} \wedge \mathrm{d} \wedge \eta = 0$, but this is very different from $\mathrm{d}^2 \eta = \mathrm{d} (\mathrm{d} \eta)$.

I do tell them that people usually don't put the wedge in there (and that they sometimes don't put the wedge in the wedge product either), and this is OK because they're restricting attention to *exterior* differential forms.

But even though I don't actually use higher differentials in my Calculus classes[^higherdiffs], they *do* see differential forms that aren't exterior forms. There are the [[absolute differential forms]], of course, but there's more; consider

$$ đs = \sqrt{\mathrm{d}x^2 + \mathrm{d}y^2} .$$

It would be criminal not to introduce that in class! But what is $\mathrm{d}x^2$ (or ${|\mathrm{d}x|}^2$)? It can be thought of as a symmetric bilinear form, but it's also a cojet form. (The two operations, one on a pair of curves and one on a single curve, are related by [[polarization identity|polarization]].)

[^higherdiffs]: Now that I understand them better, I might. But expressing, say, the second derivative test for extreme values in terms of differentials instead of derivatives looks so different that it may be too difficult, when it's not in the book. Anyway, the main reason for using differential in class is that people use them in applied fields, so it's not so justifiable to bring in something that you and I invented ourselves.
- CommentRowNumber83.
- CommentAuthorMike Shulman
- CommentTimeFeb 11th 2014
- PermaLink
Author: Mike Shulman
Format: MarkdownItex

> these operations do depend only on the jets, even when the germs differ

That's true if by "these operations" you mean the ones constructed from functions by applying the cojet $d$ and algebra operations. In #72 you suggested generating a subring, so I guess this is what you're thinking of. Although $e^{dx}$ wouldn't be in that subring, nor would $\sqrt{dx^2 + dy^2}$; we'd need to close up under more functions than the ring operations. The whole ring of operations-on-germs, of course, might include operations that really do depend on the whole germ rather than only the jets, although I can't think of any examples off the top of my head.

> In my Calculus classes, I've been using $\mathrm{d} \wedge \eta$ for the exterior differential of $\eta$

That's good! I might do the same when I get to exterior derivatives. (Although I still haven't decided whether I can justify talking about exterior differential forms at all, given that our standard textbook does everything the traditional way in terms of vectors. Is there a good multivariable calculus textbook that uses differential forms?)

> the main reason for using differential in class is that people use them in applied fields

Hmm, that's one good reason, but I think another good reason is that they just make the concepts easier to understand and the computations easier to do. However, it's not clear to me that higher cojet differentials would be much use in single-variable calc for either of those purposes. The main advantage I see right now is if I could somehow avoid talking about derivatives at all and use *only* differentials, but to be really effective that would require a supporting textbook.
- CommentRowNumber84.
- CommentAuthorMike Shulman
- CommentTimeFeb 11th 2014
- PermaLink
Author: Mike Shulman
Format: MarkdownItex

One issue with my proposed notion of integration in #81 is that in general, it will depend on the parametrization of the curve, whereas the integral of an ordinary 1-form along a curve does not (though it does depend on its orientation). However, it does include integration with respect to $ds = \sqrt{dx^2+dy^2}$, which is also parametrization-invariant; I guess what matters for that is not linearity but "degree-1 homogeneity".

Does it also include integration of absolute 1-forms? Can an absolute 1-form be regarded as a cojet form like $|dx|$ defined by

$$ \langle {|\omega|} ; c\rangle = {\Big|\langle \omega ; c\rangle\Big|}? $$

(I changed your notation $\langle \omega | c \rangle$ to $\langle \omega ; c \rangle$ to avoid confusion with the absolute value bars.)
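The role of degree-1 homogeneity can be illustrated numerically. This is a rough sketch, assuming the integral of #81 is read as $\int_a^b \langle \omega ; c_x \rangle\, dx$ with $c_x(t) = c(x+t)$. The quarter circle, the substitution $t = u^2$, and all function names are hypothetical choices made for the illustration. It compares the degree-1 form $ds$ with the degree-2 form $dx^2 + dy^2$ on the same path under two parametrizations.

```python
import math

def speed_form(c, h=1e-5):
    """<ds ; c> = sqrt(x'(0)^2 + y'(0)^2): homogeneous of degree 1 in the velocity."""
    (x1, y1), (x0, y0) = c(h), c(-h)
    return math.hypot((x1 - x0) / (2 * h), (y1 - y0) / (2 * h))

def energy_form(c, h=1e-5):
    """<dx^2 + dy^2 ; c>: homogeneous of degree 2 in the velocity."""
    return speed_form(c, h) ** 2

def integrate_along(omega, c, a, b, n=4000):
    """Midpoint-rule approximation of the integral of <omega ; c_x> over [a, b]."""
    step = (b - a) / n
    total = 0.0
    for i in range(n):
        x = a + (i + 0.5) * step
        total += omega(lambda t, x=x: c(x + t))  # c_x is the curve re-based at x
    return step * total

def quarter_circle(t):
    return (math.cos(t), math.sin(t))

def reparametrized(u):
    """Same path, traversed with t = u^2 (orientation-preserving for u > 0)."""
    return quarter_circle(u * u)

end = math.pi / 2
ds_original = integrate_along(speed_form, quarter_circle, 0.0, end)
ds_reparam = integrate_along(speed_form, reparametrized, 0.0, math.sqrt(end))
energy_original = integrate_along(energy_form, quarter_circle, 0.0, end)
energy_reparam = integrate_along(energy_form, reparametrized, 0.0, math.sqrt(end))
print(ds_original, ds_reparam)          # both approximate the arc length pi/2
print(energy_original, energy_reparam)  # these two differ
```

The reason is the substitution rule: replacing $c$ by $c \circ \phi$ multiplies a degree-$k$ form by $\phi'^k$, and only for $k = 1$ does that factor cancel against $du = dt / \phi'$.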
- CommentRowNumber85.
- CommentAuthorMike Shulman
- CommentTimeFeb 11th 2014
- PermaLink
Author: Mike Shulman
Format: MarkdownItex

Re: #80, the wedge product of two cojet 1-forms $\omega$ and $\eta$ ought probably to be the "cojet 2-form" defined on a surface germ $c$ by

$$ \langle \omega\wedge\eta {|} c \rangle = \langle\omega {|} \lambda s.c(s,0) \rangle \cdot \langle\eta {|} \lambda s.c(0,s) \rangle - \langle\omega {|} \lambda s.c(0,s) \rangle \cdot \langle\eta {|} \lambda s.c(s,0) \rangle $$
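As a sanity check on a single example (not a proof), the wedge formula above and the first limit formula proposed in #80 can be compared on the ordinary 1-form $x\,dy$, for which one expects $\hat{d}(x\,dy) = dx \wedge dy$. The sketch below is an illustration under assumptions: the surface germ `surface` is an arbitrary made-up example, the function names are hypothetical, derivatives are central differences, and the limit $t \to 0$ is replaced by a small fixed $t$, so agreement is only expected up to $O(t)$.

```python
def derivative_at_zero(g, h=1e-6):
    """Central-difference derivative of a scalar function g at 0."""
    return (g(h) - g(-h)) / (2 * h)

def dx(curve):
    """<dx | curve>: derivative of the x-coordinate at s = 0."""
    return derivative_at_zero(lambda s: curve(s)[0])

def dy(curve):
    """<dy | curve>: derivative of the y-coordinate at s = 0."""
    return derivative_at_zero(lambda s: curve(s)[1])

def x_dy(curve):
    """The 1-form x dy evaluated on a curve germ based at s = 0."""
    return curve(0.0)[0] * dy(curve)

def wedge(omega, eta):
    """Wedge of two cojet 1-forms on a surface germ c, following the displayed formula."""
    def form(c):
        first = lambda s: c(s, 0.0)
        second = lambda s: c(0.0, s)
        return omega(first) * eta(second) - omega(second) * eta(first)
    return form

def exterior_d(omega, t=1e-3):
    """Difference quotient from the first formula in #80 at a fixed small t (no limit taken)."""
    def form(c):
        return (omega(lambda s: c(s, 0.0)) + omega(lambda s: c(t, s))
                - omega(lambda s: c(s, t)) - omega(lambda s: c(0.0, s))) / t
    return form

def surface(s, t):
    """Hypothetical surface germ in the plane."""
    return (s + t * t, t + s * t)

wedge_value = wedge(dx, dy)(surface)
d_hat_value = exterior_d(x_dy)(surface)
print(wedge_value, d_hat_value)
```

For this surface both quantities come out close to the Jacobian determinant of `surface` at the origin, which is what the usual exterior calculus predicts.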
- CommentRowNumber86.
- CommentAuthorTobyBartels
- CommentTimeFeb 11th 2014
- PermaLink
Author: TobyBartels
Format: MarkdownItex

> I still haven't decided whether I can justify talking about exterior differential forms at all, given that our standard textbook does everything the traditional way in terms of vectors. Is there a good multivariable calculus textbook that uses differential forms?

I don't know of one; even Dray & Manogue don't go that far.

My justification is that they're already integrating differential forms; the classical expression $\int \mathbf{F} \cdot d\mathbf{r}$ is already the integral of a differential form; you just need to take it literally. All of the formulas are in [my handout](http://tobybartels.name/MATH-2080/2013s/forms/) (where Page 6 is strictly time-permitting ... which so far it hasn't been).
- CommentRowNumber87.
- CommentAuthorMike Shulman
- CommentTimeFeb 12th 2014
- PermaLink
Author: Mike Shulman
Format: MarkdownItexSuppose I start with a function and take its cojet differential over and over again. $$ d f(x) = f'(x) dx $$ $$ d^2f(x) = f''(x) dx^2 + f'(x)d^2x $$ $$ d^3f(x) = f'''(x) dx^3 + 3 f''(x) dx\cdot d^2x + f'(x) d^3x $$ $$ d^4f(x) = f^{(4)}(x) dx^4 + 6 f'''(x) dx^2 d^2x + f''(x)(3(d^2x)^2 + 4 dx \cdot d^3x) + f'(x) d^4x $$ $$ d^5f(x) = f^{(5)}(x) dx^5 + 10 f^{(4)}(x) dx^3 \cdot d^2 x + f'''(x)(15dx\cdot (d^2x)^2 + 10 dx^2 \cdot d^3x) + f''(x) (10 d^2x \cdot d^3x + 5 dx \cdot d^4x) + f'(x) d^5 x $$ It appears that each term in $d^n f(x)$ is of the form $$ a f^{(k)}(x) d^{i_1}x \cdot d^{i_2}x \cdot \cdots \cdot d^{i_k}x $$ for some $k\le n$ and some (unordered) partition $i_1 + i_2 + \cdots + i_k = n$. Are the coefficients appearing here some well-known combinatorial numbers associated to partitions?

Suppose I start with a function and take its cojet differential over and over again.
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>d</mi><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mi>dx</mi></mrow><annotation encoding="application/x-tex"> d f(x) = f'(x) dx </annotation></semantics></math>$ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><msup><mi>d</mi> <mn>2</mn></msup><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>f</mi><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><msup><mi>dx</mi> <mn>2</mn></msup><mo>+</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><msup><mi>d</mi> <mn>2</mn></msup><mi>x</mi></mrow><annotation encoding="application/x-tex"> d^2f(x) = f''(x) dx^2 + f'(x)d^2x </annotation></semantics></math>$ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><msup><mi>d</mi> <mn>3</mn></msup><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>f</mi><mo>‴</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><msup><mi>dx</mi> <mn>3</mn></msup><mo>+</mo><mn>3</mn><mi>f</mi><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mi>dx</mi><mo>⋅</mo><msup><mi>d</mi> <mn>2</mn></msup><mi>x</mi><mo>+</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><msup><mi>d</mi> <mn>3</mn></msup><mi>x</mi></mrow><annotation encoding="application/x-tex"> d^3f(x) = f'''(x) dx^3 + 3 f''(x) dx\cdot d^2x + f'(x) d^3x </annotation></semantics></math>$ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><msup><mi>d</mi> <mn>4</mn></msup><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><msup><mi>f</mi> <mrow><mo stretchy="false">(</mo><mn>4</mn><mo 
stretchy="false">)</mo></mrow></msup><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><msup><mi>dx</mi> <mn>4</mn></msup><mo>+</mo><mn>6</mn><mi>f</mi><mo>‴</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><msup><mi>dx</mi> <mn>2</mn></msup><msup><mi>d</mi> <mn>2</mn></msup><mi>x</mi><mo>+</mo><mi>f</mi><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">(</mo><mn>3</mn><mo stretchy="false">(</mo><msup><mi>d</mi> <mn>2</mn></msup><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup><mo>+</mo><mn>4</mn><mi>dx</mi><mo>⋅</mo><msup><mi>d</mi> <mn>3</mn></msup><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><msup><mi>d</mi> <mn>4</mn></msup><mi>x</mi></mrow><annotation encoding="application/x-tex"> d^4f(x) = f^{(4)}(x) dx^4 + 6 f'''(x) dx^2 d^2x + f''(x)(3(d^2x)^2 + 4 dx \cdot d^3x) + f'(x) d^4x </annotation></semantics></math>$ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><msup><mi>d</mi> <mn>5</mn></msup><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><msup><mi>f</mi> <mrow><mo stretchy="false">(</mo><mn>5</mn><mo stretchy="false">)</mo></mrow></msup><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><msup><mi>dx</mi> <mn>5</mn></msup><mo>+</mo><mn>10</mn><msup><mi>f</mi> <mrow><mo stretchy="false">(</mo><mn>4</mn><mo stretchy="false">)</mo></mrow></msup><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><msup><mi>dx</mi> <mn>3</mn></msup><mo>⋅</mo><msup><mi>d</mi> <mn>2</mn></msup><mi>x</mi><mo>+</mo><mi>f</mi><mo>‴</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">(</mo><mn>15</mn><mi>dx</mi><mo>⋅</mo><mo stretchy="false">(</mo><msup><mi>d</mi> <mn>2</mn></msup><mi>x</mi><msup><mo stretchy="false">)</mo> <mn>2</mn></msup><mo>+</mo><mn>10</mn><msup><mi>dx</mi> 
<mn>2</mn></msup><mo>⋅</mo><msup><mi>d</mi> <mn>3</mn></msup><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mi>f</mi><mo>″</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo stretchy="false">(</mo><mn>10</mn><msup><mi>d</mi> <mn>2</mn></msup><mi>x</mi><mo>⋅</mo><msup><mi>d</mi> <mn>3</mn></msup><mi>x</mi><mo>+</mo><mn>5</mn><mi>dx</mi><mo>⋅</mo><msup><mi>d</mi> <mn>4</mn></msup><mi>x</mi><mo stretchy="false">)</mo><mo>+</mo><mi>f</mi><mo>′</mo><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><msup><mi>d</mi> <mn>5</mn></msup><mi>x</mi></mrow><annotation encoding="application/x-tex"> d^5f(x) = f^{(5)}(x) dx^5 + 10 f^{(4)}(x) dx^3 \cdot d^2 x + f'''(x)(15dx\cdot (d^2x)^2 + 10 dx^2 \cdot d^3x) + f''(x) (10 d^2x \cdot d^3x + 5 dx \cdot d^4x) + f'(x) d^5 x </annotation></semantics></math>$
It appears that each term in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>d</mi> <mi>n</mi></msup><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">d^n f(x)</annotation></semantics></math>$ is of the form
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mi>a</mi><msup><mi>f</mi> <mrow><mo stretchy="false">(</mo><mi>k</mi><mo stretchy="false">)</mo></mrow></msup><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><msup><mi>d</mi> <mrow><msub><mi>i</mi> <mn>1</mn></msub></mrow></msup><mi>x</mi><mo>⋅</mo><msup><mi>d</mi> <mrow><msub><mi>i</mi> <mn>2</mn></msub></mrow></msup><mi>x</mi><mo>⋅</mo><mi>⋯</mi><mo>⋅</mo><msup><mi>d</mi> <mrow><msub><mi>i</mi> <mi>k</mi></msub></mrow></msup><mi>x</mi></mrow><annotation encoding="application/x-tex"> a f^{(k)}(x) d^{i_1}x \cdot d^{i_2}x \cdot \cdots \cdot d^{i_k}x </annotation></semantics></math>$
for some $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>k</mi><mo>≤</mo><mi>n</mi></mrow><annotation encoding="application/x-tex">k\le n</annotation></semantics></math>$ and some (unordered) partition $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>i</mi> <mn>1</mn></msub><mo>+</mo><msub><mi>i</mi> <mn>2</mn></msub><mo>+</mo><mi>⋯</mi><mo>+</mo><msub><mi>i</mi> <mi>k</mi></msub><mo>=</mo><mi>n</mi></mrow><annotation encoding="application/x-tex">i_1 + i_2 + \cdots + i_k = n</annotation></semantics></math>$ . Are the coefficients appearing here some well-known combinatorial numbers associated to partitions?
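
The expansions above can be checked mechanically. The following is a minimal sketch, assuming only that $d$ obeys the chain rule $d\,f^{(k)}(x) = f^{(k+1)}(x)\,dx$, the product rule, and $d(d^i x) = d^{i+1}x$, with commutative multiplication. The function names `differentiate` and `iterated_differential` and the term encoding are hypothetical choices for illustration, not anything from the thread.

```python
from collections import Counter


def differentiate(terms):
    """Apply d once to a sum of terms.

    A term is keyed by (k, orders): k is the order of the derivative f^(k)(x),
    and orders is a sorted tuple (i_1, ..., i_k) standing for the commutative
    product d^{i_1}x * ... * d^{i_k}x. The value is the integer coefficient.
    """
    result = Counter()
    for (k, orders), coeff in terms.items():
        # Chain rule on f^(k)(x): it contributes f^(k+1)(x) times an extra dx.
        result[(k + 1, tuple(sorted(orders + (1,))))] += coeff
        # Product rule on the monomial: raise one factor d^i x to d^{i+1} x.
        for pos, i in enumerate(orders):
            bumped = orders[:pos] + (i + 1,) + orders[pos + 1:]
            result[(k, tuple(sorted(bumped)))] += coeff
    return result


def iterated_differential(n):
    """Return the coefficients of d^n f(x) as a dict keyed by (k, orders)."""
    terms = Counter({(0, ()): 1})  # start from f(x) itself
    for _ in range(n):
        terms = differentiate(terms)
    return dict(terms)


if __name__ == "__main__":
    for key, coeff in sorted(iterated_differential(5).items(), reverse=True):
        print(key, coeff)
```

Under these assumptions the sketch reproduces the coefficients listed for $d^2 f$ through $d^4 f$; for $d^5 f$ the key `(4, (1, 1, 1, 2))`, that is the $f^{(4)}(x)\,dx^3 \cdot d^2x$ term, comes out with coefficient 10.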
- CommentRowNumber88.
- CommentAuthorMike Shulman
- CommentTimeFeb 12th 2014
- PermaLink
Author: Mike Shulman
Format: MarkdownItexOver in the other thread, David R posted a link to an MO answer which reminded me to look back at Arnold's book on classical mechanics, which suggests the following definition of the exterior differential of a cojet (or perhaps "cogerm" would be more appropriate) 1-form: $$ \langle d\wedge \eta {|} S \rangle = \lim_{c\to 0} \frac{1}{{|c|}^2} \oint_{S\circ c} \eta $$ where $c$ is a loop inside the parametrized surface $S$ which shrinks to nothing around $(0,0)$. (It might be a rectangle or parallelogram, but from the general perspective that restriction seems unaesthetic.) Comparing this to the definition of the differential $d$ from cogerm 1-forms to cogerm 1-forms, and its relationship to the exterior differential acting from 0-forms to 1-forms, suggests the following operation from cogerm 2-forms to cogerm 2-forms: $$ \langle d \omega {|} S \rangle = \lim_{c\to 0} \frac{1}{{|c|}^2} \int_{t=a}^b \langle \omega {|} S_{c(t)} \rangle $$ where $c$ is a loop as before, with domain $[a,b]$, and $S_{(u,v)}(s,t) = S(s+u,t+v)$ is a shifted version of the surface. Is this a 2-form version of the cogerm differential? Just throwing stuff out there at the moment, hoping sometime soon I'll have time to think about it all carefully.

Over in the other thread, David R posted a link to an MO answer which reminded me to look back at Arnold’s book on classical mechanics, which suggests the following definition of the exterior differential of a cojet (or perhaps “cogerm” would be more appropriate) 1-form:
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mo stretchy="false">⟨</mo><mi>d</mi><mo>∧</mo><mi>η</mi><mo stretchy="false">|</mo><mi>S</mi><mo stretchy="false">⟩</mo><mo>=</mo><munder><mi>lim</mi> <mrow><mi>c</mi><mo>→</mo><mn>0</mn></mrow></munder><mfrac><mn>1</mn><mrow><msup><mrow><mo stretchy="false">|</mo><mi>c</mi><mo stretchy="false">|</mo></mrow> <mn>2</mn></msup></mrow></mfrac><msub><mo>∮</mo> <mrow><mi>S</mi><mo>∘</mo><mi>c</mi></mrow></msub><mi>η</mi></mrow><annotation encoding="application/x-tex"> \langle d\wedge \eta {|} S \rangle = \lim_{c\to 0} \frac{1}{{|c|}^2} \oint_{S\circ c} \eta </annotation></semantics></math>$
where $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>c</mi></mrow><annotation encoding="application/x-tex">c</annotation></semantics></math>$ is a loop inside the parametrized surface $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>S</mi></mrow><annotation encoding="application/x-tex">S</annotation></semantics></math>$ which shrinks to nothing around $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mn>0</mn><mo>,</mo><mn>0</mn><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">(0,0)</annotation></semantics></math>$ . (It might be a rectangle or parallelogram, but from the general perspective that restriction seems unaesthetic.)

Comparing this to the definition of the differential $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>d</mi></mrow><annotation encoding="application/x-tex">d</annotation></semantics></math>$ from cogerm 1-forms to cogerm 1-forms, and its relationship to the exterior differential acting from 0-forms to 1-forms, suggests the following operation from cogerm 2-forms to cogerm 2-forms:
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mo stretchy="false">⟨</mo><mi>d</mi><mi>ω</mi><mo stretchy="false">|</mo><mi>S</mi><mo stretchy="false">⟩</mo><mo>=</mo><munder><mi>lim</mi> <mrow><mi>c</mi><mo>→</mo><mn>0</mn></mrow></munder><mfrac><mn>1</mn><mrow><msup><mrow><mo stretchy="false">|</mo><mi>c</mi><mo stretchy="false">|</mo></mrow> <mn>2</mn></msup></mrow></mfrac><msubsup><mo>∫</mo> <mrow><mi>t</mi><mo>=</mo><mi>a</mi></mrow> <mi>b</mi></msubsup><mo stretchy="false">⟨</mo><mi>ω</mi><mo stretchy="false">|</mo><msub><mi>S</mi> <mrow><mi>c</mi><mo stretchy="false">(</mo><mi>t</mi><mo stretchy="false">)</mo></mrow></msub><mo stretchy="false">⟩</mo></mrow><annotation encoding="application/x-tex"> \langle d \omega {|} S \rangle = \lim_{c\to 0} \frac{1}{{|c|}^2} \int_{t=a}^b \langle \omega {|} S_{c(t)} \rangle </annotation></semantics></math>$
where $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>c</mi></mrow><annotation encoding="application/x-tex">c</annotation></semantics></math>$ is a loop as before, with domain $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">[</mo><mi>a</mi><mo>,</mo><mi>b</mi><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">[a,b]</annotation></semantics></math>$ , and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>S</mi> <mrow><mo stretchy="false">(</mo><mi>u</mi><mo>,</mo><mi>v</mi><mo stretchy="false">)</mo></mrow></msub><mo stretchy="false">(</mo><mi>s</mi><mo>,</mo><mi>t</mi><mo stretchy="false">)</mo><mo>=</mo><mi>S</mi><mo stretchy="false">(</mo><mi>s</mi><mo>+</mo><mi>u</mi><mo>,</mo><mi>t</mi><mo>+</mo><mi>v</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">S_{(u,v)}(s,t) = S(s+u,t+v)</annotation></semantics></math>$ is a shifted version of the surface. Is this a 2-form version of the cogerm differential?

Just throwing stuff out there at the moment, hoping sometime soon I’ll have time to think about it all carefully.
- CommentRowNumber89.
- CommentAuthorMike Shulman
- CommentTimeFeb 12th 2014
- PermaLink
Author: Mike Shulman
Format: MarkdownItexProbably "$|c|^2$" should be instead the area enclosed by $c$. But having thought about it a little more, I realized those limits don't really make sense unless the integrals are invariant under reparametrization. So maybe the exterior differential doesn't really make sense except for degree-1 1-forms? And is there any sort of commutative differential on 2-forms? Would we hope or expect it to behave in any particular way? It feels weird to me that we have the world of cogerm 1-forms with the commutative $d$, and the world of exterior forms with the exterior $d\wedge$, which agree in the world of linear degree-1 1-forms and the differential of functions, but are thereafter completely unrelated.

Probably “ $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">|</mo><mi>c</mi><msup><mo stretchy="false">|</mo> <mn>2</mn></msup></mrow><annotation encoding="application/x-tex">|c|^2</annotation></semantics></math>$ ” should be instead the area enclosed by $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>c</mi></mrow><annotation encoding="application/x-tex">c</annotation></semantics></math>$ . But having thought about it a little more, I realized those limits don’t really make sense unless the integrals are invariant under reparametrization. So maybe the exterior differential doesn’t really make sense except for degree-1 1-forms? And is there any sort of commutative differential on 2-forms? Would we hope or expect it to behave in any particular way? It feels weird to me that we have the world of cogerm 1-forms with the commutative $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>d</mi></mrow><annotation encoding="application/x-tex">d</annotation></semantics></math>$ , and the world of exterior forms with the exterior $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>d</mi><mo>∧</mo></mrow><annotation encoding="application/x-tex">d\wedge</annotation></semantics></math>$ , which agree in the world of linear degree-1 1-forms and the differential of functions, but are thereafter completely unrelated.
- CommentRowNumber90.
- CommentAuthorTobyBartels
- CommentTimeFeb 20th 2014
- (edited Apr 2nd 2015)
- PermaLink
Author: TobyBartels
Format: MarkdownItex>Can an absolute 1-form be regarded as a cojet form like $|d x|$ defined by $$\langle {|\omega|} ; c\rangle = {\Big|\langle \omega ; c\rangle\Big|}?$$ I would certainly accept this definition of ${|\omega|}$ in line with the previous discussion of $f(\omega)$ (where $\omega$ is a cojet form, or more generally a finite list of such, and $f$ is a differentiable function); there's no reason that $f$ has to be differentiable (we just can't conclude that $f(\omega)$ is differentiable). So I guess that your question is: if $\omega$ is an exterior $1$-form, then is this ${|\omega|}$ the absolute $1$-form called ${|\omega|}$ on the [[absolute differential form]] page? And the answer is Yes; at least, it certainly does the right thing to a curve. But not every absolute $1$-form arises in this way! Besides multiplying by an arbitrary $0$-form (so that an absolute $1$-form need not be positive semidefinite), even some positive definite forms, such as $\sqrt{\mathrm{d}x^2 + \mathrm{d}y^2}$, don't arise in this way. Nevertheless, any absolute $1$-form does have an action on curves (via their tangent vectors, if you follow the definition at [[absolute differential form]]), and this is homogeneous of degree $1$, so your integration formula does integrate them.

> Can an absolute 1-form be regarded as a cojet form like $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">|</mo><mi>d</mi><mi>x</mi><mo stretchy="false">|</mo></mrow><annotation encoding="application/x-tex">|d x|</annotation></semantics></math>$ defined by
> $<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><mo stretchy="false">⟨</mo><mrow><mo stretchy="false">|</mo><mi>ω</mi><mo stretchy="false">|</mo></mrow><mo>;</mo><mi>c</mi><mo stretchy="false">⟩</mo><mo>=</mo><mrow><mo maxsize="1.8em" minsize="1.8em">|</mo><mo stretchy="false">⟨</mo><mi>ω</mi><mo>;</mo><mi>c</mi><mo stretchy="false">⟩</mo><mo maxsize="1.8em" minsize="1.8em">|</mo></mrow><mo>?</mo></mrow><annotation encoding="application/x-tex">\langle {|\omega|} ; c\rangle = {\Big|\langle \omega ; c\rangle\Big|}?</annotation></semantics></math>$

I would certainly accept this definition of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mrow><mo stretchy="false">|</mo><mi>ω</mi><mo stretchy="false">|</mo></mrow></mrow><annotation encoding="application/x-tex">{|\omega|}</annotation></semantics></math>$ in line with the previous discussion of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>ω</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f(\omega)</annotation></semantics></math>$ (where $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ω</mi></mrow><annotation encoding="application/x-tex">\omega</annotation></semantics></math>$ is a cojet form, or more generally a finite list of such, and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ is a differentiable function); there's no reason that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ has to be differentiable (we just can't conclude that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>ω</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f(\omega)</annotation></semantics></math>$ is differentiable).

So I guess that your question is: if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ω</mi></mrow><annotation encoding="application/x-tex">\omega</annotation></semantics></math>$ is an exterior $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ -form, then is this $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mrow><mo stretchy="false">|</mo><mi>ω</mi><mo stretchy="false">|</mo></mrow></mrow><annotation encoding="application/x-tex">{|\omega|}</annotation></semantics></math>$ the absolute $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ -form called $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mrow><mo stretchy="false">|</mo><mi>ω</mi><mo stretchy="false">|</mo></mrow></mrow><annotation encoding="application/x-tex">{|\omega|}</annotation></semantics></math>$ on the absolute differential form page? And the answer is Yes; at least, it certainly does the right thing to a curve.

But not every absolute $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ -form arises in this way! Besides multiplying by an arbitrary $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>0</mn></mrow><annotation encoding="application/x-tex">0</annotation></semantics></math>$ -form (so that an absolute $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ -form need not be positive semidefinite), even some positive definite forms, such as $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msqrt><mrow><mi mathvariant="normal">d</mi><msup><mi>x</mi> <mn>2</mn></msup><mo>+</mo><mi mathvariant="normal">d</mi><msup><mi>y</mi> <mn>2</mn></msup></mrow></msqrt></mrow><annotation encoding="application/x-tex">\sqrt{\mathrm{d}x^2 + \mathrm{d}y^2}</annotation></semantics></math>$ , don't arise in this way.

Nevertheless, any absolute $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ -form does have an action on curves (via their tangent vectors, if you follow the definition at absolute differential form), and this is homogeneous of degree $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ , so your integration formula does integrate them.
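
One way to see why the Euclidean length element is not of the form $|\omega|$ is the following short check. It is a sketch that assumes the forms act on tangent vectors in the plane; the coefficients $a$, $b$ and the vector $v$ are names introduced here for illustration.

```latex
% Assume \omega = a\,dx + b\,dy with (a,b) \neq (0,0), and take the nonzero vector v = (-b, a).
|\omega|(v) = \bigl| a\cdot(-b) + b\cdot a \bigr| = 0,
\qquad\text{whereas}\qquad
\sqrt{\mathrm{d}x^2 + \mathrm{d}y^2}\,(v) = \sqrt{a^2 + b^2} > 0 .
```

So $|\omega|$ always vanishes on some nonzero vector, while $\sqrt{\mathrm{d}x^2 + \mathrm{d}y^2}$ vanishes only on the zero vector (and $\omega = 0$ gives the zero form), so the two cannot agree.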
- CommentRowNumber91.
- CommentAuthorTobyBartels
- CommentTimeFeb 20th 2014
- PermaLink
Author: TobyBartels
Format: MarkdownItex>It feels weird to me that we have the world of cogerm 1-forms with the commutative $d$, and the world of exterior forms with the exterior $d\wedge$, which agree in the world of linear degree-1 1-forms and the differential of functions, but are thereafter completely unrelated. There is some more overlap if you look at symmetric bilinear forms (rather than only the antisymmetric ones that are exterior $2$-forms). Some cojet (or cogerm) forms are linear, and these agree with the exterior $1$-forms; but some cojet forms are quadratic, and these agree with the symmetric bilinear forms. Of course, these are viewed as functions of different things, but they are equivalent by the [[polarization identities]]. An arbitrary bilinear form is then given by a quadratic cojet form together with an exterior $2$-form. This doesn't go so easily into higher rank.

> It feels weird to me that we have the world of cogerm 1-forms with the commutative $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>d</mi></mrow><annotation encoding="application/x-tex">d</annotation></semantics></math>$ , and the world of exterior forms with the exterior $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>d</mi><mo>∧</mo></mrow><annotation encoding="application/x-tex">d\wedge</annotation></semantics></math>$ , which agree in the world of linear degree-1 1-forms and the differential of functions, but are thereafter completely unrelated.

There is some more overlap if you look at symmetric bilinear forms (rather than only the antisymmetric ones that are exterior $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>2</mn></mrow><annotation encoding="application/x-tex">2</annotation></semantics></math>$ -forms). Some cojet (or cogerm) forms are linear, and these agree with the exterior $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ -forms; but some cojet forms are quadratic, and these agree with the symmetric bilinear forms. Of course, these are viewed as functions of different things, but they are equivalent by the polarization identities. An arbitrary bilinear form is then given by a quadratic cojet form together with an exterior $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>2</mn></mrow><annotation encoding="application/x-tex">2</annotation></semantics></math>$ -form.

This doesn't go so easily into higher rank.
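
For reference, the rank-2 decomposition described above can be written out as follows. This is a sketch over the reals; the names $B$, $Q$, $B_{\mathrm{sym}}$ and $B_{\mathrm{alt}}$ are introduced here for illustration.

```latex
% Assumed notation: B is an arbitrary bilinear form, Q its associated quadratic form.
Q(v) = B(v,v), \qquad
B_{\mathrm{sym}}(u,v) = \tfrac12\bigl( Q(u+v) - Q(u) - Q(v) \bigr)
                      = \tfrac12\bigl( B(u,v) + B(v,u) \bigr),
\\
B_{\mathrm{alt}}(u,v) = \tfrac12\bigl( B(u,v) - B(v,u) \bigr), \qquad
B = B_{\mathrm{sym}} + B_{\mathrm{alt}} .
```

Here $Q$ plays the role of the quadratic cojet form and $B_{\mathrm{alt}}$ that of the exterior $2$-form; the first line is the polarization identity recovering the symmetric part from $Q$.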
- CommentRowNumber92.
- CommentAuthorMike Shulman
- CommentTimeFeb 24th 2014
- PermaLink
Author: Mike Shulman
Format: MarkdownItexI thought it was about time to record some of this discussion, so I created [[cogerm differential form]].

I thought it was about time to record some of this discussion, so I created cogerm differential form.
- CommentRowNumber93.
- CommentAuthorTobyBartels
- CommentTimeFeb 24th 2014
- PermaLink
Author: TobyBartels
Format: MarkdownItexLooks good! I discussed it in [a thread dedicated to it](http://nforum.mathforge.org/discussion/5700/cogerm-forms/). (Mike already noticed this, but I record it for the sake of future generations.)

Looks good! I discussed it in a thread dedicated to it. (Mike already noticed this, but I record it for the sake of future generations.)
- CommentRowNumber94.
- CommentAuthorMike Shulman
- CommentTimeMay 18th 2015
- PermaLink
Author: Mike Shulman
Format: MarkdownItexRe: #87, the sum of the coefficients of terms in $\mathrm{d}^n f$ involving $f^{(k)}(x)$ is the [Stirling number of the second kind](http://en.wikipedia.org/wiki/Stirling_numbers_of_the_second_kind) $S(n,k)$: the number of ways to partition an $n$-element set into $k$ nonempty subsets. The coefficients themselves are simply the further classification of these partitions according to the multiset of cardinalities of the $k$ nonempty subsets (which feels like it ought to have something to do with Young tableaux). This is more obvious if we use the [coflare differentials](http://nforum.mathforge.org/comments.php?DiscussionID=5941) where $d_1 d_0 \neq d_0 d_1$: then none of the terms can be combined, and each term like $\mathrm{d}_{2}\mathrm{d}_0 x \, \mathrm{d}_3 \mathrm{d}_1 x$ evidently represents a *particular* partition of an $n$-element set into $k$ nonempty subsets.

Re: #87, the sum of the coefficients of terms in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mi>n</mi></msup><mi>f</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}^n f</annotation></semantics></math>$ involving $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>f</mi> <mrow><mo stretchy="false">(</mo><mi>k</mi><mo stretchy="false">)</mo></mrow></msup><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f^{(k)}(x)</annotation></semantics></math>$ is the Stirling number of the second kind $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>S</mi><mo stretchy="false">(</mo><mi>n</mi><mo>,</mo><mi>k</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">S(n,k)</annotation></semantics></math>$ : the number of ways to partition an $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>n</mi></mrow><annotation encoding="application/x-tex">n</annotation></semantics></math>$ -element set into $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>k</mi></mrow><annotation encoding="application/x-tex">k</annotation></semantics></math>$ nonempty subsets. The coefficients themselves are simply the further classification of these partitions according to the multiset of cardinalities of the $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>k</mi></mrow><annotation encoding="application/x-tex">k</annotation></semantics></math>$ nonempty subsets (which feels like it ought to have something to do with Young tableaux). 
This is more obvious if we use the coflare differentials where $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>d</mi> <mn>1</mn></msub><msub><mi>d</mi> <mn>0</mn></msub><mo>≠</mo><msub><mi>d</mi> <mn>0</mn></msub><msub><mi>d</mi> <mn>1</mn></msub></mrow><annotation encoding="application/x-tex">d_1 d_0 \neq d_0 d_1</annotation></semantics></math>$ : then none of the terms can be combined, and each term like $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi mathvariant="normal">d</mi> <mn>2</mn></msub><msub><mi mathvariant="normal">d</mi> <mn>0</mn></msub><mi>x</mi><mspace width="0.16667em"/><msub><mi mathvariant="normal">d</mi> <mn>3</mn></msub><msub><mi mathvariant="normal">d</mi> <mn>1</mn></msub><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}_{2}\mathrm{d}_0 x \, \mathrm{d}_3 \mathrm{d}_1 x</annotation></semantics></math>$ evidently represents a particular partition of an $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>n</mi></mrow><annotation encoding="application/x-tex">n</annotation></semantics></math>$ -element set into $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>k</mi></mrow><annotation encoding="application/x-tex">k</annotation></semantics></math>$ nonempty subsets.
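
The counting claim can be tested directly. Below is a minimal sketch using the standard formula for the number of set partitions with a prescribed multiset of block sizes, $n! / (\prod_j i_j! \cdot \prod_r m_r!)$, where $m_r$ is the number of blocks of size $r$. The function names are hypothetical and chosen only for this illustration.

```python
from collections import Counter
from math import factorial


def partitions_with_block_sizes(sizes):
    """Count set partitions of an n-element set whose blocks have the given sizes.

    sizes is a sequence of positive integers summing to n; its order is ignored.
    """
    denominator = 1
    for size in sizes:
        denominator *= factorial(size)
    # Blocks of equal size are interchangeable, so divide by their permutations.
    for multiplicity in Counter(sizes).values():
        denominator *= factorial(multiplicity)
    return factorial(sum(sizes)) // denominator


def integer_partitions(n, k, largest=None):
    """Yield the partitions of n into exactly k positive parts, as non-increasing tuples."""
    if largest is None:
        largest = n
    if k == 0:
        if n == 0:
            yield ()
        return
    for first in range(min(n - k + 1, largest), 0, -1):
        for rest in integer_partitions(n - first, k - 1, first):
            yield (first,) + rest


def stirling2(n, k):
    """Stirling number of the second kind, summed over block-size multisets."""
    return sum(partitions_with_block_sizes(p) for p in integer_partitions(n, k))


if __name__ == "__main__":
    for sizes in integer_partitions(5, 3):
        print(sizes, partitions_with_block_sizes(sizes))
    print("S(5,3) =", stirling2(5, 3))
```

For $n = 5$ and $k = 3$ the block-size multisets $(3,1,1)$ and $(2,2,1)$ give 10 and 15, matching the coefficients of $dx^2 \cdot d^3x$ and $dx \cdot (d^2x)^2$ in #87, and their sum is $S(5,3) = 25$.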
- CommentRowNumber95.
- CommentAuthorTobyBartels
- CommentTimeMay 19th 2015
- (edited May 19th 2015)
- PermaLink
Author: TobyBartels
Format: MarkdownItexIn coflare differentials, I don't think that $\mathrm{d}_0\mathrm{d}_1x$ makes sense at all; in any case, it doesn't show up in $\mathrm{d}^{n}f(x)$. That's just as well, since the Stirling number doesn't count $\{\{0,1\}\}$ and $\{\{1,0\}\}$ as distinct partitions of $2$ into $1$ nonempty subset.

In coflare differentials, I don't think that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi mathvariant="normal">d</mi> <mn>0</mn></msub><msub><mi mathvariant="normal">d</mi> <mn>1</mn></msub><mi>x</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}_0\mathrm{d}_1x</annotation></semantics></math>$ makes sense at all; in any case, it doesn't show up in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mi>n</mi></msup><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">\mathrm{d}^{n}f(x)</annotation></semantics></math>$ . That's just as well, since the Stirling number doesn't count $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">{</mo><mo stretchy="false">{</mo><mn>0</mn><mo>,</mo><mn>1</mn><mo stretchy="false">}</mo><mo stretchy="false">}</mo></mrow><annotation encoding="application/x-tex">\{\{0,1\}\}</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">{</mo><mo stretchy="false">{</mo><mn>1</mn><mo>,</mo><mn>0</mn><mo stretchy="false">}</mo><mo stretchy="false">}</mo></mrow><annotation encoding="application/x-tex">\{\{1,0\}\}</annotation></semantics></math>$ as distinct partitions of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>2</mn></mrow><annotation encoding="application/x-tex">2</annotation></semantics></math>$ into $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ nonempty subset.
- CommentRowNumber96.
- CommentAuthorMike Shulman
- CommentTimeMay 20th 2015
- PermaLink
Author: Mike Shulman
Format: MarkdownItex
Yes, that's true; I think I meant to say something like $d_1d_0 \neq d_2d_0$.

- CommentRowNumber97.
- CommentAuthorTobyBartels
- CommentTimeMay 25th 2015
- PermaLink
Author: TobyBartels
Format: MarkdownItex
Or simply that $\mathrm{d}_0 \neq \mathrm{d}_1$. Either will do, since the first nontrivial coefficient comes from combining $\mathrm{d}_2\mathrm{d}_1x \,\mathrm{d}_0x$, $\mathrm{d}_1x \,\mathrm{d}_2\mathrm{d}_0x$, and $\mathrm{d}_2x \,\mathrm{d}_1\mathrm{d}_0x$, and any two of these terms already differ from each other in two places.

- CommentRowNumber98.
- CommentAuthorTobyBartels
- CommentTimeJul 19th 2015
- PermaLink
Author: TobyBartels
Format: MarkdownItex
On the subject of partial derivatives, John Denker makes the interesting point that
$$ \Big(\frac{\partial{u}}{\partial{x}}\Big)_{y,z} = \frac{\mathrm{d}u \wedge \mathrm{d}y \wedge \mathrm{d}z}{\mathrm{d}x \wedge \mathrm{d}y \wedge \mathrm{d}z} $$
at <http://www.av8n.com/physics/partial-derivative.htm#sec-wedge-ratio>. This is easy enough to verify by calculation, but also check out the pictorial explanation.
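The "verify by calculation" step can also be checked numerically. Here is a minimal sketch, assuming that a wedge product of three 1-forms on a 3-dimensional space can be represented by the determinant of their coefficient rows with respect to auxiliary coordinates $(a, b, c)$. The coordinate functions `x`, `y`, `z`, the test function `F`, and the sample point `p` are hypothetical choices for illustration only; none of them come from Denker's page.

```python
import math

def grad(f, p, h=1e-6):
    """Central-difference gradient of f at the point p (a tuple of floats)."""
    g = []
    for i in range(len(p)):
        hi, lo = list(p), list(p)
        hi[i] += h
        lo[i] -= h
        g.append((f(*hi) - f(*lo)) / (2 * h))
    return g

def det3(m):
    """Determinant of a 3x3 matrix given as three rows."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

# Hypothetical coordinates x, y, z as functions of auxiliary parameters (a, b, c).
def x(a, b, c): return a + b * c
def y(a, b, c): return a * b + c
def z(a, b, c): return a - b + c * c

# Hypothetical u = F(x, y, z), with its partial in x (y and z held fixed) known exactly.
def F(x_, y_, z_): return x_ ** 2 * y_ + x_ * math.sin(z_)
def dF_dx(x_, y_, z_): return 2 * x_ * y_ + math.sin(z_)

def u(a, b, c): return F(x(a, b, c), y(a, b, c), z(a, b, c))

def wedge_ratio(p):
    """(du ^ dy ^ dz) / (dx ^ dy ^ dz), each wedge taken as a 3x3 determinant."""
    du, dx, dy, dz = (grad(f, p) for f in (u, x, y, z))
    return det3([du, dy, dz]) / det3([dx, dy, dz])

p = (0.7, -0.4, 1.3)                  # arbitrary sample point
lhs = dF_dx(x(*p), y(*p), z(*p))      # (du/dx) with y, z held fixed
rhs = wedge_ratio(p)
print(lhs, rhs)
```

The two printed values should agree up to finite-difference error. The check only makes sense at points where $\mathrm{d}x \wedge \mathrm{d}y \wedge \mathrm{d}z \neq 0$, that is, where $(x, y, z)$ really is a local coordinate system.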

- CommentRowNumber99.
- CommentAuthorTobyBartels
- CommentTimeNov 4th 2020
- PermaLink
Author: TobyBartels
Format: MarkdownItex
Trying to make the previous comment work with second derivatives:

Suppose that $ u $ is a function of $ x $. Then
$$ \mathrm { d } u = \frac { \partial u } { \partial x } \, \mathrm { d } x ,$$
so
$$ \frac { \partial u } { \partial x } = \frac { \mathrm { d } u } { \mathrm { d } x } .$$
Thus,
$$ \frac { \partial ^ 2 u } { \partial x ^ 2 } = \frac { \partial \left ( \frac { \partial u } { \partial x } \right ) } { \partial x } = \frac { \mathrm { d } \left ( \frac { \mathrm { d } u } { \mathrm { d } x } \right ) } { \mathrm { d } x } ,$$
which expands to
$$ \frac { \partial ^ 2 u } { \partial x ^ 2 } = \frac { \mathrm { d } x \, \mathrm { d } ^ 2 u - \mathrm { d } u \, \mathrm { d } ^ 2 x } { \mathrm { d } x ^ 3 } .$$
On the other hand,
$$ \mathrm { d } ^ 2 u = \frac { \partial ^ 2 u } { \partial x ^ 2 } \, \mathrm { d } x ^ 2 + \frac { \partial u } { \partial x } \, \mathrm { d } ^ 2 x ,$$
so
$$ \mathrm { d } ^ 2 u \wedge \mathrm { d } ^ 2 x = \frac { \partial ^ 2 u } { \partial x ^ 2 } \, \mathrm { d } x ^ 2 \wedge \mathrm { d } ^ 2 x ,$$
so
$$ \frac { \partial ^ 2 u } { \partial x ^ 2 } = \frac { \mathrm { d } ^ 2 u \wedge \mathrm { d } ^ 2 x } { \mathrm { d } x ^ 2 \wedge \mathrm { d } ^ 2 x } .$$

Now suppose that $ u $ is a function of $ x $ and $ y $. Then
$$ \mathrm { d } u = \frac { \partial u } { \partial x } \, \mathrm { d } x + \frac { \partial u } { \partial y } \, \mathrm { d } y ,$$
so
$$ \mathrm { d } u \wedge \mathrm { d } y = \frac { \partial u } { \partial x } \, \mathrm { d } x \wedge \mathrm { d } y ,$$
so
$$ \frac { \partial u } { \partial x } = \frac { \mathrm { d } u \wedge \mathrm { d } y } { \mathrm { d } x \wedge \mathrm { d } y } .$$
Thus,
$$ \frac { \partial ^ 2 u } { \partial x ^ 2 } = \frac { \partial \left ( \frac { \partial u } { \partial x } \right ) } { \partial x } = \frac { \mathrm { d } \left ( \frac { \mathrm { d } u \wedge \mathrm { d } y } { \mathrm { d } x \wedge \mathrm { d } y } \right ) \wedge \mathrm { d } y } { \mathrm { d } x \wedge \mathrm { d } y } ,$$
which unfortunately can't be expanded without abandoning the $ \wedge $ notation.

On the other hand,
$$ \mathrm { d } ^ 2 u = \frac { \partial ^ 2 u } { \partial x ^ 2 } \, \mathrm { d } x ^ 2 + 2 \frac { \partial ^ 2 u } { \partial x \partial y } \, \mathrm { d } x \, \mathrm { d } y + \frac { \partial ^ 2 u } { \partial y ^ 2 } \, \mathrm { d } y ^ 2 + \frac { \partial u } { \partial x } \, \mathrm { d } ^ 2 x + \frac { \partial u } { \partial y } \, \mathrm { d } ^ 2 y ,$$
so
$$ \mathrm { d } ^ 2 u \wedge \mathrm { d } x \mathrm { d } y \wedge \mathrm { d } y ^ 2 \wedge \mathrm { d } ^ 2 x \wedge \mathrm { d } ^ 2 y = \frac { \partial ^ 2 u } { \partial x ^ 2 } \, \mathrm { d } x ^ 2 \wedge \mathrm { d } x \mathrm { d } y \wedge \mathrm { d } y ^ 2 \wedge \mathrm { d } ^ 2 x \wedge \mathrm { d } ^ 2 y ,$$
so
$$ \frac { \partial ^ 2 u } { \partial x ^ 2 } = \frac { \mathrm { d } ^ 2 u \wedge \mathrm { d } x \mathrm { d } y \wedge \mathrm { d } y ^ 2 \wedge \mathrm { d } ^ 2 x \wedge \mathrm { d } ^ 2 y } { \mathrm { d } x ^ 2 \wedge \mathrm { d } x \mathrm { d } y \wedge \mathrm { d } y ^ 2 \wedge \mathrm { d } ^ 2 x \wedge \mathrm { d } ^ 2 y } .$$
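The one-variable quotient formula $(\mathrm{d}x \, \mathrm{d}^2u - \mathrm{d}u \, \mathrm{d}^2x) / \mathrm{d}x^3$ can be sanity-checked numerically. This is only a sketch, under the assumption that the differentials are evaluated along a curve $t \mapsto x(t)$, so that $\mathrm{d}x$ and $\mathrm{d}^2x$ become the first and second $t$-derivatives (that is, $\mathrm{d}t = 1$ and $\mathrm{d}^2t = 0$). The curve `x`, the function `f`, the sample parameter `t0`, and the step size are hypothetical choices, not anything from the thread.

```python
import math

def d1(g, t, h=1e-4):
    """Central-difference first derivative of g at t."""
    return (g(t + h) - g(t - h)) / (2 * h)

def d2(g, t, h=1e-4):
    """Central-difference second derivative of g at t."""
    return (g(t + h) - 2 * g(t) + g(t - h)) / (h * h)

def x(t): return t ** 3 + 2 * t      # hypothetical curve, deliberately nonlinear so d^2x != 0
def f(s): return math.sin(s)         # hypothetical u = f(x)
def f2(s): return -math.sin(s)       # exact second derivative of f
def u(t): return f(x(t))

t0 = 0.5
dx, ddx = d1(x, t0), d2(x, t0)       # stand-ins for dx and d^2x along the curve
du, ddu = d1(u, t0), d2(u, t0)       # stand-ins for du and d^2u along the curve

quotient = (dx * ddu - du * ddx) / dx ** 3
exact = f2(x(t0))
print(quotient, exact)
```

The curve is chosen with $\mathrm{d}^2x \neq 0$ on purpose: for a curve with $\mathrm{d}^2x = 0$ the correction term vanishes and the naive quotient $\mathrm{d}^2u / \mathrm{d}x^2$ would pass the check as well.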

- CommentRowNumber100.
- CommentAuthorTobyBartels
- CommentTimeDec 17th 2020
- (edited Dec 17th 2020)
- PermaLink
Author: TobyBartels
Format: MarkdownItex
Re #87:

The coefficients appearing here are those that appear in [Bell polynomials](https://en.wikipedia.org/wiki/Bell_polynomials#Examples_2), and they are well known (although not by me, until yesterday) both to come from counting partitions and to give a formula for the higher derivatives of a composite function, [Faà di Bruno's formula](https://en.wikipedia.org/wiki/Fa%C3%A0_di_Bruno%27s_formula). This formula gives the higher cojet differentials of $f(x)$, where $f$ is a real-valued function of a real variable, differentiable at least $n$ times, and $x$ is a real-valued quantity (technically a real-valued function on some manifold), also differentiable at least $n$ times:
$$ \mathrm{d}^n\big(f(x)\big) = \sum_\pi f^{({|\pi|})}(x) \prod_{B\in{\pi}} \mathrm{d}^{|B|}x ,$$
where the sum is taken over the set of all partitions of $\{1,\ldots,n\}$, each partition $\pi$ being thought of as a subset of the powerset of $\{1,\ldots,n\}$ (so that both $\pi$ and any $B \in \pi$ have a cardinality given by ${|{\cdot}|}$).

A [partly multivariable version](https://en.wikipedia.org/wiki/Fa%C3%A0_di_Bruno%27s_formula#Multivariate_version) of the formula may be adapted to coflare forms. First some notation: if $A = \{i_1,i_2,\ldots,i_n\}$ is a finite [[multisubset]] of $\mathbb{N}$, then write $\mathrm{d}^A{u}$ for $\mathrm{d}_{i_1}\mathrm{d}_{i_2}{\cdots}\mathrm{d}_{i_n}u$ (which is unambiguously defined if $u$ is at least $n$ times differentiable). Also, if $B \subseteq \{1,\ldots,n\}$ (a set, not any multiset), then let $i_B$ be $\{i_j \;|\; j \in B\}$ (a multiset). With this notation,
$$ \mathrm{d}^A\big(f(x)\big) = \sum_\pi f^{({|\pi|})}(x) \prod_{B\in{\pi}} \mathrm{d}^{i_B}x ,$$
a partial decategorification of the cojet version.

A fully multivariable version of the formula would also allow $f$ to be a function of $m$ variables, with $\mathrm{d}\big(f(x_1,\ldots,x_m)\big) = \nabla{f}(x_1,\ldots,x_m) \cdot \langle{\mathrm{d}x_1,\ldots,\mathrm{d}x_m}\rangle = \sum_{j=1}^m \mathrm{D}_j{f}(x_1,\ldots,x_m) \mathrm{d}x_j$ as the order-$1$ case, but I haven't tried to think that through yet.

ETA: You can take $A$ and $i_B$ to be tuples rather than multisets, if you prefer. But the order doesn't matter, just as with partial derivatives.
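The cojet formula can be illustrated by brute force. Below is a small sketch that enumerates the partitions of $\{1,\ldots,n\}$ and tallies them by block sizes; each tally is then the coefficient of the corresponding term $f^{(|\pi|)}(x) \prod_{B\in\pi} \mathrm{d}^{|B|}x$. The helper names `set_partitions` and `faa_di_bruno_terms` are made up for this example, and the recursion is just one convenient way to enumerate partitions, not anything prescribed by the formula.

```python
from collections import Counter

def set_partitions(items):
    """Yield every partition of the list `items` as a list of blocks (lists)."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for partition in set_partitions(rest):
        # Put `first` into each existing block in turn ...
        for i in range(len(partition)):
            yield partition[:i] + [[first] + partition[i]] + partition[i + 1:]
        # ... or give it a new block of its own.
        yield [[first]] + partition

def faa_di_bruno_terms(n):
    """Map each sorted tuple of block sizes to the number of partitions having it.

    A key such as (1, 2) stands for the term f''(x) dx d^2x: two blocks,
    hence the second derivative of f, with differentials of orders 1 and 2.
    """
    counts = Counter()
    for partition in set_partitions(list(range(1, n + 1))):
        counts[tuple(sorted(len(block) for block in partition))] += 1
    return dict(counts)

for sizes, coefficient in sorted(faa_di_bruno_terms(3).items()):
    print(coefficient, "x", "f^(%d)" % len(sizes), "with d-orders", sizes)
```

For $n = 3$ the tallies are $1$ for block sizes $(3)$, $3$ for $(1,2)$, and $1$ for $(1,1,1)$, matching $\mathrm{d}^3\big(f(x)\big) = f'''(x)\,\mathrm{d}x^3 + 3 f''(x)\,\mathrm{d}x\,\mathrm{d}^2x + f'(x)\,\mathrm{d}^3x$. The coefficient $3$ is the one that arises from the three coflare terms combining when all the $\mathrm{d}_i$ are identified.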

Re #87:

The coefficients appearing here are those that appear in Bell polynomials, and they are well known (although not by me, until yesterday) both to come from counting partitions and to give a formula for the higher derivatives of a composite function, Faà di Bruno's formula. This formula gives the higher cojet differentials of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">f(x)</annotation></semantics></math>$ , where $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ is a real-valued function of a real variable, differentiable at least $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>n</mi></mrow><annotation encoding="application/x-tex">n</annotation></semantics></math>$ times, and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ is a real-valued quantity (technically a real-valued function on some manifold), also differentiable at least $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>n</mi></mrow><annotation encoding="application/x-tex">n</annotation></semantics></math>$ times:
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mi>n</mi></msup><mo maxsize="1.2em" minsize="1.2em">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo maxsize="1.2em" minsize="1.2em">)</mo><mo>=</mo><munder><mo lspace="0.16667em" rspace="0.16667em">∑</mo> <mi>π</mi></munder><msup><mi>f</mi> <mrow><mo stretchy="false">(</mo><mrow><mo stretchy="false">|</mo><mi>π</mi><mo stretchy="false">|</mo></mrow><mo stretchy="false">)</mo></mrow></msup><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><munder><mo lspace="0.16667em" rspace="0.16667em">∏</mo> <mrow><mi>B</mi><mo>∈</mo><mi>π</mi></mrow></munder><msup><mi mathvariant="normal">d</mi> <mrow><mo stretchy="false">|</mo><mi>B</mi><mo stretchy="false">|</mo></mrow></msup><mi>x</mi><mo>,</mo></mrow><annotation encoding="application/x-tex"> \mathrm{d}^n\big(f(x)\big) = \sum_\pi f^{({|\pi|})}(x) \prod_{B\in{\pi}} \mathrm{d}^{|B|}x ,</annotation></semantics></math>$
where the sum is taken over the set of all partitions of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">{</mo><mn>1</mn><mo>,</mo><mi>…</mi><mo>,</mo><mi>n</mi><mo stretchy="false">}</mo></mrow><annotation encoding="application/x-tex">\{1,\ldots,n\}</annotation></semantics></math>$ , each partition $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>π</mi></mrow><annotation encoding="application/x-tex">\pi</annotation></semantics></math>$ being thought of as a subset of the powerset of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">{</mo><mn>1</mn><mo>,</mo><mi>…</mi><mo>,</mo><mi>n</mi><mo stretchy="false">}</mo></mrow><annotation encoding="application/x-tex">\{1,\ldots,n\}</annotation></semantics></math>$ (so that both $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>π</mi></mrow><annotation encoding="application/x-tex">\pi</annotation></semantics></math>$ and any $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi><mo>∈</mo><mi>π</mi></mrow><annotation encoding="application/x-tex">B \in \pi</annotation></semantics></math>$ have a cardinality given by $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mrow><mo stretchy="false">|</mo><mo>⋅</mo><mo stretchy="false">|</mo></mrow></mrow><annotation encoding="application/x-tex">{|{\cdot}|}</annotation></semantics></math>$ ).

A partly multivariable version of the formula may be adapted to coflare forms. First some notation: if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>A</mi><mo>=</mo><mo stretchy="false">{</mo><msub><mi>i</mi> <mn>1</mn></msub><mo>,</mo><msub><mi>i</mi> <mn>2</mn></msub><mo>,</mo><mi>…</mi><mo>,</mo><msub><mi>i</mi> <mi>n</mi></msub><mo stretchy="false">}</mo></mrow><annotation encoding="application/x-tex">A = \{i_1,i_2,\ldots,i_n\}</annotation></semantics></math>$ is a finite multisubset of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>ℕ</mi></mrow><annotation encoding="application/x-tex">\mathbb{N}</annotation></semantics></math>$ , then write $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mi>A</mi></msup><mi>u</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}^A{u}</annotation></semantics></math>$ for $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi mathvariant="normal">d</mi> <mrow><msub><mi>i</mi> <mn>1</mn></msub></mrow></msub><msub><mi mathvariant="normal">d</mi> <mrow><msub><mi>i</mi> <mn>2</mn></msub></mrow></msub><mi>⋯</mi><msub><mi mathvariant="normal">d</mi> <mrow><msub><mi>i</mi> <mi>n</mi></msub></mrow></msub><mi>u</mi></mrow><annotation encoding="application/x-tex">\mathrm{d}_{i_1}\mathrm{d}_{i_2}{\cdots}\mathrm{d}_{i_n}u</annotation></semantics></math>$ (which is unambiguously defined if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi></mrow><annotation encoding="application/x-tex">u</annotation></semantics></math>$ is at least $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>n</mi></mrow><annotation encoding="application/x-tex">n</annotation></semantics></math>$ times differentiable). 
Also, if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi><mo>⊆</mo><mo stretchy="false">{</mo><mn>1</mn><mo>,</mo><mi>…</mi><mo>,</mo><mi>n</mi><mo stretchy="false">}</mo></mrow><annotation encoding="application/x-tex">B \subseteq \{1,\ldots,n\}</annotation></semantics></math>$ (a set, not any multiset), then let $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>i</mi> <mi>B</mi></msub></mrow><annotation encoding="application/x-tex">i_B</annotation></semantics></math>$ be $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">{</mo><msub><mi>i</mi> <mi>j</mi></msub><mspace width="0.27778em"/><mo stretchy="false">|</mo><mspace width="0.27778em"/><mi>j</mi><mo>∈</mo><mi>B</mi><mo stretchy="false">}</mo></mrow><annotation encoding="application/x-tex">\{i_j \;|\; j \in B\}</annotation></semantics></math>$ (a multiset). With this notation,
$<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><semantics><mrow><msup><mi mathvariant="normal">d</mi> <mi>A</mi></msup><mo maxsize="1.2em" minsize="1.2em">(</mo><mi>f</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo maxsize="1.2em" minsize="1.2em">)</mo><mo>=</mo><munder><mo lspace="0.16667em" rspace="0.16667em">∑</mo> <mi>π</mi></munder><msup><mi>f</mi> <mrow><mo stretchy="false">(</mo><mrow><mo stretchy="false">|</mo><mi>π</mi><mo stretchy="false">|</mo></mrow><mo stretchy="false">)</mo></mrow></msup><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><munder><mo lspace="0.16667em" rspace="0.16667em">∏</mo> <mrow><mi>B</mi><mo>∈</mo><mi>π</mi></mrow></munder><msup><mi mathvariant="normal">d</mi> <mrow><msub><mi>i</mi> <mi>B</mi></msub></mrow></msup><mi>x</mi><mo>,</mo></mrow><annotation encoding="application/x-tex"> \mathrm{d}^A\big(f(x)\big) = \sum_\pi f^{({|\pi|})}(x) \prod_{B\in{\pi}} \mathrm{d}^{i_B}x ,</annotation></semantics></math>$
a partial decategorification of the cojet version.
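
As a sanity check, here is how the formula unwinds for small $n$. This is only a sketch, and it assumes that $\pi$ ranges over the set partitions of $\{1,\ldots,n\}$, with $|\pi|$ the number of blocks (as in the usual Faà di Bruno formula).

```latex
% n = 2, A = {i_1, i_2}: the two partitions are {{1,2}} and {{1},{2}}.
\[
  \mathrm{d}_{i_1}\mathrm{d}_{i_2}\big(f(x)\big)
    = f'(x)\,\mathrm{d}_{i_1}\mathrm{d}_{i_2}x
    + f''(x)\,\mathrm{d}_{i_1}x\,\mathrm{d}_{i_2}x .
\]

% n = 3, A = {i_1, i_2, i_3}: five partitions (one with 1 block,
% three with 2 blocks, one with 3 blocks).
\begin{align*}
  \mathrm{d}_{i_1}\mathrm{d}_{i_2}\mathrm{d}_{i_3}\big(f(x)\big)
    &= f'(x)\,\mathrm{d}_{i_1}\mathrm{d}_{i_2}\mathrm{d}_{i_3}x \\
    &\quad + f''(x)\,\big(
         \mathrm{d}_{i_1}\mathrm{d}_{i_2}x\,\mathrm{d}_{i_3}x
       + \mathrm{d}_{i_1}\mathrm{d}_{i_3}x\,\mathrm{d}_{i_2}x
       + \mathrm{d}_{i_2}\mathrm{d}_{i_3}x\,\mathrm{d}_{i_1}x
       \big) \\
    &\quad + f'''(x)\,\mathrm{d}_{i_1}x\,\mathrm{d}_{i_2}x\,\mathrm{d}_{i_3}x .
\end{align*}
```

If all the indices coincide, the three middle terms merge and the $n = 3$ case collapses to $\mathrm{d}^3\big(f(x)\big) = f'(x)\,\mathrm{d}^3x + 3f''(x)\,\mathrm{d}x\,\mathrm{d}^2x + f'''(x)\,\mathrm{d}x^3$, which is the ordinary single-differential version.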

A fully multivariable version of the formula would also allow $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ to be a function of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>m</mi></mrow><annotation encoding="application/x-tex">m</annotation></semantics></math>$ variables, with $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi mathvariant="normal">d</mi><mo maxsize="1.2em" minsize="1.2em">(</mo><mi>f</mi><mo stretchy="false">(</mo><msub><mi>x</mi> <mn>1</mn></msub><mo>,</mo><mi>…</mi><mo>,</mo><msub><mi>x</mi> <mi>m</mi></msub><mo stretchy="false">)</mo><mo maxsize="1.2em" minsize="1.2em">)</mo><mo>=</mo><mo>∇</mo><mi>f</mi><mo stretchy="false">(</mo><msub><mi>x</mi> <mn>1</mn></msub><mo>,</mo><mi>…</mi><mo>,</mo><msub><mi>x</mi> <mi>m</mi></msub><mo stretchy="false">)</mo><mo>⋅</mo><mo stretchy="false">⟨</mo><mrow><mi mathvariant="normal">d</mi><msub><mi>x</mi> <mn>1</mn></msub><mo>,</mo><mi>…</mi><mo>,</mo><mi mathvariant="normal">d</mi><msub><mi>x</mi> <mi>m</mi></msub></mrow><mo stretchy="false">⟩</mo><mo>=</mo><msubsup><mo lspace="0.16667em" rspace="0.16667em">∑</mo> <mrow><mi>j</mi><mo>=</mo><mn>1</mn></mrow> <mi>m</mi></msubsup><msub><mi mathvariant="normal">D</mi> <mi>j</mi></msub><mi>f</mi><mo stretchy="false">(</mo><msub><mi>x</mi> <mn>1</mn></msub><mo>,</mo><mi>…</mi><mo>,</mo><msub><mi>x</mi> <mi>m</mi></msub><mo stretchy="false">)</mo><mi mathvariant="normal">d</mi><msub><mi>x</mi> <mi>j</mi></msub></mrow><annotation encoding="application/x-tex">\mathrm{d}\big(f(x_1,\ldots,x_m)\big) = \nabla{f}(x_1,\ldots,x_m) \cdot \langle{\mathrm{d}x_1,\ldots,\mathrm{d}x_m}\rangle = \sum_{j=1}^m \mathrm{D}_j{f}(x_1,\ldots,x_m) \mathrm{d}x_j</annotation></semantics></math>$ as the order- $<math xmlns="http://www.w3.org/1998/Math/MathML" 
display="inline"><semantics><mrow><mn>1</mn></mrow><annotation encoding="application/x-tex">1</annotation></semantics></math>$ case, but I haven't tried to think that through yet.

ETA: You can take $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>A</mi></mrow><annotation encoding="application/x-tex">A</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>i</mi> <mi>B</mi></msub></mrow><annotation encoding="application/x-tex">i_B</annotation></semantics></math>$ to be tuples rather than multisets, if you prefer. But the order doesn't matter, just as with partial derivatives.
