Not signed in

Want to take part in these discussions? Sign in if you have an account, or apply for one below

Site Tag Cloud

Vanilla 1.1.10 is a product of Lussumo. More Information: Documentation, Community Support.

Welcome to nForum
If you want to take part in these discussions either sign in now (if you have an account), apply for one now (if you don't).

Atrium > Mathematics, Physics & Philosophy: beginner's questions on coherence

Bottom of Page

1 to 7 of 7

- CommentRowNumber1.
- CommentAuthorYaron
- CommentTimeJul 10th 2011
- PermaLink
Author: Yaron
Format: MarkdownItexRecently, I read the proof of the coherence theorem for monoidal categories in CWM (Sec. VII.2). After a rather long struggle, I hope that I finally understand this proof (although my own summary turned out to be unbelievably long). I would very much like to verify that I did not get it completely wrong, and so I have a couple of question regarding this proof. (I hope it is appropriate to ask this in the $n$Forum, and I sincerely apologize if this is not the case). So, here goes: 1. In the beginning of Sec. VII.2 of CWM, it is emphasized that coherence does not concern particular monoidal categories, and it is only possible to prove that every ''formal'' diagram commutes. If I understand correctly a hint from Tom Leinster's course notes, one example where a ''too general coherence'' fails may be extracted from Isbell's proof that moving to skeleta does not guarantee that all monoidal categories are strict (there, it is shown that if $D$ is the denumerable set in a skeleton of $\mathbf{Set}$, then the parallel arrows $1, \alpha: D\times(D\times D)\to (D\times D)\times D$ cannot be equal). Is there also an example of a category in which two distinct associators turn out to be parallel yet unequal? 1. I was very confused by the following sentences in CWM (p. 166): ''\dots while its edges $v\to w$ are to be identical with certain arrows $v_b\to w_b$ in $B$. Call them the ''basic'' arrows. Here, each instance $\alpha\colon u_b\otimes(v_b \otimes w_b)\to (u_b \otimes v_b) \otimes w_b$ of associativity and each instance of $\alpha^{-1}$ is basic ..." But if we refer to _all_ arrows of $B$ of the above kind, aren't we getting exactly into the sort of problem described in Item 1 above? For example, isn't is possible that $u_b\otimes(v_b\otimes w_b) = u'_b\otimes(v'_b\otimes w'_b)$ and $(u_b\otimes v_b)\otimes w_b = (u'_b\otimes v'_b)\otimes w'_b$ in some particular monoidal category in which the parallel arrows $\alpha_{u_b,v_b,w_b}$ and $\alpha_{u'_b,v'_b,w'_b}$ are not equal? 1. If I understand correctly, one should define some ''syntactic'' arrows inductively (together with dom and cod functions that assign to each such ''syntactic'' arrow an appropriate binary word, as well as a way to interpret them in $B$), and the coherence is only for those arrows of $B$ that are interpretations of the syntactic arrows. Is this idea correct? I'm afraid I've overcomplicated matters here, but I couldn't find a different solution. 1. The part of the proof concerning the reduction from the general case to the case of associators only (on p. 168) is mentioned very briefly. It seems to me that to prove both steps of this part (first assuring that all associators in a path are at the end, and then to prove coherence for the first part of the path (that now has only unitors)), requires a not so short induction if all details are written down. Am I missing something here? 1. Looking around, I've seen some helpful remarks of Todd on the web that refer to a simpler proof of Joyal and Street (which I haven't read yet). I've also seen that there are some ''purely logic'' proofs that should be simple (I haven't read them either). My question is: Is it still recommended to read Mac Lane's proof when learning coherence for the first time? 1. Finally, where can I find ML's original 1963 paper? It doesn't seem to appear anywhere on the web. Thanks very much in advance! Yaron
Recently, I read the proof of the coherence theorem for monoidal categories in CWM (Sec. VII.2). After a rather long struggle, I hope that I finally understand this proof (although my own summary turned out to be unbelievably long). I would very much like to verify that I did not get it completely wrong, and so I have a couple of question regarding this proof. (I hope it is appropriate to ask this in the $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>n</mi></mrow><annotation encoding="application/x-tex">n</annotation></semantics></math>$ Forum, and I sincerely apologize if this is not the case).

So, here goes:
1. In the beginning of Sec. VII.2 of CWM, it is emphasized that coherence does not concern particular monoidal categories, and it is only possible to prove that every ”formal” diagram commutes. If I understand correctly a hint from Tom Leinster’s course notes, one example where a ”too general coherence” fails may be extracted from Isbell’s proof that moving to skeleta does not guarantee that all monoidal categories are strict (there, it is shown that if $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>D</mi></mrow><annotation encoding="application/x-tex">D</annotation></semantics></math>$ is the denumerable set in a skeleton of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mstyle mathvariant="bold"><mi>Set</mi></mstyle></mrow><annotation encoding="application/x-tex">\mathbf{Set}</annotation></semantics></math>$ , then the parallel arrows $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1</mn><mo>,</mo><mi>α</mi><mo>:</mo><mi>D</mi><mo>×</mo><mo stretchy="false">(</mo><mi>D</mi><mo>×</mo><mi>D</mi><mo stretchy="false">)</mo><mo>→</mo><mo stretchy="false">(</mo><mi>D</mi><mo>×</mo><mi>D</mi><mo stretchy="false">)</mo><mo>×</mo><mi>D</mi></mrow><annotation encoding="application/x-tex">1, \alpha: D\times(D\times D)\to (D\times D)\times D</annotation></semantics></math>$ cannot be equal).
  Is there also an example of a category in which two distinct associators turn out to be parallel yet unequal?
2. I was very confused by the following sentences in CWM (p. 166): ”\dots while its edges $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>v</mi><mo>→</mo><mi>w</mi></mrow><annotation encoding="application/x-tex">v\to w</annotation></semantics></math>$ are to be identical with certain arrows $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>v</mi> <mi>b</mi></msub><mo>→</mo><msub><mi>w</mi> <mi>b</mi></msub></mrow><annotation encoding="application/x-tex">v_b\to w_b</annotation></semantics></math>$ in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ . Call them the ”basic” arrows. Here, each instance $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>α</mi><mo lspace="0.11111em">:</mo><msub><mi>u</mi> <mi>b</mi></msub><mo>⊗</mo><mo stretchy="false">(</mo><msub><mi>v</mi> <mi>b</mi></msub><mo>⊗</mo><msub><mi>w</mi> <mi>b</mi></msub><mo stretchy="false">)</mo><mo>→</mo><mo stretchy="false">(</mo><msub><mi>u</mi> <mi>b</mi></msub><mo>⊗</mo><msub><mi>v</mi> <mi>b</mi></msub><mo stretchy="false">)</mo><mo>⊗</mo><msub><mi>w</mi> <mi>b</mi></msub></mrow><annotation encoding="application/x-tex">\alpha\colon u_b\otimes(v_b \otimes w_b)\to (u_b \otimes v_b) \otimes w_b</annotation></semantics></math>$ of associativity and each instance of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>α</mi> <mrow><mo lspace="0.11111em" rspace="0em">−</mo><mn>1</mn></mrow></msup></mrow><annotation encoding="application/x-tex">\alpha^{-1}</annotation></semantics></math>$ is basic …” But if we refer to all arrows of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ of the above kind, aren’t we getting exactly into the sort of problem described in Item 1 above? For example, isn’t is possible that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>u</mi> <mi>b</mi></msub><mo>⊗</mo><mo stretchy="false">(</mo><msub><mi>v</mi> <mi>b</mi></msub><mo>⊗</mo><msub><mi>w</mi> <mi>b</mi></msub><mo stretchy="false">)</mo><mo>=</mo><mi>u</mi><msub><mo>′</mo> <mi>b</mi></msub><mo>⊗</mo><mo stretchy="false">(</mo><mi>v</mi><msub><mo>′</mo> <mi>b</mi></msub><mo>⊗</mo><mi>w</mi><msub><mo>′</mo> <mi>b</mi></msub><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">u_b\otimes(v_b\otimes w_b) = u'_b\otimes(v'_b\otimes w'_b)</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><msub><mi>u</mi> <mi>b</mi></msub><mo>⊗</mo><msub><mi>v</mi> <mi>b</mi></msub><mo stretchy="false">)</mo><mo>⊗</mo><msub><mi>w</mi> <mi>b</mi></msub><mo>=</mo><mo stretchy="false">(</mo><mi>u</mi><msub><mo>′</mo> <mi>b</mi></msub><mo>⊗</mo><mi>v</mi><msub><mo>′</mo> <mi>b</mi></msub><mo stretchy="false">)</mo><mo>⊗</mo><mi>w</mi><msub><mo>′</mo> <mi>b</mi></msub></mrow><annotation encoding="application/x-tex">(u_b\otimes v_b)\otimes w_b = (u'_b\otimes v'_b)\otimes w'_b</annotation></semantics></math>$ in some particular monoidal category in which the parallel arrows $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>α</mi> <mrow><msub><mi>u</mi> <mi>b</mi></msub><mo>,</mo><msub><mi>v</mi> <mi>b</mi></msub><mo>,</mo><msub><mi>w</mi> <mi>b</mi></msub></mrow></msub></mrow><annotation encoding="application/x-tex">\alpha_{u_b,v_b,w_b}</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>α</mi> <mrow><mi>u</mi><msub><mo>′</mo> <mi>b</mi></msub><mo>,</mo><mi>v</mi><msub><mo>′</mo> <mi>b</mi></msub><mo>,</mo><mi>w</mi><msub><mo>′</mo> <mi>b</mi></msub></mrow></msub></mrow><annotation encoding="application/x-tex">\alpha_{u'_b,v'_b,w'_b}</annotation></semantics></math>$ are not equal?
3. If I understand correctly, one should define some ”syntactic” arrows inductively (together with dom and cod functions that assign to each such ”syntactic” arrow an appropriate binary word, as well as a way to interpret them in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ ), and the coherence is only for those arrows of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ that are interpretations of the syntactic arrows. Is this idea correct? I’m afraid I’ve overcomplicated matters here, but I couldn’t find a different solution.
4. The part of the proof concerning the reduction from the general case to the case of associators only (on p. 168) is mentioned very briefly. It seems to me that to prove both steps of this part (first assuring that all associators in a path are at the end, and then to prove coherence for the first part of the path (that now has only unitors)), requires a not so short induction if all details are written down. Am I missing something here?
5. Looking around, I’ve seen some helpful remarks of Todd on the web that refer to a simpler proof of Joyal and Street (which I haven’t read yet). I’ve also seen that there are some ”purely logic” proofs that should be simple (I haven’t read them either). My question is: Is it still recommended to read Mac Lane’s proof when learning coherence for the first time?
6. Finally, where can I find ML’s original 1963 paper? It doesn’t seem to appear anywhere on the web.
Thanks very much in advance!

Yaron
- CommentRowNumber2.
- CommentAuthorTodd_Trimble
- CommentTimeJul 11th 2011
- (edited Jul 11th 2011)
- PermaLink
Author: Todd_Trimble
Format: MarkdownItexHi Yaron -- I do think Mac Lane is a fine first source from which to read a proof of the coherence theorem. I don't know where to find that Rice University study, where Mac Lane first published his proof, anywhere online. But in some sense that shouldn't matter, because it turned out that there was some redundancy in his description of monoidal categories, which Max Kelly straightened out in 1964. The proof in CWM is not a bad source, except that it wimps out slightly when it comes to treating the unit, as you noted in your point 4. The overall idea of the proof is however sound. One comment which I hope will clarify matters is that the coherence theorem is really all about the structure of the *free* monoidal category generated by a single object, denoted $F[1]$. If you have experience with building free structures of other types (e.g., free groups), the construction of a free monoidal category, in this case on one object, is similar in spirit, but maybe just slightly more involved. It's maybe not a bad idea to work through that construction once. The free monoidal category generated by a set of objects is quite similar, and in fact there is a systematic way to build the free monoidal category generated by a *category* $C$, by means of something called a "wreath product" of $F[1]$ and $C$. Anyway, the correct formal statement of Mac Lane's theorem is that all diagrams in $F[1]$ commute. That is, that any two morphisms which share the same domain and codomain in $F[1]$ are equal. My recommendation then is that you try to read through the proof one more time as applying specifically to $F[1]$. If $x$ denotes the generating object, then objects of $F[1]$ are formal bracketed words in $x$ and $I$ such as $x \otimes ((x \otimes I) \otimes x)$. The morphisms of $F[1]$ are equivalence classes of directed paths in a certain directed graph whose vertices are the objects. The edges in this directed graph are defined by recursion: * If $u$, $v$, $w$ are objects, then there is an edge labeled $\alpha_{u v w}$ from $(u \otimes v) \otimes w$ to $u \otimes (v \otimes w)$. Similarly, there are graph edges labeled $\alpha_{u v w}^{-1}$ and $\rho_u$, $\rho_{u}^{-1}$, $\lambda_u$, $\lambda_{u}^{-1}$. These are the "basic edges". * If there is an edge labeled $f$ from $u$ to $v$, then for any object $w$ there are edges labeled $f \otimes w: u \otimes w \to v \otimes w$ and $w \otimes f: w \otimes u \to w \otimes v$. (These give "expanded instances" of associators, unitors, etc.). Any morphism in $F[1]$ can be built up as a formal composite of these expanded instances of basic edges -- the directed paths play the role of the formal composites -- but we need to impose an equivalence relation on directed paths to force the evident $\otimes$ to be functorial, $\alpha$ to be natural with inverse $\alpha^{-1}$, the coherence conditions to hold, and so on. Thus the morphisms are certain equivalence classes of paths. A lot of Mac Lane's proof makes a lot more sense when you hold this picture in mind. For example, ignoring all structure that has to do with units and unitors, as in the first part of Mac Lane's proof, the _rank_ that he uses to perform the induction can be equivalently described as the length of the longest path of expanded instances of $\alpha$ that takes you from a given bracketing of $n$ copies of $x$ to the final bracketing, with all parentheses to the right. Any move along an expanded instance of $\alpha$ will then move to an object of lesser rank. The keystone of his proof is a "diamond lemma" which is very reminiscent of proofs of confluence used in rewriting theory. In each case, completing or filling out the diamond involves a commutative square, and the cases fall into three general patterns, one involving functoriality of $\otimes$, one involving naturality of $\alpha$, and one involving the pentagon condition. I am rediscovering the fact that even though I understand this proof, it isn't so easy to describe in a few words! A different way of visualizing it is to imagine that we are building a 2-dimensional CW complex. The 0-skeleton is the set of objects of $F[1]$, the 1-skeleton comes from the directed graph, and the 2-skeleton is obtained by adjoining 2-cells whose boundaries are the (commutative) diagrams which express basic instances of functoriality, naturality, and the coherence conditions. The coherence theorem says that this 2-dim CW complex has trivial fundamental group in each component. Hope this helps. (?)
Hi Yaron –

I do think Mac Lane is a fine first source from which to read a proof of the coherence theorem. I don’t know where to find that Rice University study, where Mac Lane first published his proof, anywhere online. But in some sense that shouldn’t matter, because it turned out that there was some redundancy in his description of monoidal categories, which Max Kelly straightened out in 1964. The proof in CWM is not a bad source, except that it wimps out slightly when it comes to treating the unit, as you noted in your point 4. The overall idea of the proof is however sound.

One comment which I hope will clarify matters is that the coherence theorem is really all about the structure of the free monoidal category generated by a single object, denoted $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>F</mi><mo stretchy="false">[</mo><mn>1</mn><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">F[1]</annotation></semantics></math>$ . If you have experience with building free structures of other types (e.g., free groups), the construction of a free monoidal category, in this case on one object, is similar in spirit, but maybe just slightly more involved. It’s maybe not a bad idea to work through that construction once. The free monoidal category generated by a set of objects is quite similar, and in fact there is a systematic way to build the free monoidal category generated by a category $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ , by means of something called a “wreath product” of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>F</mi><mo stretchy="false">[</mo><mn>1</mn><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">F[1]</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>C</mi></mrow><annotation encoding="application/x-tex">C</annotation></semantics></math>$ .

Anyway, the correct formal statement of Mac Lane’s theorem is that all diagrams in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>F</mi><mo stretchy="false">[</mo><mn>1</mn><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">F[1]</annotation></semantics></math>$ commute. That is, that any two morphisms which share the same domain and codomain in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>F</mi><mo stretchy="false">[</mo><mn>1</mn><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">F[1]</annotation></semantics></math>$ are equal.

My recommendation then is that you try to read through the proof one more time as applying specifically to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>F</mi><mo stretchy="false">[</mo><mn>1</mn><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">F[1]</annotation></semantics></math>$ . If $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ denotes the generating object, then objects of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>F</mi><mo stretchy="false">[</mo><mn>1</mn><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">F[1]</annotation></semantics></math>$ are formal bracketed words in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>I</mi></mrow><annotation encoding="application/x-tex">I</annotation></semantics></math>$ such as $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi><mo>⊗</mo><mo stretchy="false">(</mo><mo stretchy="false">(</mo><mi>x</mi><mo>⊗</mo><mi>I</mi><mo stretchy="false">)</mo><mo>⊗</mo><mi>x</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">x \otimes ((x \otimes I) \otimes x)</annotation></semantics></math>$ . The morphisms of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>F</mi><mo stretchy="false">[</mo><mn>1</mn><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">F[1]</annotation></semantics></math>$ are equivalence classes of directed paths in a certain directed graph whose vertices are the objects. The edges in this directed graph are defined by recursion:
- If $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi></mrow><annotation encoding="application/x-tex">u</annotation></semantics></math>$ , $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>v</mi></mrow><annotation encoding="application/x-tex">v</annotation></semantics></math>$ , $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>w</mi></mrow><annotation encoding="application/x-tex">w</annotation></semantics></math>$ are objects, then there is an edge labeled $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>α</mi> <mrow><mi>u</mi><mi>v</mi><mi>w</mi></mrow></msub></mrow><annotation encoding="application/x-tex">\alpha_{u v w}</annotation></semantics></math>$ from $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">(</mo><mi>u</mi><mo>⊗</mo><mi>v</mi><mo stretchy="false">)</mo><mo>⊗</mo><mi>w</mi></mrow><annotation encoding="application/x-tex">(u \otimes v) \otimes w</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi><mo>⊗</mo><mo stretchy="false">(</mo><mi>v</mi><mo>⊗</mo><mi>w</mi><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">u \otimes (v \otimes w)</annotation></semantics></math>$ . Similarly, there are graph edges labeled $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msubsup><mi>α</mi> <mrow><mi>u</mi><mi>v</mi><mi>w</mi></mrow> <mrow><mo lspace="0.11111em" rspace="0em">−</mo><mn>1</mn></mrow></msubsup></mrow><annotation encoding="application/x-tex">\alpha_{u v w}^{-1}</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>ρ</mi> <mi>u</mi></msub></mrow><annotation encoding="application/x-tex">\rho_u</annotation></semantics></math>$ , $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msubsup><mi>ρ</mi> <mi>u</mi> <mrow><mo lspace="0.11111em" rspace="0em">−</mo><mn>1</mn></mrow></msubsup></mrow><annotation encoding="application/x-tex">\rho_{u}^{-1}</annotation></semantics></math>$ , $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>λ</mi> <mi>u</mi></msub></mrow><annotation encoding="application/x-tex">\lambda_u</annotation></semantics></math>$ , $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msubsup><mi>λ</mi> <mi>u</mi> <mrow><mo lspace="0.11111em" rspace="0em">−</mo><mn>1</mn></mrow></msubsup></mrow><annotation encoding="application/x-tex">\lambda_{u}^{-1}</annotation></semantics></math>$ . These are the “basic edges”.
- If there is an edge labeled $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi></mrow><annotation encoding="application/x-tex">f</annotation></semantics></math>$ from $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi></mrow><annotation encoding="application/x-tex">u</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>v</mi></mrow><annotation encoding="application/x-tex">v</annotation></semantics></math>$ , then for any object $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>w</mi></mrow><annotation encoding="application/x-tex">w</annotation></semantics></math>$ there are edges labeled $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>f</mi><mo>⊗</mo><mi>w</mi><mo>:</mo><mi>u</mi><mo>⊗</mo><mi>w</mi><mo>→</mo><mi>v</mi><mo>⊗</mo><mi>w</mi></mrow><annotation encoding="application/x-tex">f \otimes w: u \otimes w \to v \otimes w</annotation></semantics></math>$ and $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>w</mi><mo>⊗</mo><mi>f</mi><mo>:</mo><mi>w</mi><mo>⊗</mo><mi>u</mi><mo>→</mo><mi>w</mi><mo>⊗</mo><mi>v</mi></mrow><annotation encoding="application/x-tex">w \otimes f: w \otimes u \to w \otimes v</annotation></semantics></math>$ . (These give “expanded instances” of associators, unitors, etc.).
Any morphism in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>F</mi><mo stretchy="false">[</mo><mn>1</mn><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">F[1]</annotation></semantics></math>$ can be built up as a formal composite of these expanded instances of basic edges – the directed paths play the role of the formal composites – but we need to impose an equivalence relation on directed paths to force the evident $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo>⊗</mo></mrow><annotation encoding="application/x-tex">\otimes</annotation></semantics></math>$ to be functorial, $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>α</mi></mrow><annotation encoding="application/x-tex">\alpha</annotation></semantics></math>$ to be natural with inverse $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msup><mi>α</mi> <mrow><mo lspace="0.11111em" rspace="0em">−</mo><mn>1</mn></mrow></msup></mrow><annotation encoding="application/x-tex">\alpha^{-1}</annotation></semantics></math>$ , the coherence conditions to hold, and so on. Thus the morphisms are certain equivalence classes of paths.

A lot of Mac Lane’s proof makes a lot more sense when you hold this picture in mind. For example, ignoring all structure that has to do with units and unitors, as in the first part of Mac Lane’s proof, the rank that he uses to perform the induction can be equivalently described as the length of the longest path of expanded instances of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>α</mi></mrow><annotation encoding="application/x-tex">\alpha</annotation></semantics></math>$ that takes you from a given bracketing of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>n</mi></mrow><annotation encoding="application/x-tex">n</annotation></semantics></math>$ copies of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>x</mi></mrow><annotation encoding="application/x-tex">x</annotation></semantics></math>$ to the final bracketing, with all parentheses to the right. Any move along an expanded instance of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>α</mi></mrow><annotation encoding="application/x-tex">\alpha</annotation></semantics></math>$ will then move to an object of lesser rank.

The keystone of his proof is a “diamond lemma” which is very reminiscent of proofs of confluence used in rewriting theory. In each case, completing or filling out the diamond involves a commutative square, and the cases fall into three general patterns, one involving functoriality of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo>⊗</mo></mrow><annotation encoding="application/x-tex">\otimes</annotation></semantics></math>$ , one involving naturality of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>α</mi></mrow><annotation encoding="application/x-tex">\alpha</annotation></semantics></math>$ , and one involving the pentagon condition.

I am rediscovering the fact that even though I understand this proof, it isn’t so easy to describe in a few words! A different way of visualizing it is to imagine that we are building a 2-dimensional CW complex. The 0-skeleton is the set of objects of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>F</mi><mo stretchy="false">[</mo><mn>1</mn><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">F[1]</annotation></semantics></math>$ , the 1-skeleton comes from the directed graph, and the 2-skeleton is obtained by adjoining 2-cells whose boundaries are the (commutative) diagrams which express basic instances of functoriality, naturality, and the coherence conditions. The coherence theorem says that this 2-dim CW complex has trivial fundamental group in each component.

Hope this helps. (?)
- CommentRowNumber3.
- CommentAuthorYaron
- CommentTimeJul 11th 2011
- (edited Jul 11th 2011)
- PermaLink
Author: Yaron
Format: MarkdownItexHi Todd, Thank you very much for your answer, it is most appreciated and very helpful! I think that the formal arrows you describe are exactly some of those ''syntactic arrows'' I was talking about. Because I didn’t know of what you explained about the way $F[1]$ is constructed, I only thought of it as a directed graph, with vertices the binary words and edges those arrows $\alpha_{u,v,w}$ (where here $\alpha$ is just a symbol) $\beta\otimes 1_w$ etc. The arrows can be interpreted in a monoidal category $B$, and the pair of functions consisting of interpreting binary words (vertices) and arrows is a morphism of graphs from the above graph to the underlying graph of $B$. Then coherence is the fact that interpreting any two parallel paths and then composing the arrows in $B$ results in two identical arrows in $B$. In CWM, $F[1]$ (which is there called $W$) is already built with a single element in every hom-set, and it is proved that this category is indeed the object part of a universal arrow from $\{-\}$ to $U$, where $U$ is the forgetful functor from$Moncat$ (monoidal categories with strict monoidal functors) to $Set$ (this is the way I understand “free on one element”). I will now try to understand this using the construction in your answer: First to build F[1] as you said (hom-sets are equivalence classes etc.), and then verifying that all diagrams there commute and that we indeed have a universal arrow (i.e., that it is free). If I understand correctly, the way you described to build $F[1]$ can be obtained by first taking the free category on the graph that I have, and then perhaps identifying some arrows. By the way, I understand (from ML’s proof) that coherence (as described in the first paragraph here) implies that ML’s $W$ is the free monoidal category on one element. What I don’t understand is the opposite direction: In ML it is said that coherence is nothing but the statement that $W$ is free, and I don’t immediately see how this implies coherence (every diagram in the above graph commutes when interpreted in $B$). Thanks again, Yaron

Hi Todd,

Thank you very much for your answer, it is most appreciated and very helpful!

I think that the formal arrows you describe are exactly some of those ”syntactic arrows” I was talking about. Because I didn’t know of what you explained about the way $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>F</mi><mo stretchy="false">[</mo><mn>1</mn><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">F[1]</annotation></semantics></math>$ is constructed, I only thought of it as a directed graph, with vertices the binary words and edges those arrows $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><msub><mi>α</mi> <mrow><mi>u</mi><mo>,</mo><mi>v</mi><mo>,</mo><mi>w</mi></mrow></msub></mrow><annotation encoding="application/x-tex">\alpha_{u,v,w}</annotation></semantics></math>$ (where here $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>α</mi></mrow><annotation encoding="application/x-tex">\alpha</annotation></semantics></math>$ is just a symbol) $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>β</mi><mo>⊗</mo><msub><mn>1</mn> <mi>w</mi></msub></mrow><annotation encoding="application/x-tex">\beta\otimes 1_w</annotation></semantics></math>$ etc. The arrows can be interpreted in a monoidal category $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ , and the pair of functions consisting of interpreting binary words (vertices) and arrows is a morphism of graphs from the above graph to the underlying graph of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ . Then coherence is the fact that interpreting any two parallel paths and then composing the arrows in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ results in two identical arrows in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ .

In CWM, $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>F</mi><mo stretchy="false">[</mo><mn>1</mn><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">F[1]</annotation></semantics></math>$ (which is there called $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>W</mi></mrow><annotation encoding="application/x-tex">W</annotation></semantics></math>$ ) is already built with a single element in every hom-set, and it is proved that this category is indeed the object part of a universal arrow from $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mo stretchy="false">{</mo><mo lspace="0.11111em" rspace="0em">−</mo><mo stretchy="false">}</mo></mrow><annotation encoding="application/x-tex">\{-\}</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>U</mi></mrow><annotation encoding="application/x-tex">U</annotation></semantics></math>$ , where $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>U</mi></mrow><annotation encoding="application/x-tex">U</annotation></semantics></math>$ is the forgetful functor from $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>Moncat</mi></mrow><annotation encoding="application/x-tex">Moncat</annotation></semantics></math>$ (monoidal categories with strict monoidal functors) to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>Set</mi></mrow><annotation encoding="application/x-tex">Set</annotation></semantics></math>$ (this is the way I understand “free on one element”). I will now try to understand this using the construction in your answer: First to build F[1] as you said (hom-sets are equivalence classes etc.), and then verifying that all diagrams there commute and that we indeed have a universal arrow (i.e., that it is free). If I understand correctly, the way you described to build $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>F</mi><mo stretchy="false">[</mo><mn>1</mn><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">F[1]</annotation></semantics></math>$ can be obtained by first taking the free category on the graph that I have, and then perhaps identifying some arrows.

By the way, I understand (from ML’s proof) that coherence (as described in the first paragraph here) implies that ML’s $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>W</mi></mrow><annotation encoding="application/x-tex">W</annotation></semantics></math>$ is the free monoidal category on one element. What I don’t understand is the opposite direction: In ML it is said that coherence is nothing but the statement that $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>W</mi></mrow><annotation encoding="application/x-tex">W</annotation></semantics></math>$ is free, and I don’t immediately see how this implies coherence (every diagram in the above graph commutes when interpreted in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ ).

Thanks again, Yaron
- CommentRowNumber4.
- CommentAuthorTodd_Trimble
- CommentTimeJul 13th 2011
- PermaLink
Author: Todd_Trimble
Format: MarkdownItex> I don’t immediately see how this implies coherence (every diagram in the above graph commutes when interpreted in $B$) I'm maybe not sure what the objection is, but my immediate reaction is that we might follow Eric Forgy when he advocated the view some months ago that "a functor preserves commutative diagrams" (as the most intuitive way of saying what a functor is for people with a background similar to his). Does that speak to your concern?

I don’t immediately see how this implies coherence (every diagram in the above graph commutes when interpreted in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ )

I’m maybe not sure what the objection is, but my immediate reaction is that we might follow Eric Forgy when he advocated the view some months ago that “a functor preserves commutative diagrams” (as the most intuitive way of saying what a functor is for people with a background similar to his). Does that speak to your concern?
- CommentRowNumber5.
- CommentAuthorYaron
- CommentTimeJul 13th 2011
- PermaLink
Author: Yaron
Format: MarkdownItexYes, I understand that functors preserve commutative diagrams, so obviously every commutative diagram (that is, every diagram) in $W$ is mapped to a commutative diagram in B (given our unique monoidal functor W-->B). This defines certain diagrams of $B$ that commute, but how can I verify that these diagrams are exactly those obtained from interpreting paths of the "syntactic" graph G_n in $B$? (For me, the more intuitive definition of coherence is that any two paths from $u$ to $v$ in G_n, when interpreted in $B$, compose to the same arrow of $B$) I suppose that I should use the fact that our functor $W\to B$ is _monoidal_ (not just a functor) and induction.

Yes, I understand that functors preserve commutative diagrams, so obviously every commutative diagram (that is, every diagram) in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>W</mi></mrow><annotation encoding="application/x-tex">W</annotation></semantics></math>$ is mapped to a commutative diagram in B (given our unique monoidal functor W–>B). This defines certain diagrams of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ that commute, but how can I verify that these diagrams are exactly those obtained from interpreting paths of the “syntactic” graph G_n in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ ? (For me, the more intuitive definition of coherence is that any two paths from $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>u</mi></mrow><annotation encoding="application/x-tex">u</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>v</mi></mrow><annotation encoding="application/x-tex">v</annotation></semantics></math>$ in G_n, when interpreted in $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ , compose to the same arrow of $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ ) I suppose that I should use the fact that our functor $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>W</mi><mo>→</mo><mi>B</mi></mrow><annotation encoding="application/x-tex">W\to B</annotation></semantics></math>$ is monoidal (not just a functor) and induction.
- CommentRowNumber6.
- CommentAuthorTodd_Trimble
- CommentTimeJul 13th 2011
- (edited Jul 13th 2011)
- PermaLink
Author: Todd_Trimble
Format: MarkdownItexSorry -- don't take my last comment as implying you didn't realize this; it was just a way of drawing you out further, nothing more. I guess we're back to the alternative viewpoint I was offering in my first comment. The $F[1]$ was described in a manifestly syntactic way, and the coherence theorem identifies this monoidal category as monoidally isomorphic to Mac Lane's $W$. A (strict) monoidal functor from $F[1]$ to $B$ obviously interprets paths in the syntactic graph correctly, but since the coherence theorem then identifies this with a strict monoidal functor from $W$ to $B$, all should be well. (hm, strange problems with rendering latex here; can't get arrows to appear)

Sorry – don’t take my last comment as implying you didn’t realize this; it was just a way of drawing you out further, nothing more.

I guess we’re back to the alternative viewpoint I was offering in my first comment. The $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>F</mi><mo stretchy="false">[</mo><mn>1</mn><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">F[1]</annotation></semantics></math>$ was described in a manifestly syntactic way, and the coherence theorem identifies this monoidal category as monoidally isomorphic to Mac Lane’s $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>W</mi></mrow><annotation encoding="application/x-tex">W</annotation></semantics></math>$ . A (strict) monoidal functor from $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>F</mi><mo stretchy="false">[</mo><mn>1</mn><mo stretchy="false">]</mo></mrow><annotation encoding="application/x-tex">F[1]</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ obviously interprets paths in the syntactic graph correctly, but since the coherence theorem then identifies this with a strict monoidal functor from $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>W</mi></mrow><annotation encoding="application/x-tex">W</annotation></semantics></math>$ to $<math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mi>B</mi></mrow><annotation encoding="application/x-tex">B</annotation></semantics></math>$ , all should be well. (hm, strange problems with rendering latex here; can’t get arrows to appear)
- CommentRowNumber7.
- CommentAuthorYaron
- CommentTimeJul 13th 2011
- PermaLink
Author: Yaron
Format: MarkdownItexThanks, Todd! This is exactly the explanation I needed.

Thanks, Todd! This is exactly the explanation I needed.

1 to 7 of 7

nForum

Discussion Feed

Not signed in

Site Tag Cloud

Atrium > Mathematics, Physics & Philosophy: beginner's questions on coherence