Tuesday, September 13, 2022

Yet another post on simplicity

I see that I finally arrive to an end of my journey in formalizing in Lean the simplicity of the alternating group in 5 letters or more, so it may be a good time to summarize what I did, from the mathematical side. 

On a first blog post, “Not simple proofs of simplicity”, I had described my initial plan, but it was not clear at that time that I would either arrive at a final proof, nor that I would be able to formalize it in Lean. In fact, a few weeks after I had started this experiment, I doubted I would make it and went on formalizing the traditional proof that the alternating group is simple. I added a few simplifications—which I was later told were already explained in Jacobson's Basic Algebra, say that's life…– leading to “The very simple proof that the alternating groups of five letters (or more) is simple”. I managed to formalize that proof at the end of 2021, and spent a lot of energy of the 8 next months to formalize the proof that I initially had in mind.

As I had already explained, the goal/constraint is to apply the Iwasawa criterion to the alternating group. This criterion says that if a group GG acts primitively on a set XX, and if we attach to each point xXx\in X a commutative subgroup TxTx of GG, in such a way that T(gx)=gTxg1T(g\cdot x)=g\cdot Tx\cdot g^{-1} for every gGg\in G and every xXx\in X, and if the subgroups TxTx generate GG, then every normal subgroup of GG that acts nontrivially on XX contains the commutator subgroup. We take G=AnG=\mathfrak A_n. For n5n\geq 5, its commutator subgroup is An\mathfrak A_n itself (for example because any two 3-cycles are conjugated; in particular, a 3-cycle is conjugate to its square, which implies that it maps to 11 in the abelianization of An\mathfrak A_n). So we need to get primitive actions of An\mathfrak A_n and commutative subgroups. 

One of the equivalent criteria for primitivity of a transitive actions is that the stabilizers of points are maximal subgroups. As I had explained at the end of the first post, the maximal subgroups of Sn\mathfrak S_n and An\mathfrak A_n are known by the O'Nan–Scott theorem, combined with its converse which is a theorem of Liebeck, Praeger and Saxl. These theorems give a precise list of the maximal subgroups of Sn\mathfrak S_n and An\mathfrak A_n, of which the first entry is precisely Sp×Snp\mathfrak S_p\times \mathfrak S_{n-p} (where the first factor acts on {1;;p}\{1;\dots;p\} and the second acts on {p+1;;n}\{p+1;\dots;n\}) and its intersection with An\mathfrak A_n, if 0<p<n0<p<n and n2pn\neq 2p.

We need to understand the limitation n2pn\neq 2p, the point being that if n=2pn=2p, the subgroup Sp×Sp\mathfrak S_p\times\mathfrak S_p is not maximal in S2p\mathfrak S_{2p}, it is a subgroup of index 2 of a “wreath product” obtained by adding one permutation that exchanges the two blocks {1,,p}\{1,\dots,p\} and {p+1,,2p}\{p+1,\dots,2p\}, for example (1p+1)(2p+2)(p2p)(1\,p+1)(2\,p+2)\dots (p\,2p). This group is the second entry in the O'Nan–Scott theorem.

These two entries are labelled as intransitive and imprimitive respectively, because Sp×Snp\mathfrak S_p\times \mathfrak S_{n-p} has two orbits on {1;;n}\{1;\dots;n\}, while the wreath product is transitive but it preserves the partition consisting of the two blocks {1,,p}\{1,\dots,p\} and {p+1,,2p}\{p+1,\dots,2p\}.

These two entries seem to be obvious to the group theorists. It is given without proof in the paper of Liebeck, Praeger and Saxl.

The case of Sn\mathfrak S_n is easy, and occupies a subsection of Wilson's book on Finite Simple Groups. It is even funny to prove by hand, and not so hard to formalize in Lean. Take a subgroup KK of Sn\mathfrak S_n such that Sp×SnpK\mathfrak S_p\times \mathfrak S_{n-p} \subsetneq K and let us prove that K=SnK=\mathfrak S_n.  To that end, it suffices to show that KK contains any transposition (ab)(a\,b). This is obvious if both aa and bb belong to {1;;p}\{1;\dots;p\} or if they both belong to {p+1;dots;n}\{p+1;dots;n\}, so assume that a{1;;p}a\in\{1;\dots;p\} and b{p+1;;n}b\in\{p+1;\dots;n\}. Since KK does not stabilize {1;;p}\{1;\dots;p\}, there is x{1;;p}x\in\{1;\dots;p\} and kKk\in K such that y=kx{p+1;;n}y=k\cdot x \in\{p+1;\dots;n\}. If n>2pn>2p, there exists z{p+1;;n}z\in\{p+1;\dots;n\} such that zyz\neq y and t=k1z{p+1;;n}t=k^{-1}\cdot z\in\{p+1;\dots;n\}; from the relation k1(yz)k=(xt)k^{-1} \cdot (y\,z) \cdot k=(x\,t) and the fact that (yz)Sp×Snp(y\,z)\in \mathfrak S_p\times\mathfrak S_{n-p}, we deduce that (xt)(x\,t) belongs to KK. This gives us one transposition of the desired form; finally, the relation (ab)=h(xt)h1(a\,b)=h (x\,t) h^{-1} with h=(xa)(tb)Sp×Snph=(x\,a)(t\,b)\in\mathfrak S_p\times\mathfrak S_{n-p} shows that (ab)K(a\,b)\in K. The other case, n<2pn<2p is symmetric.

Bizarrely, the analogous result for the alternating group looked more difficult to me, although some colleague assured me that it could be done, an other one that I could certainly do it, and a last one did it for n>7n>7. Since Liebeck, Praeger and Saxl gave no reference at all, I asked Liebeck about and he explained me a short proof that uses totally different ideas.

Let G=AnG=\mathfrak A_n or Sn\mathfrak S_n and consider a subgroup KK such that (Sp×Snp)GKG(\mathfrak S_p\times\mathfrak S_{n-p})\cap G \subsetneq K\subseteq G; we wish to prove that K=GK=G. Arguments as given above already show that KK acts transitively on {1;;n}\{1;\dots;n\}. But we can do more: it acts primitively. Now, one just needs to invoke a 1870 theorem of Jordan: a primitive subgroup of Sn\mathfrak S_n that contains a transposition is Sn\mathfrak S_n, and a primitive subgroup of Sn\mathfrak S_n that contains a 3-cycle contains An\mathfrak A_n!

To prove that KK acts primitively, it is convenient to use the standard definition of a primitive action. If a group GG acts on a set XX, call block of the action a nonempty subset BB of XX which, for every gGg\in G, is either fixed or moved to a disjoint subset by GG; it follows from the definition that the translates of a block by the action form a partition of XX. Singletons are blocks, the full subset is a block, and one definition of a primitive action is that the only blocks are these trivial ones (and XX is nonempty). Orbits are blocks, so that a primitive action is transitive. Conversely, one can prove that if the action is transitive, then it is primitive if and only if stabilizers of points in XX are maximal subgroups. A more general result is that for every point aXa\in X, associating with a set BB its stabilizer GBG_B gives a bijection from the set of blocks that contain aa to the set of subgroups of GG that contain GaG_a, with inverse bijection associating with a subgroup KK containing GaG_a the orbit KaK\cdot a, and these bijections preserve inclusion. 

Proof. — Let B,BB,B' be blocks such that BBB\subseteq B' and let gGBg\in G_B; then gBg\cdot B' contains gB=Bg\cdot B=B, hence gBg\cdot B' is not disjoint from BB', so that gB=Bg\cdot B'=B' by definition of a block. This proves that GBG_B is a subgroup of GB G_{B'}.

Let BB be a block that contains aa; then GBa=BG_B \cdot a=B. Indeed, the inclusion GBaBG_B\cdot a\subseteq B follows from the definition of GBG_B. To prove the other inclusion, let bBb\in B. Since the action is transitive, there exists gGg\in G such that ga=bg\cdot a=b; then gBg\cdot B and BB both contain bb, hence gB=Bg\cdot B=B, so that gGBg\in G_B and bGBab\in G_B\cdot a.

Finally, let KK a a subgroup of GG containing GaG_a and let B=KaB=K\cdot a. Let us prove that BB is a block such that K=GBK=G_B. Let gGg\in G such that gBg\cdot B and BB are not disjoint; let b,cBb,c\in B be such that b=gcb=g\cdot c; write b=kab=k\cdot a and c=hac=h\cdot a for k,hKk,h\in K. Then ka=ghak\cdot a = gh\cdot a so that k1ghGak^{-1}gh\in G_a, hence k1ghKk^{-1}gh\in K; we conclude that gKg\in K, hence gB=gKa=Ka=B.g\cdot B=gK\cdot a = K\cdot a=B. So BB is a block. This also shows that GBKG_B\subseteq K, and the converse inclusion is obvious.

Going back to our initial problem, it remains to show that the action of KK on {1;;n}\{1;\dots;n\} only has trivial blocks. The proof uses two remarks.

  1. The trace of a block on {1;;p}\{1;\dots;p\}, respectively {p+1;;n}\{p+1;\dots;n\}, is either a singleton, or all of it. Indeed, this trace is a block for the induced action of (Sp×Snp)G(\mathfrak S_p\times\mathfrak S_{n-p})\cap G on {1;;p}\{1;\dots;p\} (respectively {p+1;;n}\{p+1;\dots;n\}), and this action contains that of Ap\mathfrak A_p (respectively…) and even that of Sp\mathfrak S_p if pn1p\neq n-1. On the other hand, the symmetric group acts 2-transitively, hence primitively.  (The cases p=1p=1 or p=n1p=n-1 need minor adjustements.)
  2. If 2p<n2p<n, then no nontrivial block can contain {p+1;;n}\{p+1;\dots;n\}. Indeed, there is not enough space in the complementary subset so that disjoint translates of this block make a partition of {1;;n}\{1;\dots;n\}.

Let us now conclude the proof. (I still find the following argument a bit convoluted but have nothing really better to propose yet.) Consider a block B{1;;n}B\subset\{1;\dots;n\} for the action of KK, and assume that BB is not a singleton, nor the full set. If BB meets {p+1;;n}\{p+1;\dots;n\} in at least two elements, then it contains {p+1;;n}\{p+1;\dots;n\}, hence is the full block, a contradiction. If BB meets {1;;p}\{1;\dots;p\} in at least two elements, then it contains {1;;p}\{1;\dots;p\}, and some disjoint translate of it  is contained in {p+1;;n}\{p+1;\dots;n\}; this translate is a block that contains {p+1;;n}\{p+1;\dots;n\}, hence is the full set, so that the initial block is the full set as well.  By similar arguments, BB meets both {1;;p}\{1;\dots;p\} and {p+1;;n}\{p+1;\dots;n\} in exactly one element, and the same hold for any translate kBk\cdot B of BB. However, using the hypothesis that pnpp\neq n-p and that KK strictly contains (Sp×Snp)G(\mathfrak S_p\times\mathfrak S_{n-p})\cap G, we find kKk\in K such that kBk\cdot B meets {p+1;;n}\{p+1;\dots;n\} in at least two elements, and we can conclude as earlier that BB is the full set.

To terminate this blog spot, I need to say something about Jordan's theorem. Jordan was concerned about the concept multiple transitivity: a group GG acting on a set XX is mm-transitive if whenever systems of distinct elements a1,,ama_1,\dots,a_m on the one side, b1,,bmb_1,\dots,b_m on the other side, are given, there exists gGg\in G such that ga1=b1,gam=bmg\cdot a_1=b_1,\dots g \cdot a_m=b_m (one assumes here that mCard(X)m\leq {\mathrm{Card}(X)}). Many theorems from this time (Matthieu, Bertrand, Serret, Jordan…), partly in relation with Galois theory of equations, aim at limiting the multiple transitivity of subgroups of the symmetric group. The symmetric group itself is nn-transitive, if n=Card(X)n={\mathrm {Card}(X)}, the alternating group is (n2)(n-2)-transitive, and other subgroups have to be much less transitive.

The general result of Jordan, proved in the Note C (page 664) to §398 of his Traité des substitutions et des équations algébriques (1870, Gauthier-Villars)  is that a primitive subgroup of Sn\mathfrak S_n containing a cycle of prime order pp is np+1n-p+1-transitive. For p=2p=2, we get that this subgroup is (n1)(n-1)-transitive, hence is Sn\mathfrak S_n; for p=3p=3, we get that it is (n2)(n-2)-transitive, and that implies that it contains the alternating group An\mathfrak A_n. I formalized these results in Lean, following the presentation of Wielandt's book on Finite permutation groups (theorem 13.3 of that reference). A later theorem of Jordan (1873; see theorem 13.9 in Wielandt's book) asserts that such a subgroup always contains the alternating group provided np3n-p\geq 3; I have not (not yet?) formalized it in Lean.

All in all, this gives a fairly sophisticated proof that the alternating group is simple. One of its merit is to follow a general line, that applies to many other groups. In particular, Iwasawa's criterion is also used by Wilson in his book Finite simple groups to prove that the simplicity of the Mathieu groups M11,M12M_{11}, M_{12}, and of many other finite groups.

I just opened Jordan's book to write this blog post. Let me add that it contains (§85) another proof of simplicity of the alternating group, and I will try to explain it in a later post.