Tuesday, December 21, 2021

The very simple proof that the alternating group on 5 letters (or more) is simple

\def\supp {\operatorname{supp}}

As explained in the previous post, I wanted to formalize the proof that the alternating group on 5 letters or more is simple, using the Iwasawa criterion. This formalization is in the middle of nowhere, because it requires a proof that the natural action of the alternating group AnA_n on kk-elements subsets of {1,,n}\{1,\dots,n\} is primitive, provided n2kn\neq 2k, k0,nk\neq 0,n and n5n\geq 5.

A simple proof

There is a simple, slightly computational proof that the alternating group of 5 letters (or more) is simple, it is explained in many textbooks, for example in James Milne's Group theory notes. However, its formalization is not so easy, because even Milne has a lot of implicit stuff.

The situation, as in Lemma 4.36 of Milne's booklet, is the following.

One is given a nontrivial normal subgroup NN of AnA_n and one wishes to prove that N=AnN=A_n. We know that the 3-cycles generate AnA_n. We also know that 3-cycles are pairwise conjugate in the symmetric group, and also, using n5n\geq 5, in the alternating group AnA_n. Consequently, it suffices to prove that NN contains a 3-cycle. To that aim, one argues by induction on the cardinality of the support $\supp(g)$ of a nontrivial element gg of NN

Here is an attempt at presenting the induction argument in a way which is as straightforward as possible.

Since g1g\neq 1, $\supp(g)$ has cardinality 2\geq 2.

If $\supp(g)$ has cardinality 22, then gg is a transposition, contradicting the hypothesis that gg belongs to the alternating group. So the support of gg has cardinality 3\geq 3.

If $\supp(g)$ has cardinality 33, then gg is a 3-cycle, and we are done. So we may assume that the support of gg has cardinality 4\geq 4.

Let us choose an element a{1,,n}a\in\{1,\dots,n\} which belongs to the largest cycle of gg and set b=g(a)b=g(a); by assumption, one has bab\neq a. The proof consists in considering an element c{1,,n}c\in\{1,\dots,n\} such that cac\neq a and cbc\neq b, the 3-cycle h=(abc)h=(a\,b\,c) and the conjugate g=hgh1g'=h g h^{-1}

Since hh is a 3-cycle, it belongs to the alternating group; since NN is normal, one has gNg'\in N.

We wish to apply the induction hypothesis to the element gg1g' g^{-1} of NN. So we need to prove

  1. ggg'\neq g, and
  2. The support of gg1g' g^{-1} has cardinality strictly smaller than the one of gg.

To guarantee (1), that ggg'\neq g, it suffices to choose cc such that g(b)g(b)g'(b)\neq g(b). But g(b)=hgh1(b)=hg(a)=h(b)=x, g'(b) = hgh^{-1}(b) = hg(a) = h(b) = x, so the new assumption we make is that cg(b)c\neq g(b).

The rest of the argument is devoted to finding appropriate conditions on cc that guarantee (2). First, observe the inclusion $\supp(g'g^{-1})\subset g(\supp(h))\cup \supp(h)$, which is proved by contraposition. Indeed, if xx does not belong to the right hand side, then $g^{-1}(x) \notin \supp(h)$, hence h1g1(x)=g1(x)h^{-1}g^{-1}(x)=g^{-1}(x) (for example, using that $\supp(h)=\supp(h^{-1})$), and then gg1(x)=hgh1(g1(x))=hg(g1(x))=h(x)=xg' g^{-1}(x)=hgh^{-1}(g^{-1}(x))=hg(g^{-1}(x))=h(x)=x, since x∉h(x)x\not\in h(x). This proves that the cardinality of the support of gg1g'g^{-1} is at most 6.

However, since g(a)=bg(a)=b belongs both to $g(\supp(h))$ and to $\supp(h)$, the cardinality is at most 5. Explicitly, $\supp(g'g^{-1})\subset \{a,b,c,g(b), g(c)\}$. In particular, a clever choice for cc is only needed when $\supp(g)$ has cardinality 4 or 5!

To conclude in the remaining cases, we remark that there are only two possibilities for the cycle-type of gg: it can only be (5)(5) or (2,2)(2,2), since it is an alternating permutation, and we split the discussion according to these two cases:

  • If the cycle-type of gg is (5)(5), then we choose for cc the “last” element of the cycle of aa, namely c=g1(a)c=g^{-1}(a). Then, g(c)=ag(c)=a, so that $\supp(g'g^{-1})\subset\{a,b,c,g(b)\}$ which has at most 4 elements.
  • If the cycle-type of gg is (2,2)(2,2), then we have g(b)=ag(b)=a and we choose for cc any fixed point of gg. Then $\supp(g'g^{-1})\subset\{a,b,c\}$ has at most 3 elements.

About the formalization

One annoying part for formalizing this argument is the elimination of cycle-types. One would like that a computer assistant is able to list all possible cycle-types of a given size. Presumably it can, by I cannot (yet), so I need to do the argument by hand, for that specific value.

In principle, that argument needs to be spelt out in class too. We use two formulas:

  1. The sum of the length of the cycles is the cardinality of the support, hence 44 or 55 in this case.
  2. The signature of a permutation is even if and only if the number of cycles and the cardinality of the support have the same parity.

One way to write it down consists in taking the length mm of the smallest cycle of gg. One has m2m\geq 2 by assumption.

  1. If there are no other cycles, then the cycle-type of gg is (m)(m). Then m=4m=4 or 55, and only (5)(5) respects the parity constraint.
  2. Otherwise, there is only one other cycles, otherwise the sum of their lengths would be at least 3263\cdot 2\geq 6. If mm' is the length of that other cycle, one has 2mm2\leq m\leq m'. Then 2mm+m52m\leq m+m'\leq 5, hence m2m\leq 2, so that m=2m=2. This gives m3m'\leq 3, giving two cycle-types (2,3)(2,3) and (2,2)(2,2), of which the second one only satisfies the parity constraint.

Monday, December 13, 2021

Not simple proofs of simplicity

The last few weeks, I started my self-education in proof formalization (in Lean) by a test case, the simplicity of the alternating group. In this blog post, I want to discuss some aspects of the underlying mathematics which I think I learnt during the formalization process.

Simple groups. Some examples

This is about groups, eventually finite groups. Let us recall that a subgroup NN of a group GG is normal if for every gGg \in G and nNn\in N, one has gng1Ngng^{-1}\in N. Concretely, this means that the two equivalence relations modulo NN (left or right) coincide, and that the quotient set G/NG/N inherits a group structure that makes the natural projection GG/NG \to G/N a morphism of groups.

In this setting, the group GG can be viewed as an extension of the two groups G/NG/N and NN, which are possibly simpler, and this extension may help at understanding properties of the group GG. At the opposite, one says that GG is simple if it has no normal subgroup besides {e}\{e\} and GG itself (and if G{e}G\neq\{e\}).

Trivial examples of normal subgroups of a group GG are {e}\{e\} and GG itself. Less trivial examples are the center Z(G)Z(G) of GG (those elements gg such that gh=hggh=hg for all hGh\in G), and the derived subgroup D(G)D(G), the subgroup generated by all commutators ghg1h1ghg^{-1}h^{-1}. This commutator subgroup is interesting: any subgroup NN of GG containing the commutator subgroup D(G)D(G) is normal, and the quotient is abelian; and conversely.

The kernel of a group morphism is a normal subgroup, and the construction of a quotient shows that all normal subgroups appear in this way. In particular, the alternating group AnA_n is a normal subgroup of the symmetric group SnS_n.

A simple group has either Z(G)={e}Z(G)=\{e\} or Z(G)=GZ(G)=G; in the latter case, this means that GG is commutative, and it quickly appears that GG has to be finite of prime order. Consequently, our discussion will now concern groups with trivial center.

The concept of simplicity of groups is classically presented in connection with Galois theory, and the question of solvability of equations by radicals. Namely, a “general” polynomial equation of degre nn has SnS_n, the full symmetric group on nn elements, for its Galois group, and, if n5n\geq 5, the only possible dévissage of this group consists in introducing the alternating group AnA_n of even permutations, the subgroup AnA_n being normal and simple. On the other hand, the solvability of polynomial equation by radicals is equivalent to such a dévissage where all successive quotients are cyclic groups (equivalently abelian groups). Since AnA_n is not abelian, this implies that a “general” polynomial equation of degree nn is not solvable by radicals. However, using simplicity of the alternating group is much stronger than what we need: what would be needed is solvability of the symmetric group, and that this does not hold if n5n\geq 5 is much simpler. Consequently, for the problem of resolution by radicals, it suffices to prove that the derived subgroup D(An)D(A_n) of the alternating group is equal to AnA_n.

Theorem. — For n5n\geq 5, one has D(An)=AnD(A_n)=A_n. In particular, the alternating group and the symmetric group on nn letters are not solvable.
I give two (equivalent) proofs, the second one being a computational interpretation of the first one. Both use that the 3-cycles generate AnA_n and are conjugate in AnA_n. The computational proof is shorter, arguingly simpler. As a matter of fact, I never understood it, nor could remember it, until I translated the conceptual proof into the proof assistant.
Consider the morphism p ⁣:AnAn/D(An)p\colon A_n\to A_n/D(A_n). Since An/D(An)A_n/D(A_n) is commutative, all 3-cycles have the same image. Since the square of a 3-cycle is again a 3-cycle, both have the same image. This implies that for every 3-cycle gAng\in A_n, one has p(g)=p(g2)p(g)=p(g^2), hence p(g)=ep(g)=e. Since the 3_cycles generate AnA_n, the morphism pp is trivial; since it is surjective, one has An/D(An)={e}A_n/D(A_n)=\{e\} and D(An)=AnD(A_n)=A_n.
Computationally, consider a 3-cycle gg and its square g2g^2. Since they are conjugate, there exists hAnh\in A_n such that g2=hgh1g^2=hgh^{-1}. Then g=hgh1g1g=hgh^{-1}g^{-1}, so that gg is a commutator; in particular, D(An)D(A_n) contains all commutators and D(An)=AnD(A_n)=A_n.

The remaining cases, for $n\leq 4, are interesting, but classically left as exercises in text books:

  1. One has A1=S1={e}A_1=S_1=\{e\};
  2. The group S2S_2 is the cyclic group of order 2, hence is simple and solvable, and A2A_2 is trivial;
  3. The group S3S_3 is a noncommutative group of order 6, and A3A_3 is a cyclic group of order 2.
  4. The groups S4S_4 and A4A_4 are noncommutative and solvable, of orders 24 and 12. The derived subgroups D(S4)D(S_4) and D(A4)D(A_4) are both equal to the Klein subgroup V4V_4 of S4S_4, consisting of the permutations of the form (ab)(cd)(ab)(cd) for a,b,c,da,b,c,d any enumeration of 1,2,3,41,2,3,4 — “double transpositions” – and of the identity. The group V4V_4 is commutative, isomorphic to (Z/2Z)2(\mathbf Z/2\mathbf Z)^2, and the quotient D(A4)/V4D(A_4)/V_4 is cyclic of order 33.

Another classical series of simple groups appears in linear algebra. Let FF be a field and let nn be an integer such that n2n\geq 2. The group GL(n,F)\mathrm{GL}(n,F) of n×nn\times n invertible matrices is not simple, for it is noncommutative but its center consists of homotheties; we can try to mod out by the center, getting the group PGL(n,F)=GL(n,F)/F×\mathrm{PGL}(n,F)=\mathrm{GL}(n,F)/F^\times but that one may not be simple. Indeed, another reason for GL(n,F)\mathrm{GL}(n,F) not to be simple is that it admits the special linear group SL(n,F)\mathrm{SL}(n,F), kernel of determinant, as a normal subgroup. The group SL(n,F)\mathrm{SL}(n,F) has a nontrivial center in general, it consists of homotheties of ratio aF×a\in F^\times such that an=1a^n=1 — let us denote it by μn\mu_n. But the quotient PSL(n,F)=SL(n,F)/μn\mathrm{PSL}(n,F)=\mathrm{SL}(n,F)/\mu_n is simple in general — in general meaning that is is always the case but for two exceptions:

  1. n=2n=2 and F=F2F=\mathbf F_2. Then PSL(2,F2)S3\mathrm{PSL}(2,\mathbf F_2)\simeq S_3 (by the action on P1(F2)\mathbf P_1(\mathbf F_2), see below), hence is not simple.
  2. n=2n=2 and F=F3F=\mathbf F_3. Then PSL(2,F3)S4\mathrm{PSL}(2,\mathbf F_3)\simeq S_4 (again by the action on P1(F3)\mathbf P_1(\mathbf F_3)), and is not simple.

Bilinear algebra gives rise to new groups, orthogonal, unitary and symplectic, which also furnish simple groups up to elementary transformations. By the famous “classification of finite simple groups”, these constructions furnish all finite simple groups, up to 26 (or 27) examples called sporadic groups. This remarkable theorem has a fantastic proof, encompassing thousands of pages across the years 1960-2010.

But the question here is: How can one prove that a given group is simple?

Alternating groups

There is a large supply of proofs that the alternating group AnA_n is simple for n5n\geq 5. Here is a sketch of one.

Let NN be a normal subgoup of AnA_n (n5n\geq 5) and assume that N{e}N\neq\{e\}. An element of AnA_n can be written as a product of an even number of transpositions. If two successive permutations in the product are equal, we can cancel them; if the share exactly one a common member, as in (ab)(ac)(a\,b)(a\,c), their product is a 3-cycle (acb)(a\,c\,b); if they have no common member, their product is a double transposition. On the other hand, if n5n\geq 5, we can either insert (bc)(bc)(b\,c)(b\,c) in the product (ab)(cd)(a\,b)(c\,d), writing a double transposition as a product of two 3-cycles, or insert (de)(de)(d\,e)(d\,e) in the product (ab)(ac)(a\,b)(a\,c), writing a 3-cycle as a product of two double transpositions. Consequently, AnA_n is generated by, either the 3-cycles, or the double transpositions. Moreover, since n5n\geq 5, we can check that 3-cycles are pairwise conjugate, and similarly for double transpositions; consequently, if the normal subgroup NN of AnA_n contains either a 3-cycle, or a double transposition, it will contain all of them, hence be equal to AnA_n.

When n=5n=5, the only case that remains to consider is when NN contains a 5-cycle, say g=(12345)g=(1\,2\,3\,4\,5). Conjugating gg by the 3-cycle h=(451)h=(4\,5\,1), we get hgh1=(h1h2h3h4h5)=(42351)Nhgh^{-1}=(h1\,h2\,h3\,h4\,h5)=(4\,2\,3\,5\,1)\in N. By construction, this element behaves as gg on 55, but differs. Consequently, the commutator hgh1g1hgh^{-1}g^{-1} is a nontrivial element of NN that fixes 55. By the first two cases, one has N=A5N=A_5.

A similar argument works in general, arguing by descending induction on the cardinality on the fixed point set of an element geg\neq e of NN. One considers an element hh of AnA_n and the conjugate hgh1hgh^{-1}; if g=(a1a2)(b1b2)g=(a_1\,a_2\,\dots)(b_1\,b_2\,\dots)\dots, then hgh1=(ha1ha2)(hb1hb2)hgh^{-1}=(ha_1\,ha_2\,\dots)(hb_1\,hb_2\,\dots)\dots is an element of NN that behaves as gg on many elements, but not all. Consquently, hgh1g1hgh^{-1}g^{-1} is a non trivial element of NN that fixes more elements than gg, and we conclude by induction. (Details can be found in James Milne's lectures, 4.34.)

The Iwasawa criterion

In 1941, Kenkiti Iwasawa published a proof of the simplicity of the projective special linear group. From this proof, a general criterion for simplicity has been abstracted:

Theorem (Iwasawa). — Let GG be a group with a primitive action on a set XX. Assume that one is given, for every xXx\in X, a subgroup T(x)T(x) of GG satisfying the following properties:

  • For every xXx\in X, the subgroup T(x)T(x) is commutative;
  • For every gGg\in G and xXx\in X, T(gx)=gT(x)g1T(g\cdot x)=g T(x)g^{-1};
  • The union of the groups T(x)T(x), for xXx\in X, generate GG.
Then any normal subgroup NN of GG that acts nontrivially on XX contains the commutator subgroup of GG. In particular, if G=D(G)G=D(G), then GG is simple.

There are two classical ways to state that the action is primitive. The simplest states that it is transitive and that the stabilizers of points are maximal subgroups of GG. Another is that there is no imprimitivity block, a nontrivial partition (Xi)(X_i) of XX such that for every gGg\in G and every ii, there exist jj such that gXi=g\cdot X_i=X_j$. One can prove that a 2-transitive action (= transitive on ordered pairs of distinct elements) is primitive.

Iwasawa applies his criterion to G=SL(n,F)G=\mathrm{SL}(n,F) acting on the projective space Pn1(F)\mathbf P_{n-1}(F) of lines in FnF^n. Except when n=2n=2 and FF has 2 or 3 elements, this action is 2-transitive, hence primitive. For a nonzero xFnx\in F^n, one considers the group T(x)T(x) of linear transformations of the form yy+ϕ(y)xy\mapsto y + \phi(y) x (transvections), for all linear forms ϕ\phi on FnF^n such that ϕ(x)=0\phi(x)=0. They constitute a commutative subgroup of SL(n,F)\mathrm{SL}(n,F) (isomorphic, as a group, to Fn1F^{n-1}). The map TT gives rise to the data as in Iwasawa's theorem. Consequently, every normal subgroup NN of SL(n,F)\mathrm{SL}(n,F) that acts nontrivially on Pn1(F)\mathbf P_{n-1}(F) contains the commutator subgroup of SL(n,F)\mathrm{SL}(n,F), which is SL(n,F)\mathrm{SL}(n,F). Explicitly, either NN consists of homotheties, or NN contains SL(n,F)\mathrm{SL}(n,F). This implies that PSL(n,F)\mathrm{PSL}(n,F) is simple.

Applying the Iwasawa criterion for symmetric groups

One may wish to apply the Iwasawa criterion to the symmetric group. However, the conclusion is not as nice as what I had hoped initially.

Let SnS_n act on the set of 2-element subsets of X={1,,n}X=\{1,\dots,n\}. If n5n\geq 5, the action is primitive. This is not completely obvious, because it this action is not 2-transitive (you cannot map {1,2}\{1,2\} and {1,3}\{1,3\} to {1,2}\{1,2\} and {3,4}\{3,4\})! Its primitivity means that stabilizers are maximal subgroups, and one can prove that the stabilizer of a 2-element subset is indeed maximal unless n=4n=4. It is also faithful (no nontrivial acts trivially). To a pair {a,b}\{a,b\}, associate the subgroup of order 2 generated by the transposition (ab)(a\,b). It satisfies the criterion, and this shows that the nontrivial normal subgroups of SnS_n contain D(Sn)=AnD(S_n)=A_n. Since AnA_n has index 2, this shows N=AnN=A_n or N=SnN=S_n.

It is interesting to guess how the criterion breaks for n=4n=4. In fact, the action of S4S_4 on pairs is transitive, but not primitive: the stabilizer of {1,2}\{1,2\} is a subgroup of S4S_4 of order 44, consisting of the identiy, two transpositions (12)(1\,2) and (34)(3\,4), and their product. Since Card(S4)=24!=233\mathrm{Card}(S_4)=24!=2^3\cdot 3, this subgroup is contained in a 2-sylow subgroup, of order 8, so is not maximal.

However, and I find it unfortunate, this proof does not imply that the alternating group AnA_n is simple for n5n\geq 5. To prove the simplicity of AnA_n along these lines, we need to consider the following actions.

  • Let AnA_n act on the set X3X_3 of 3-element subsets of XX. For x={a,b,c}X3x=\{a,b,c\}\in X_3, consider the subgroup T(x)T(x) of order 3 of AnA_n consisting of the identity and the 3-cycles (abc)(a\,b\,c) and (acb)(a\,c\,b). (It is the alternating group on xx, viewed as a subgroup of AnA_n.) The Iwasawa criterion applies provided the action is primitive, which presumably holds for n>6n>6. Consequently, AnA_n is simple for n7n\geq 7. However, if n=6n=6, the stabilizer of {1,23}\{1,2\,3\} in A6A_6 is not maximal, for a reason analogous to the one explained for S4S_4.
  • Let AnA_n act on the set X4X_4 of 4-element subsets of XX. For xX4x\in X_4, we can consider the Klein subgroup T(x)T(x) of A4A_4, acting on xx, viewed as a subgroup of AnA_n. I hope that the action is primitive, except for n=8n=8, and this would prove that AnA_n is simple for n8n\geq 8. This uses that double transpositions generate AnA_n.
  • One can improve the two preceding arguments a little bit. If n=5n=5, we can have AnA_n act on X2X_2, associating with xx the alternating group on the three remaining letters. Therefore, A5A_5 is simple (the action is primitive because the stabilizer of a point is the Klein subgroup of A4A_4, as a subgroup of A5A_5, its cardinality is 12, that of A5A_5 is 60, and since 60/12=5 is prime, the Klein subgroup of A4A_4 is maximal in A5A_5). Similarly, if n=6n=6, we have A6A_6 act on X2X_2, associating with xX2x\in X_2 the Klein subgroup corresponding to the four remaining letters. Provided one can prove that the stabilizer of a pair in A6A_6 is a maximal subgroup, this gives a proof that A6A_6 is simple!

The primitivity assertions seem to hold. In fact, maximal subgroups of AnA_n and SnS_n are classified by the O'Nan–Scott theorem. Those we're concerned with have type (a) in the notation of (Liebeck, Praeger, Saxl, 1987. “A Classification of the Maximal Subgroups of the Finite Alternating and Symmetric Groups.” Journal of Algebra 111 (2): 365–83. DOI:10.1016/0021-8693(87)90223-7), and according to Theorem 1 of that paper, the relevant subgroups are indeed maximal.

Formalization: where am I now?

I have already formalized the Iwasawa criterion in Lean (there's an ongoing pull request for this), as well as the result that the normal subgroups of SnS_n are {e}\{e\}, AnA_n and SnS_n for n5n\geq 5.

It remains to formalize the rest of the proof, and the main difficulty will be in proving primitivity, plus some nice way of defining the maps TT so that their properties are visible.

I also wish to formalize the simplicity of the special linear group along these lines, and it should be ready for an application to orthogonal, unitary or symplectic groups as well.