Monday, December 13, 2021

Not simple proofs of simplicity

The last few weeks, I started my self-education in proof formalization (in Lean) by a test case, the simplicity of the alternating group. In this blog post, I want to discuss some aspects of the underlying mathematics which I think I learnt during the formalization process.

Simple groups. Some examples

This is about groups, eventually finite groups. Let us recall that a subgroup NN of a group GG is normal if for every gGg \in G and nNn\in N, one has gng1Ngng^{-1}\in N. Concretely, this means that the two equivalence relations modulo NN (left or right) coincide, and that the quotient set G/NG/N inherits a group structure that makes the natural projection GG/NG \to G/N a morphism of groups.

In this setting, the group GG can be viewed as an extension of the two groups G/NG/N and NN, which are possibly simpler, and this extension may help at understanding properties of the group GG. At the opposite, one says that GG is simple if it has no normal subgroup besides {e}\{e\} and GG itself (and if G{e}G\neq\{e\}).

Trivial examples of normal subgroups of a group GG are {e}\{e\} and GG itself. Less trivial examples are the center Z(G)Z(G) of GG (those elements gg such that gh=hggh=hg for all hGh\in G), and the derived subgroup D(G)D(G), the subgroup generated by all commutators ghg1h1ghg^{-1}h^{-1}. This commutator subgroup is interesting: any subgroup NN of GG containing the commutator subgroup D(G)D(G) is normal, and the quotient is abelian; and conversely.

The kernel of a group morphism is a normal subgroup, and the construction of a quotient shows that all normal subgroups appear in this way. In particular, the alternating group AnA_n is a normal subgroup of the symmetric group SnS_n.

A simple group has either Z(G)={e}Z(G)=\{e\} or Z(G)=GZ(G)=G; in the latter case, this means that GG is commutative, and it quickly appears that GG has to be finite of prime order. Consequently, our discussion will now concern groups with trivial center.

The concept of simplicity of groups is classically presented in connection with Galois theory, and the question of solvability of equations by radicals. Namely, a “general” polynomial equation of degre nn has SnS_n, the full symmetric group on nn elements, for its Galois group, and, if n5n\geq 5, the only possible dévissage of this group consists in introducing the alternating group AnA_n of even permutations, the subgroup AnA_n being normal and simple. On the other hand, the solvability of polynomial equation by radicals is equivalent to such a dévissage where all successive quotients are cyclic groups (equivalently abelian groups). Since AnA_n is not abelian, this implies that a “general” polynomial equation of degree nn is not solvable by radicals. However, using simplicity of the alternating group is much stronger than what we need: what would be needed is solvability of the symmetric group, and that this does not hold if n5n\geq 5 is much simpler. Consequently, for the problem of resolution by radicals, it suffices to prove that the derived subgroup D(An)D(A_n) of the alternating group is equal to AnA_n.

Theorem. — For n5n\geq 5, one has D(An)=AnD(A_n)=A_n. In particular, the alternating group and the symmetric group on nn letters are not solvable.
I give two (equivalent) proofs, the second one being a computational interpretation of the first one. Both use that the 3-cycles generate AnA_n and are conjugate in AnA_n. The computational proof is shorter, arguingly simpler. As a matter of fact, I never understood it, nor could remember it, until I translated the conceptual proof into the proof assistant.
Consider the morphism p ⁣:AnAn/D(An)p\colon A_n\to A_n/D(A_n). Since An/D(An)A_n/D(A_n) is commutative, all 3-cycles have the same image. Since the square of a 3-cycle is again a 3-cycle, both have the same image. This implies that for every 3-cycle gAng\in A_n, one has p(g)=p(g2)p(g)=p(g^2), hence p(g)=ep(g)=e. Since the 3_cycles generate AnA_n, the morphism pp is trivial; since it is surjective, one has An/D(An)={e}A_n/D(A_n)=\{e\} and D(An)=AnD(A_n)=A_n.
Computationally, consider a 3-cycle gg and its square g2g^2. Since they are conjugate, there exists hAnh\in A_n such that g2=hgh1g^2=hgh^{-1}. Then g=hgh1g1g=hgh^{-1}g^{-1}, so that gg is a commutator; in particular, D(An)D(A_n) contains all commutators and D(An)=AnD(A_n)=A_n.

The remaining cases, for $n\leq 4, are interesting, but classically left as exercises in text books:

  1. One has A1=S1={e}A_1=S_1=\{e\};
  2. The group S2S_2 is the cyclic group of order 2, hence is simple and solvable, and A2A_2 is trivial;
  3. The group S3S_3 is a noncommutative group of order 6, and A3A_3 is a cyclic group of order 2.
  4. The groups S4S_4 and A4A_4 are noncommutative and solvable, of orders 24 and 12. The derived subgroups D(S4)D(S_4) and D(A4)D(A_4) are both equal to the Klein subgroup V4V_4 of S4S_4, consisting of the permutations of the form (ab)(cd)(ab)(cd) for a,b,c,da,b,c,d any enumeration of 1,2,3,41,2,3,4 — “double transpositions” – and of the identity. The group V4V_4 is commutative, isomorphic to (Z/2Z)2(\mathbf Z/2\mathbf Z)^2, and the quotient D(A4)/V4D(A_4)/V_4 is cyclic of order 33.

Another classical series of simple groups appears in linear algebra. Let FF be a field and let nn be an integer such that n2n\geq 2. The group GL(n,F)\mathrm{GL}(n,F) of n×nn\times n invertible matrices is not simple, for it is noncommutative but its center consists of homotheties; we can try to mod out by the center, getting the group PGL(n,F)=GL(n,F)/F×\mathrm{PGL}(n,F)=\mathrm{GL}(n,F)/F^\times but that one may not be simple. Indeed, another reason for GL(n,F)\mathrm{GL}(n,F) not to be simple is that it admits the special linear group SL(n,F)\mathrm{SL}(n,F), kernel of determinant, as a normal subgroup. The group SL(n,F)\mathrm{SL}(n,F) has a nontrivial center in general, it consists of homotheties of ratio aF×a\in F^\times such that an=1a^n=1 — let us denote it by μn\mu_n. But the quotient PSL(n,F)=SL(n,F)/μn\mathrm{PSL}(n,F)=\mathrm{SL}(n,F)/\mu_n is simple in general — in general meaning that is is always the case but for two exceptions:

  1. n=2n=2 and F=F2F=\mathbf F_2. Then PSL(2,F2)S3\mathrm{PSL}(2,\mathbf F_2)\simeq S_3 (by the action on P1(F2)\mathbf P_1(\mathbf F_2), see below), hence is not simple.
  2. n=2n=2 and F=F3F=\mathbf F_3. Then PSL(2,F3)S4\mathrm{PSL}(2,\mathbf F_3)\simeq S_4 (again by the action on P1(F3)\mathbf P_1(\mathbf F_3)), and is not simple.

Bilinear algebra gives rise to new groups, orthogonal, unitary and symplectic, which also furnish simple groups up to elementary transformations. By the famous “classification of finite simple groups”, these constructions furnish all finite simple groups, up to 26 (or 27) examples called sporadic groups. This remarkable theorem has a fantastic proof, encompassing thousands of pages across the years 1960-2010.

But the question here is: How can one prove that a given group is simple?

Alternating groups

There is a large supply of proofs that the alternating group AnA_n is simple for n5n\geq 5. Here is a sketch of one.

Let NN be a normal subgoup of AnA_n (n5n\geq 5) and assume that N{e}N\neq\{e\}. An element of AnA_n can be written as a product of an even number of transpositions. If two successive permutations in the product are equal, we can cancel them; if the share exactly one a common member, as in (ab)(ac)(a\,b)(a\,c), their product is a 3-cycle (acb)(a\,c\,b); if they have no common member, their product is a double transposition. On the other hand, if n5n\geq 5, we can either insert (bc)(bc)(b\,c)(b\,c) in the product (ab)(cd)(a\,b)(c\,d), writing a double transposition as a product of two 3-cycles, or insert (de)(de)(d\,e)(d\,e) in the product (ab)(ac)(a\,b)(a\,c), writing a 3-cycle as a product of two double transpositions. Consequently, AnA_n is generated by, either the 3-cycles, or the double transpositions. Moreover, since n5n\geq 5, we can check that 3-cycles are pairwise conjugate, and similarly for double transpositions; consequently, if the normal subgroup NN of AnA_n contains either a 3-cycle, or a double transposition, it will contain all of them, hence be equal to AnA_n.

When n=5n=5, the only case that remains to consider is when NN contains a 5-cycle, say g=(12345)g=(1\,2\,3\,4\,5). Conjugating gg by the 3-cycle h=(451)h=(4\,5\,1), we get hgh1=(h1h2h3h4h5)=(42351)Nhgh^{-1}=(h1\,h2\,h3\,h4\,h5)=(4\,2\,3\,5\,1)\in N. By construction, this element behaves as gg on 55, but differs. Consequently, the commutator hgh1g1hgh^{-1}g^{-1} is a nontrivial element of NN that fixes 55. By the first two cases, one has N=A5N=A_5.

A similar argument works in general, arguing by descending induction on the cardinality on the fixed point set of an element geg\neq e of NN. One considers an element hh of AnA_n and the conjugate hgh1hgh^{-1}; if g=(a1a2)(b1b2)g=(a_1\,a_2\,\dots)(b_1\,b_2\,\dots)\dots, then hgh1=(ha1ha2)(hb1hb2)hgh^{-1}=(ha_1\,ha_2\,\dots)(hb_1\,hb_2\,\dots)\dots is an element of NN that behaves as gg on many elements, but not all. Consquently, hgh1g1hgh^{-1}g^{-1} is a non trivial element of NN that fixes more elements than gg, and we conclude by induction. (Details can be found in James Milne's lectures, 4.34.)

The Iwasawa criterion

In 1941, Kenkiti Iwasawa published a proof of the simplicity of the projective special linear group. From this proof, a general criterion for simplicity has been abstracted:

Theorem (Iwasawa). — Let GG be a group with a primitive action on a set XX. Assume that one is given, for every xXx\in X, a subgroup T(x)T(x) of GG satisfying the following properties:

  • For every xXx\in X, the subgroup T(x)T(x) is commutative;
  • For every gGg\in G and xXx\in X, T(gx)=gT(x)g1T(g\cdot x)=g T(x)g^{-1};
  • The union of the groups T(x)T(x), for xXx\in X, generate GG.
Then any normal subgroup NN of GG that acts nontrivially on XX contains the commutator subgroup of GG. In particular, if G=D(G)G=D(G), then GG is simple.

There are two classical ways to state that the action is primitive. The simplest states that it is transitive and that the stabilizers of points are maximal subgroups of GG. Another is that there is no imprimitivity block, a nontrivial partition (Xi)(X_i) of XX such that for every gGg\in G and every ii, there exist jj such that gXi=g\cdot X_i=X_j$. One can prove that a 2-transitive action (= transitive on ordered pairs of distinct elements) is primitive.

Iwasawa applies his criterion to G=SL(n,F)G=\mathrm{SL}(n,F) acting on the projective space Pn1(F)\mathbf P_{n-1}(F) of lines in FnF^n. Except when n=2n=2 and FF has 2 or 3 elements, this action is 2-transitive, hence primitive. For a nonzero xFnx\in F^n, one considers the group T(x)T(x) of linear transformations of the form yy+ϕ(y)xy\mapsto y + \phi(y) x (transvections), for all linear forms ϕ\phi on FnF^n such that ϕ(x)=0\phi(x)=0. They constitute a commutative subgroup of SL(n,F)\mathrm{SL}(n,F) (isomorphic, as a group, to Fn1F^{n-1}). The map TT gives rise to the data as in Iwasawa's theorem. Consequently, every normal subgroup NN of SL(n,F)\mathrm{SL}(n,F) that acts nontrivially on Pn1(F)\mathbf P_{n-1}(F) contains the commutator subgroup of SL(n,F)\mathrm{SL}(n,F), which is SL(n,F)\mathrm{SL}(n,F). Explicitly, either NN consists of homotheties, or NN contains SL(n,F)\mathrm{SL}(n,F). This implies that PSL(n,F)\mathrm{PSL}(n,F) is simple.

Applying the Iwasawa criterion for symmetric groups

One may wish to apply the Iwasawa criterion to the symmetric group. However, the conclusion is not as nice as what I had hoped initially.

Let SnS_n act on the set of 2-element subsets of X={1,,n}X=\{1,\dots,n\}. If n5n\geq 5, the action is primitive. This is not completely obvious, because it this action is not 2-transitive (you cannot map {1,2}\{1,2\} and {1,3}\{1,3\} to {1,2}\{1,2\} and {3,4}\{3,4\})! Its primitivity means that stabilizers are maximal subgroups, and one can prove that the stabilizer of a 2-element subset is indeed maximal unless n=4n=4. It is also faithful (no nontrivial acts trivially). To a pair {a,b}\{a,b\}, associate the subgroup of order 2 generated by the transposition (ab)(a\,b). It satisfies the criterion, and this shows that the nontrivial normal subgroups of SnS_n contain D(Sn)=AnD(S_n)=A_n. Since AnA_n has index 2, this shows N=AnN=A_n or N=SnN=S_n.

It is interesting to guess how the criterion breaks for n=4n=4. In fact, the action of S4S_4 on pairs is transitive, but not primitive: the stabilizer of {1,2}\{1,2\} is a subgroup of S4S_4 of order 44, consisting of the identiy, two transpositions (12)(1\,2) and (34)(3\,4), and their product. Since Card(S4)=24!=233\mathrm{Card}(S_4)=24!=2^3\cdot 3, this subgroup is contained in a 2-sylow subgroup, of order 8, so is not maximal.

However, and I find it unfortunate, this proof does not imply that the alternating group AnA_n is simple for n5n\geq 5. To prove the simplicity of AnA_n along these lines, we need to consider the following actions.

  • Let AnA_n act on the set X3X_3 of 3-element subsets of XX. For x={a,b,c}X3x=\{a,b,c\}\in X_3, consider the subgroup T(x)T(x) of order 3 of AnA_n consisting of the identity and the 3-cycles (abc)(a\,b\,c) and (acb)(a\,c\,b). (It is the alternating group on xx, viewed as a subgroup of AnA_n.) The Iwasawa criterion applies provided the action is primitive, which presumably holds for n>6n>6. Consequently, AnA_n is simple for n7n\geq 7. However, if n=6n=6, the stabilizer of {1,23}\{1,2\,3\} in A6A_6 is not maximal, for a reason analogous to the one explained for S4S_4.
  • Let AnA_n act on the set X4X_4 of 4-element subsets of XX. For xX4x\in X_4, we can consider the Klein subgroup T(x)T(x) of A4A_4, acting on xx, viewed as a subgroup of AnA_n. I hope that the action is primitive, except for n=8n=8, and this would prove that AnA_n is simple for n8n\geq 8. This uses that double transpositions generate AnA_n.
  • One can improve the two preceding arguments a little bit. If n=5n=5, we can have AnA_n act on X2X_2, associating with xx the alternating group on the three remaining letters. Therefore, A5A_5 is simple (the action is primitive because the stabilizer of a point is the Klein subgroup of A4A_4, as a subgroup of A5A_5, its cardinality is 12, that of A5A_5 is 60, and since 60/12=5 is prime, the Klein subgroup of A4A_4 is maximal in A5A_5). Similarly, if n=6n=6, we have A6A_6 act on X2X_2, associating with xX2x\in X_2 the Klein subgroup corresponding to the four remaining letters. Provided one can prove that the stabilizer of a pair in A6A_6 is a maximal subgroup, this gives a proof that A6A_6 is simple!

The primitivity assertions seem to hold. In fact, maximal subgroups of AnA_n and SnS_n are classified by the O'Nan–Scott theorem. Those we're concerned with have type (a) in the notation of (Liebeck, Praeger, Saxl, 1987. “A Classification of the Maximal Subgroups of the Finite Alternating and Symmetric Groups.” Journal of Algebra 111 (2): 365–83. DOI:10.1016/0021-8693(87)90223-7), and according to Theorem 1 of that paper, the relevant subgroups are indeed maximal.

Formalization: where am I now?

I have already formalized the Iwasawa criterion in Lean (there's an ongoing pull request for this), as well as the result that the normal subgroups of SnS_n are {e}\{e\}, AnA_n and SnS_n for n5n\geq 5.

It remains to formalize the rest of the proof, and the main difficulty will be in proving primitivity, plus some nice way of defining the maps TT so that their properties are visible.

I also wish to formalize the simplicity of the special linear group along these lines, and it should be ready for an application to orthogonal, unitary or symplectic groups as well.

1 comment :