Sunday, January 1, 2023

The Klein group, the centralizer of a permutation, and its relation with the alternating group

The following reflexion came out of my irrepressible need to understand why the 3 double transpositions in S4\mathfrak S_4, together with the identity, formed a group VV. Of course, one might just say: “they are stable under multiplication, as one sees by computing the 4·3/2 = 6 different products”, but who makes this computation anyway? And since I wanted not only to understand this, but to explain it to Lean, I needed an argument that could actually be done, for real. So here is an argument that requires no computation, besides the one that says that there are 3 double transpositions.

Prop.The subgroup of S4\mathfrak S_4 generated by the 3 double transpositions is the unique 2-sylow subgroup of A4\mathfrak A_4. In particular, it has order 4 and consists of these 3 double transpositions and of the identity.
Proof. — Let VV be the subset of S4\mathfrak S_4 consisting of these 3 double transpositions and of the identity. Let SS be a 2-sylow subgroup in A4\mathfrak A_4.
We first prove SVS \subseteq V. The subgroup SS has order 4. Let gSg\in S. The order of gg divides 4, so its cycles have lengths 1, 2 or 4. If there were one cycle of length 4, then gg would be that cycle, hence of odd sign. Consequently, either g=1g=1, or gg has a cycle of length 2, and then there must be a second because gg is even. Consequently, SVS\subseteq V, as claimed.
Since 4=Card(S)=Card(V)4=\operatorname{\rm Card}(S)=\operatorname{\rm Card}(V), this shows that S=VS=V, hence S=VS=\langle V\rangle.

At this point, we still need to understand why there are 3 double transpositions. More generally, I wanted to prove that the number of permutations in Sn\mathfrak S_n of given orbit type. The orbit type a permutation gg is a multiset of strictly positive integers with sum nn given by the cardinalities of the orbits of gg on {1,,n}\{1,\dots,n\}. We write it as 1n12n2rnr1^{n_1} 2^{n_2}\dots r^{n_r}, meaning that gg has n1n_1 orbits of length 11 (fixed points), n2n_2 orbits of cardinality 22, etc., so that n=niin= \sum n_i i. Let Og\mathscr O_g be the set of orbits of gg. The action of gg on a given orbit cc coincides with a circular permutation with order the length (c)\ell(c) of this orbit; when it is nontrivial, such a permutation will be called a cycle of gg. The supports of these cycles are pairwise disjoint, so that these cycles commute, and their product is exactly gg. In fact, this is the only way of writing gg as a product of cycles with pairwise disjoint supports. (By convention, the identity is not a cycle.)

Theorem.There are exactly N(1n1rnr)=n!1n1rnr n1!nr! N(1^{n_1}\dots r^{n_r}) = \frac{n!}{1^{n_1}\dots r^{n_r} n_1!\dots n_r!} permutations with orbit type 1n12n2rnr1^{n_1} 2^{n_2}\dots r^{n_r}.

A standard proof of this result goes as follows. Write the decomposition of such a permutation gg into cycles with disjoint supports as g=()()(,)(,,)g=(\cdot)\dots (\cdot)(\cdot,\cdot)\dots(\cdot,\cdot,\dots), leaving blank spaces for the values of the cycles (and, contradicting our convention, allowing for cycles of length 1…). There are n!n! ways to fill these spaces with the nn distinct integers between 11 and nn, but some of them will give rise to the same permutation. Indeed, the entries in a cycle of length ss only count up to a circular permutation, so that we need to divide the result by 1n1rnr1^{n_1}\dots r^{n_r}. Moreoveer, we can switch the order of the cycles of given length, hence we also need to divide that result by ns!n_s! (number of ways of switching the various cycles of length ss), for all possible length ss.

This is reasonably convincing but one could wish for something more precise, both in the proof, and in the statement. In fact, in the preceding formula, the numerator n!n! is the order of Sn\mathfrak S_n. Since all permutations with a given orbit type are conjugate by Sn\mathfrak S_n, the left hand side appears as the cardinality of the orbit of a permutation gg of that orbit type, so that the denominator has to be equal the cardinality of the stabilizer of this permutation under the action by conjugation. Therefore, a more precise proof of this formula could run by elucidating the structure of this centralizer. This may also be interesting once one wishes to relativize the result to the alternating group An\mathfrak A_n in order to obtain a formula for the cardinality of the various conjugacy classes in An\mathfrak A_n.

Let us fix a permutation gSng\in\mathfrak S_n with orbit type 1n1rnr1^{n_1}\dots r^{n_r}. The stabilizer of gg under the action by conjugation is its centralizer ZgZ_g, the subgroup of all kSnk\in\mathfrak S_n which commute with gg.

We first define a morphism of groups  τ ⁣:ZgSn1×Sn2×Snr. \tau \colon Z_g \to \mathfrak S_{n_1}\times \mathfrak S_{n_2}\times\dots \mathfrak S_{n_r}. Let Og\mathscr O_g be the set of orbits of gg; this is a set with cardinality n1+n2++nrn_1+n_2+\dots+n_r. Restricted to one orbit, the action of gg coincides with that of a circular permutation on (which fixes the complementary subset); these circular permuations have disjoint supports, hence they commute pairwise and their product is equal to gg. For cOgc\in\mathscr O_g, we write (c)\ell(c) for its cardinality of its support, this is also the order of the cycle of gg defined by this orbit. If kZgk\in Z_g, then kgk1=gkgk^{-1}=g. Consequently, the action of kk permutes the orbits of gg, respecting their cardinalities. This defines the desired group morphism τ\tau from ZgZ_g to a product of permutation groups Sn1×Snr\mathfrak S_{n_1}\times \dots \mathfrak S_{n_r}.

This morphism τ\tau is surjective.
Indeed, given permutations σ1\sigma_1 of the set of fixed points of gg, σ2\sigma_2 of the set of orbits of length 2, etc., we construct kσZgk_\sigma\in Z_g such that τ(kτ)=(σ1,,σr)\tau(k_\tau)=(\sigma_1,\dots,\sigma_r). We fix a point aca_c in each orbit cc and decide that kσ(ac)=aσi(c)k_\sigma(a_c)=a_{\sigma_i(c)} if cc has length ii. The formula kσg=gσk_\sigma g=g_\sigma imposes kσ(gnac)=gnaσi(c)k_\sigma (g^n a_c)=g^n a_{\sigma_i(c)} for all nZn\in\mathbf Z, and it remains to check that this formula gives a well defined element in ZgZ_g. In fact, this formula defines a group theoretic section of τ\tau.

What is the kernel of this morphism τ\tau?
If τ(k)=1\tau(k)=1, then kk fixes every orbit cOgc\in\mathscr O_g. Since kg=gkkg=gk, we obtain that on each orbit cc, kk coincides with some power of the corresponding cycle, which has order (c)\ell(c). We thus obtain an isomorphism  ker(τ)cCg(Z/(c)Z). \ker(\tau) \simeq \prod_{c\in\mathscr C_g} (\mathbf Z/\ell(c)\mathbf Z).

To compute the cardinality of ZgZ_g, it is now sufficient to compute those of im(τ)\operatorname{\rm im}(\tau) and ker(τ)\ker(\tau), and this gives the formula  Card(Zg)=Card(ker(τ))Card(im(τ))=1n1rnrn1!nr!, \operatorname{\rm Card} (Z_g) = \operatorname{\rm Card} (\ker(\tau)) \operatorname{\rm Card} (\operatorname{\rm im}(\tau)) = 1^{n_1}\dots r^{n_r} n_1! \dots n_r!, as was to be shown.

One of the interest of this argument is that it can be pushed forward to understand the structure of the conjugacy classes in the alternating group An\mathfrak A_n. The case n1n\leq 1 is uninteresting, hence we assume n2n\geq 2. Then An\mathfrak A_n has index 2 in Sn\mathfrak S_n, and the formulas   Card((g)An)=Card(An)Card(ZgAn)and Card((g)Sn)=Card(Sn)Card(Zg) \operatorname  {\rm Card}((g)_{\mathfrak A_n}) = \frac{{\rm Card}({\mathfrak A_n})}{{\rm Card}(Z_g \cap {\mathfrak A_n})} \quad\text{\rm and}\quad \operatorname  {\rm Card}((g)_{\mathfrak S_n}) = \frac{{\rm Card}({\mathfrak S_n})}{{\rm Card}(Z_g)} for the cardinalities of the conjugacy classes (g)An(g)_{\mathfrak A_n} and (g)Sn(g)_{\mathfrak S_n} imply that both are equal if and only if ZgZ_g is not contained in An\mathfrak A_n; otherwise, the conjugacy class (g)Sn(g)_{\mathfrak S_n} is the disjoint union of (g)An(g)_{\mathfrak A_n} and of a conjugacy class (g)An(g')_{\mathfrak A_n} of a permutation gg' which is conjugate to gg in Sn\mathfrak S_n but not in An\mathfrak A_n, and both have the same cardinality.

Examples of this phenomenon are classical. For example, the 5-cycles in S5\mathfrak S_5 are conjugate, but they constitute two distinct conjugacy classes under A5\mathfrak A_5. Even more elementary, the 3-cycles (123)(1\,2\,3) and (132)(1\,3\,2) are conjugate in S3\mathfrak S_3, but they are not conjugate in A3\mathfrak A_3 since that group is commutative!

So let us use our description of ZgZ_g to give a full description of this phenomenon.

As a first step, when is ker(τ)\ker(\tau) contained in An\mathfrak A_n? We have seen that ker(τ)\ker(\tau) is generated by the cycles cCgc\in\mathscr C_g. Consequently, ker(τ)\ker(\tau) is contained in An\mathfrak A_n if and only if all of them are contained in An\mathfrak A_n, which means that their lengths are odd.

We assume that this condition holds, so that ker(τ)An\ker(\tau)\subseteq \mathfrak A_n, and now work on the image of τ\tau. Its surjectivity was proved by the means of an explicit section σkσ\sigma\mapsto k_\sigma. Given the preceding condition that ker(τ)An\ker(\tau)\subseteq \mathfrak A_n, a necessary and sufficient condition for the inclusion ZgAnZ_g\subseteq \mathfrak A_n will be that the image of this section consists of even permutations. This section is a morphism of groups, so it suffices to understand the sign of kσk_\sigma when σ\sigma consists of a cycle (c1,,cs)(c_1,\dots,c_s) in Sni\mathfrak S_{n_i} and is trivial on the other factors. Then (c1)==(cs)\ell(c_1)=\dots=\ell(c_s), by definition of σ\sigma. The formula kσ(gnac)=gnaσ(c)k_\sigma(g^n a_c)=g^n a_{\sigma(c)} shows that the non trivial cycles of kσk_\sigma are of the form (gnac1,,gnacs)(g^n a_{c_1},\dots, g^n a_{c_s}); they all have the same length, ss, and there are (c1)\ell(c_1) of them. Consequently, the sign of kσk_\sigma is equal to (1)(s1)(c1)=(1)s1(-1)^{(s-1)\ell(c_1)}=(-1)^{s-1} since (c1)\ell(c_1) is odd. This proves that the sign of kσk_\sigma is equal to the sign of σ\sigma. In addition to the condition that the orbits of gg have odd cardinalities, a necessary and sufficient condition for the image of σkσ\sigma\mapsto k_\sigma to be contained in An\mathfrak A_n is thus that all symmetric groups Sni\mathfrak S_{n_i} coincide with their alternating groups, that is, ni1n_i\leq 1 for all ii. We can now conclude:

Theorem. — Let 1n1rnr1^{n_1}\dots r^{n_r} be a partition of nn.

  1. If ni=0n_i=0 for even ii, and ni1n_i\leq 1 for all ii, then there exist two permutations in Sn\mathfrak S_n with orbit type 1n1rnr1^{n_1}\dots r^{n_r} which are not conjugate in An\mathfrak A_n.
  2. Otherwise, any two permutations in Sn\mathfrak S_n with that orbit type are conjugate in An\mathfrak A_n.

We can check the initial example of two 5-cycles in S5\mathfrak S_5 which are not conjugate in A5\mathfrak A_5. Their orbit type is 515^1: the only length that appears is 5, hence odd, and it has multiplicity 11. In fact, this is the only orbit type in S5\mathfrak S_5 where this phenomenon appears!