Wednesday, October 17, 2012

Ultrafilters

Ultrafilters won't make american coffee better, but they have nice applications in mathematics. In fact, the last three (working) days, I happened to hear about them with three different colleagues, in three different directions.

Let me recall that an ultrafilter U\cal U on a set XX is a subset of P(X)\mathfrak P(X) satisfying the following axioms:
  1. The empty set does not belong to U\cal U;
  2. If AUA\in\cal U and BAB\supset A, then BUB\in\cal U;
  3. If A,BUA,B\in \cal U, then ABUA\cap B\in\cal U;
  4. For any subset AXA\subset X, either AA or A\complement A belongs to U\cal U (but not both).
In informal terms, an element of U\cal U may be thought of as a “large” subset of XX; the axioms say 1. that the empty set is not large, 2. that a superset of a large set is again large, 3. that large sets are large enough so that intersections of two large sets is large, and finally, 4. that if a set is not large, then its complement is large. In fact, this last axiom is the one which makes a filter “ultra”.

Here is an example of an ultrafilter. Fix a point xXx\in X and decree that a set AXA\subset X is large iff it contains xx. Such ultrafilters are called principal and they do not quite respect the intuition of elements of an ultrafilter being a large set. Unfortunately, there is no explicit construction of an ultrafilter which is not principal. Fortunately, choice-like axioms imply the existence of non principal ultrafilters.

Ultrafilters are used to construct ultraproducts

Take a family (Ax)(A_x) of  non-empty sets indexed by the set XX and fix a nonprincipal ultrafilter on XX. (The definition below is not the correct one when some sets AxA_x may be empty; the correct one is slightly more elaborate. I learnt it from Laurent Moret-Bailly, see Section 2 of his beautiful paper An extension of Greenberg's theorem to general valuation ringsarXiv:1106.0984v3.) Define an equivalence relation on the product set A=AxA=\prod A_x by saying that two families (ax)(a_x) and (bx)(b_x) are equivalent iff the set of xx such that ax=bxa_x=b_x is large. Let AA^* be the quotient of AA by this equivalence relation; it is called the ultraproduct of the AxA_x. When the AxA_x have some extra-structure (relations, operations, etc.) on can transport them to the ultraproduct AA^*. For example, if, for each xx, RxR_x is a binary relation on AxA_xa,bAa,b\in A^* are classes of (ax)(a_x) and (bx)(b_x), say that R(a,b)R(a,b) iff the set of xx such that Rx(ax,bx)R_x(a_x,b_x) is large. We shall see some specific examples below.

There are many nice bits of mathematics where ultrafilters are involved. Here are three of them which I would like not to forget.

1) Cardinality of ultraproducts of countable sets.

If there is a large subset of XX on which Card(Ax)\mathrm{Card}(A_x) is bounded, then AA^* is finite. Otherwise, AA^* is infinite, and has in fact at least the power of the continuum.

Here is a classical proof in the case X=NX=\mathbf N, with some twists that helped me understand it.

We want to construct a map from 2N2^{\mathbf N} to AA^*. Here, 2N2^{\mathbf N} is the set of subsets of N\mathbf N,  but can be almost viewed as the set of 22-adic integers, a subset of N\mathbf N corresponding to a 22-adic integer whose nonzero digits are exactly those belonging to the given subset. In formula, SNS\subset\mathbf N corresponds to the 22-adic integer f(S)=n=0χS(n)2nf(S)=\sum_{n=0}^\infty \chi_S(n) 2^n. If we sum only nn terms, we get an approximation fn(S)f_n(S) modulo (2n)(2^n) which is an element of {0,,2n1}\{0,\dots,2^n-1\}. In particular, fn(S)f(S)f_n(S)\rightarrow f(S). This shows that if STS\neq T are two distincts subsets of BB, then fn(S)fn(T)f_n(S)\neq f_n(T) for all but finitely many integers~nn.

Enumerate a large subset X1={x0,}X_1=\{x_0,\dots\} of XX such that AxnA_{x_n} has at least 2n2^n elements and enumerate those elements. For any S2NS\in 2^{\mathbf N}, consider the family a(S)=(ax)a(S)=(a_x) of Ax\prod A_x such that for every nnaxna_{x_n} is the fn(S)f_n(S)th element of AxnA_{x_n}, and axa_x is any fixed element of AxA_x otherwise. Let a(S)a^*(S) be the class of this family in AA^*. If STS\neq T, then fS(n)fT(n)f_S(n)\neq f_T(n) for all but finitely many nn, so that a(S)a(S) and a(T)a(T) differ in a large set (the complement to a finite set in a large set), hence a(S)a(T)a^*(S)\neq a^*(T). We have thus constructed an injective map from 2N2^{\mathbf N} to AA^*, hence Card(A)2N\mathrm{Card}(A^*)\geq 2^{\mathbf N}.

2) An “ultrafilter proof” of the infinite Ramsey theorem

Any infinite graph contains an infinite subset which is either a complete subgraph (any two vertices are linked by an edge) or a discrete subgraph (no two vertices are linked by an edge). In this statement, a graph is assumed to have no loop.

Let VV be the set of vertices of your graph GG. Choose a non principal ultrafilter on VV. Say a vertex vVv\in V is social if the set of its neighbors (those linked to vv by an edge of GG) is large; otherwise, say it is lonely. Now, assuming that the set of social vertices is large, we will construct an infinite complete subgraph of GG. Otherwise, the set of lonely vertices is large and the same construction (or the consideration of the opposite graph) gives an infinite discrete subgraph.

Fix a social vertex v0v_0. The set of social neighbors of v0v_0 is large, because it is the intersection of the large set of social vertices with the large set of neighbors of v0v_0; so v0v_0 has a social neighbor v1v_1. Assume we have constructed social vertices v0,,vn1v_0,\dots,v_{n-1} which are pairwise linked by an edge, the set of social vertices of VV which are linked to all of the viv_i is large, again because it is the intersection of the large set of social vertices with finitely many large sets of neighbors of each viv_i. So there is a social vertex vnv_n which is a neighbor of v0,,vn1v_0,\dots,v_{n-1}. Go on.

This infinite Ramsey theorem implies the finite one: For any integer nn, there is an integer r(n)r(n) such that  each graph with at least r(n)r(n) vertices possesses a subgraph of cardinality nn which is either complete or discrete. The proof uses ultrafilters again, but in a different way. Assume by contradiction that the finite Ramsey theorem is false. So there is an integer nn and a family (Gi)(G_i) of finite graphs with an increasing number of vertices, none of which contains a complete or discrete subgraph with nn vertices. Take the ultraproduct GG of the graphs GiG_i, namely the product graph Gi\prod G_i modulo the equivalence relation (gi)(gi)(g_i)\sim (g'_i) iff the set of ii such that gi=gig_i=g'_i
is large, for some given non principal ultraproduct on the set II of indices. Write [gi][g_i] for the class of a family (gi)(g_i); say that [gi][g_i] is linked to [gi][g'_i] if the set of indices ii for which gig_i is linked to gig'_i in GiG_i is large. Then GG is a graph, an infinite graph (by the result on cardinalities of ultraproducts).

While I write this proof, I realize that this proof prompts naturally the question of the Ramsey theorem for cardinals: Assuming VV has some (infinite) cardinality, what is the smallest cardinality for which
there may not exist an either complete or discret subgraph of that cardinality? For the previous proof to work, one needs transfinite induction, so one needs the ultrafilter to be stable by possibly infinite intersections.

3) Ax's proof of Hilbert's Nullstellensatz

Let kk be an algebraically closed field, let f1,,fmf_1,\dots,f_m be polynomials in dd variables with coefficients in kk. Assume that the system of polynomial equations
f1(x1,,xd)==fm(x1,,xd)=0 f_1(x_1,\dots,x_d)=\dots=f_m(x_1,\dots,x_d)=0
has a solution in some extension KK of kk. Then, it already has a solution in kk.

The following proof is given by Ax in the surprising paper A Metamathematical Approach to some Problems in Number Theory, published in the proceedings of a conference  (Proc. Symp. Pure Math., 20, Amer. Math. Soc). I say surprising, because the main part of this 30-pages paper is about 6 pages, the rest is an appendix providing a beautiful introduction the theory of valuations!

I basically copy Ax. One may assume that both fields kk and KK are countable and algebraically closed (first replace kk by the algebraic closure of the subfield generated by the coefficients
of the polynomials f1,,fmf_1,\dots,f_m; then replace KK by the algebraic closure of the subfield generated over kk by the coordinates (x1,,xd)(x_1,\dots,x_d) of some solution). 

Consider a nonprincipal ultrafilter on the set N\mathbf N of integers and look at the ultraproducts kk^* and KK^* of the families (kn)(k_n) and (Kn)(K_n), where kn=kk_n=k and Kn=KK_n=K for each KK. These are algebraically closed fields; moreover,  kk^* contains kk (the set of classes of all families (an)(a_n) where anaa_n\equiv a for  some aka\in k). From a solution (x1,,xd)(x_1,\dots,x_d) in KK and considering the class of the constant family, one gets a solution (x1,,xd)(x^*_1,\dots,x^*_d) in (K)d(K^*)^d of the system of equations f1(x)==fm(x)=0f_1(x^*)=\dots=f_m(x^*)=0.

By set theory, Card(k)NN=2N\mathrm{Card}(k^*)\leq {\mathbf N}^{\mathbf N}=2^{\mathbf N}, while we have shown that Card(k)2N\mathrm{Card}(k^*)\geq 2^{\mathbf N}, so that kk^* has the cardinality of the continuum. Likewise, KK^* has the cardinality of the continuum. Consequently (by set theory again),  kk^* and KK^* both have a transcendence basis over kk of cardinality the continuum; since they are algebraically closed, they are isomorphic as extensions of kk. The image of the solution xx^* by a kk-isomorphism from KK^* to kk^* is a solution yy^* of our system in kk^*. By definition, y=(y1,,yd)y^*=(y^*_1,\dots,y^*_d) where for each ii, yiy^*_i is the class of a sequence (yi,n)(y_{i,n}) of elements of kk. Since fj(y)=0f_j(y^*)=0, the set of integers nn such that fj(y1,n,,yd,n)=0f_j(y_{1,n},\dots,y_{d,n})=0 is large. Consequently, there is a large set of integers nn such that (y1,n,,yd,n)(y_{1,n},\dots,y_{d,n}) is a solution of our system. Any such integer nn is a proof that this system already has a solution in kk. Q.E.D.

I do no know whether this proof is really easier than the ones usually given in an algebra course, for all the set theory involved is rather delicate, but I found it rather appealing.



No comments :

Post a Comment