Monday, May 11, 2015

Model theory and algebraic geometry, 3 — Real closed fields and o-minimality

In this third post devoted to some interactions between model theory and algebraic geometry, we describe the concept of o-minimality and the o-minimal complex analysis of Peterzil and Starchenko.

1. Real closed fields and the theorem of Tarski-Seidenberg

To begin with, we work in the language LorL_{\mathrm{or}} of ordered rings which is the language of rings Lr={+,,,0,1}L_{\mathrm r}=\{+,-,\cdot,0,1\} enlarged with an order relation \leq.

Let us recall the definition of a real closed field: this is an field KK endowed with an ordering which is compatible with the field laws (the sum of positive elements is positive and the product of positive elements is positive) which satisfies the intermediate value theorem for polynomials: for every polynomial PK[T]P\in K[T], any pair (a,b)(a,b) of elements of KK such that a<ba<b, P(a)<0P(a)<0 and P(b)>0P(b)>0, there exists cKc\in K such that P(c)=0P(c)=0 and a<c<ba<c<b. Observe that this property can be expressed by a sequence of first-order formulas, one for each degree.

The field R\mathbf R of real numbers is real closed, but there are many other. For example, the field of formal Puiseux series with real coefficients is also real closed.

A theorem of Artin-Schreier asserts that a field KK is real closed if and only if 1∉K\sqrt{-1}\not\in K and K(1)K(\sqrt{-1}) is an algebraic closure of KK. This is also equialent to the fact that “the” algebraic closure of KK is a finite non-trivial extension of KK. While the algebraic notion adapted to the language of rings is that of an algebraically closed field, the notion of a real closed field is the one which is adapted to the language of ordered rings. In model theoretic terms, the theory of real closed fields is the model companion of the theory of ordered fields.

The analogue of the theorem of Chevalley is the classical theorem of Tarski-Seidenberg:

Theorem (Tarski-Seidenberg). — The theory of real closed fields eliminates quantifiers in the language of ordered rings.

There is a very classical example of this theorem, namely, the resolution of polynomial equation of degree 2. Indeed, in a real closed field, every positive element has a square root (if a>0a>0, then the polynomial T2aT^2-a is negative at 00 and positive at max(a,1)\max(a,1), so that it admits a positive root). The usual algebraic computation thus shows that the formula x,x2+ax+b=0\exists x, x^2+ax+b=0 is equivalent to the formula a24b0a^2-4b\geq 0.

Corollary 1. — If MM is a real closed field and AA is a subset of AA, then Def(Mn,A)\mathop{\rm Def}(M^n,A) is the set of all semi-algebraic subsets of MM defined by polynomials with coefficients in AA.

Corollary 2. — If MM is a real closed field, the definable subsets of MM are the finite unions of intervals (open, closed or half-open, ]a;b[\mathopen]a;b\mathclose[, ]a;b]\mathopen]a;b], [a;b[[\mathopen a;b\mathclose[, [a;b][a;b], possibly unbounded, possibly reduced to singletons).

2. O-minimality

The seemingly innocuous property stated in corollary 2 leads to a definition which is surprisingly important and powerful.

Definition. — Let TT be the theory of a real closed field MM in an expansion LL of the language of ordered rings. One says that TT is o-minimal if the definable subsets of MM are the finite unions of intervals.

It is a non-trivial result that the o-minimality is indeed a property of the theory TT, and not a property of the model MM: if it holds, then for every elementary extension NN of MM, the definable subsets of NN still are finite unions of intervals.

By the theorem of Tarski-Seidenberg, the theory of real closed fields is o-minimal. The discovery of more complicated o-minimal theories is a remarkable fact from the 80s.

Example. — Let Lan,expL_{\mathrm{an},\mathrm{exp}} be the language obtained by adjoining to the language LorL_{\mathrm{or}} of ordered rings symbols of functions exp\exp and ff, for every real analytic function f ⁣:[0;1]nRf\colon [0;1]^n\to\mathbf R. The field of real numbers is viewed as a structure for this language by interpreting exp\exp as the exponential function from R\mathbf R to R\mathbf R, and every function symbol ff as the function from Rn\mathbf R^n to R\mathbf R that maps xx to f(x)f(x) if x[0;1]nx\in [0;1]^n, and to 00 otherwise. The theory (denoted Ran,exp)\mathbf R_{\mathrm{an},\mathrm{exp}})) of R\mathbf R in this language is o-minimal.

This is a thorem of van den Dries and Miller; the case of LanL_{\mathrm{an}} (without the exponential function) had been established Denef and van den Dries, while the case of LexpL_{\mathrm{exp}} is due to Wilkie.

To give a non-example, let us consider the language obtained by adjoining a symbol sin\sin and view R\mathbf R as a structure for this language, the symbol sin\sin being interpreted as the sine function from R\mathbf R to R\mathbf R. Then the theory of R\mathbf R in this language is not o-minimal. Indeed, the set 2πZ2\pi\mathbf Z is definable by the formula sin(x)=0\sin(x)=0, but 2πZ2\pi\mathbf Z has infinitely many connected components, so is not a finite union of intervals.

One motivation for o-minimality is that it realizes (part of) Grothendieck quest towards tame topology as described in his Esquisse d'un programme. Indeed, sets which are definable in an o-minimal structure have many tameness properties:
  • The interior, the closure, the boundary of a definable set is definable.
  • Every definable set is homeomorphic to (the topological realization) of a simplicial complex
  • Every definable set has a celllular decomposition. Precisely, let us call a cell of Rn+1\mathbf R^{n+1} any subset CC of the following form: one is given a definable subset AA of Rn\mathbf R^n and definable functions f,g ⁣:ARf,g\colon A\to\mathbf R such that f(x)<g(x)f(x)<g(x) for every xAx\in A, and the set CC is defined by the condition xAx\in A, and by one of the conditions t<f(x)t<f(x), or t=f(x)t=f(x), or f(x)<t<g(x)f(x)<t<g(x), or t>f(x)t>f(x).  Then for every finite family (Bi)(B_i) of definable subsets of Rn+1\mathbf R^{n+1}, there is a finite partition of Rn+1\mathbf R^{n+1} into cells such that every BiB_i is a union of cells.
  • Every definable function is piecewise smooth.
  • Definable continuous functions are definably piecewise trivial (theorem of Hardt): for every function f ⁣:XYf\colon X\to Y between definable sets which is definable and continuous, there is a finite partition (Yi)(Y_i) of YY into definable subsets such that the map fi ⁣:f1(Yi)Yif_i\colon f^{-1}(Y_i)\to Y_i deduced from ff by restriction is isomorphic to a projection Yi×SiYiY_i\times S_i\to Y_i.

Recently, o-minimality has had spectacular and fantastic applications via the approach of Pila-Zannier to the conjecture of Pink, leading to new proofs of the Manin-Mumford conjecture (Pila-Zannier), and to proofs of the André-Oort conjecture (Pila, Pila-Tsimerman, Klingler-Ullmo-Yafaev), and, more recently, to partial results towards the conjecture of Pink (Gao, Habegger-Pila,...). However, this is not the goal of that post, so let me refer the interested reader to Tom Scanlon's Bourbaki talk on that topic.

3. O-minimal complex analysis

The standard identification of the field C\mathbf C of complex numbers with R2\mathbf R^2 (associating with a complex number its real and imaginary parts) allows to talk of complex valued functions (on a subset of Cn\mathbf C^n) which are definable in a given language. In a remarkable series of papers, Peterzil and Starchenko have shown that holomorphic functions which are definable in an o-minimal structure possess very rigid properties. Let us quote some of their theorems.

So we fix an expansion of the language LorL_{\mathrm{or}} of which the field R\mathbf R is a structure whose theory is o-minimal. By “definable”, we mean definable in that language. The typical language considered in the applications here is the language Lan,expL_{\mathrm{an},\mathrm{exp}}.

Theorem. — Let AA be a finite subset of C\mathbf C and let f ⁣:CACf\colon \mathbf C\setminus A\to \mathbf C be a holomorphic function. If ff is definable, then it is a rational function.

Theorem. — Let VCnV\subset\mathbf C^n be a closed analytic subset. If VV is definable, then VV is algebraic.

Corollary (Theorem of Chow). — Let VPn(C)V\subset\mathbf P^n(\mathbf C) be a closed analytic subset. Then VV is algebraic.

Indeed, working on the standard charts of Pn(C)\mathbf P^n(\mathbf C), we see that VV is locally definable by analytic functions. By compactness of Pn(C)\mathbf P^n(\mathbf C), it is thus definable in the language LanL_{\mathrm{an}}. Since the theory of R\mathbf R in this language is o-minimal, the corollary is a consequence of the previous theorem.

Let us finally give an important example. Let XX be an bounded symmetric domain. This means that XX is a bounded open subset of Cn\mathbf C^n such that for every point pXp\in X, there exists a biholomorphic involution f ⁣:XXf\colon X\to X such that pp is an isolated fixed point of ff. This implies that XX is a homogeneous space G/KG/K under a semisimple Lie group GG which acts by holomorphisms, and KK is a maximal compact subgroup of GG. Moreover, XX has a canonical Kähler metric which is invariant under GG.

The most classical example is given by the Poincaré upper half-plane on which PGL(2,R)\mathrm{PGL}(2,\mathbf R) acts by homographies; of course, the upper half-plane is not bounded, but is biholomorphic to the open unit disk.

A more sophisticated example is given by the Siegel upper half-plane or, rather, its bounded version. That is, XX is the set of n×nn\times n symmetric complex matrices ZZ such that InZZ\mathrm I_n-Z^* Z is positive definite. It is a homogeneous space for the symplectic group Sp(2n,R)\mathrm{Sp}(2n,\mathbf R); the fixator of Z=0Z=0 is the unitary group U(n)U(n).

Let now Γ\Gamma be an arithmetic subgroup of Sp(2n,R)\mathrm{Sp}(2n,\mathbf R); for example, let us take Γ\Gamma be a subgroup of finite index of Sp(2n,Z)\mathrm{Sp}(2n,\mathbf Z). Then the quotient S=X/ΓS=X/\Gamma admits a structure of an analytic set and the projection p ⁣:XSp\colon X\to S is an analytic map. If Γ\Gamma is “small enough” (torsion free, say), then SS is even complex manifold manifold, and pp is a covering. An important and difficult theorem of Baily-Borel asserts that SS is an algebraic variety.

In fact, it is classical in this context that there exist Siegel sets, which are explicit subsets FF of XX such that ΓF=X\Gamma\cdot F=X and such that the set of γΓ\gamma\in\Gamma such that γFF\gamma\cdot F\cap F\neq\emptyset is finite. So Siegel sets are almost fundamental domains. An important remark is that they are semi-algebraic, that is, definable in the language of ordered rings. For example in the upper half-plane, one may take FF to be the set of all zCz\in\mathbf C such that 12(z)12-\frac12\leq \Re(z)\leq \frac12 and (z)3/2\Im(z)\geq \sqrt 3/2. One may even take “fundamental sets” (which are fundamental domains up to something of empty interior) such as the one defined by the inequalities 12(z)12-\frac12\leq \Re(z)\leq\frac12 and z1\lvert z\rvert \geq1.

Peterzil and Starchenko have proved that there restriction to FF of the projection pp is definable in the language Lan,expL_{\mathrm{an},\mathrm{exp}}. An immediate consequence is that SS is definable in this language, hence is algebraic.

These results have been generalized by Klinger, Ullmo and Yafaev to any bounded symmetric domain. This is an important technical part of their proof of the hyperbolic Ax-Lindemann conjecture.

Link to Part 4 — Elimination of imaginaries

Saturday, May 2, 2015

Model theory and algebraic geometry, 2 — Definable sets, types; quantifier elimination

This is the second post in a series of 4 devoted to the exposition of interactions between model theory and algebraic geometry. In the first one, I explained the notions of language, structures and theories, with examples taken from algebra. Here, I shall discuss the notion of definable set, of types, as well as basic results from dimension theory (ω\omega-stability).

So we fix a theory TT in a language LL. A definable set is defined, in a given model MM of TT, by a formula. More precisely, we consider definable sets in cartesian powers MnM^n of the model MM, which can be defined by a formula in nn free variables with parameters in some subset AA of MM. By definition, such a formula is a formula of the form ϕ(x;a)\phi(x;a), where ϕ(x;y)\phi(x;y) is a formula in n+mn+m free variables, split into two groups x=(x1,,xn)x=(x_1,\dots,x_n) and y=(y1,,ym)y=(y_1,\dots,y_m) and a=(a1,,am)Ama=(a_1,\dots,a_m)\in A^m is an mm-tuple of parameters; the formula ϕ(x;y)\phi(x;y) can have quantifiers and bounded variables too. Given such a formula, we define a subset [ϕ(x;a)][\phi(x;a)] of MnM^n by {xMnϕ(x;a)}\{ x\in M^n\mid \phi(x;a)\}. We write Def(Mn;A)\mathrm{Def}(M^n;A) for the set of all subsets of MnM^n which are definable with parameters in AA.

Let us give examples, where LL is the language of rings and TT is the theory ACF\mathrm{ACF} of algebraically closed fields:
  • V1={xx0}MV_1=\{x\mid x\neq 0 \}\subset M , given by the formula “x0x\neq 0” with 1 variable and 00 parameter;
  • V2={xy,2xy=1}MV_2=\{x\mid \exists y, 2xy=1\} \subset M , given by the formula “y,2xy=1\exists y, 2xy=1” with 1 free variable xx, and one bounded variable yy;
  • V3={(x,y)x2+2y2=π}C2V_3=\{(x,y)\mid x^2+\sqrt 2 y^2=\pi \}\subset \mathbf C^2, where the model C\mathbf C is the field of complex numbers, ϕ((x,y),(a,b))\phi((x,y),(a,b)) is the formula x2+ay2=bx^2+ay^2=b in 4 free variables, and the parameters are given by (a,b)=(2,π)(a,b)=(\sqrt 2,\pi).
Theorem (Chevalley). — Let LL be the language of rings, T=ACFT=\mathrm{ACF} and MM be an algebraically closed field; let AA be a subset of MM. The set Def(Mn;A)\mathrm{Def}(M^n;A) is the smallest boolean algebra of subsets of MnM^n which contains all subsets of MnM^n of the form [P(x;a)][P(x;a)] where PP is a polynomial in n+mn+m variables with coefficients in Z\mathbf Z and a=(a1,,am)a=(a_1,\dots,a_m) is an mm-tuple of elements of AA. In other words, a subsets of MnM^n is definable with parameters in AA if and only if it is constructible with parameters in AA.

The reason behind this theorem is the following set-theoretic interpretation of quantifiers and logical connectors. Precisely, if ϕ\phi is a formula in n+m+pn+m+p variables, and aApa\in A^p, the definable subset [yϕ(x,y,a)][\exists y \phi(x,y,a)] of MnM^n coincides with the image of the definable subset [ϕ(x,y;a)][\phi(x,y;a)] of Mn+mM^{n+m} under the projection px ⁣:Mn+mMnp_x \colon M^{n+m}\to M^n. Similarly, if ϕ(x)\phi(x) and ψ(x)\psi(x) are two formulas in nn free variables, then the definable subset [ϕ(x)ψ(x)][\phi(x)\wedge\psi(x)] is the union of the definable subsets [ϕ(x)][\phi(x)] and [ψ(x)][\psi(x)]. And if ϕ(x)\phi(x) is a formula in nn variables, then the definable subset [¬ϕ(x)][\neg\phi(x)] is the complement in MnM^n of the definable subset [ϕ(x)][\phi(x)].

For example, the subset V2=[y,2xy=1]V_2=[\exists y, 2xy=1] defined above can also be defined by M[2x=0]M\setminus [2x=0].

One says that the theory ACF admits elimination of quantifiers: modulo the axioms of algebraically closed fields, every formula of the language LL is equivalent to a formula without quantifiers.

An important consequence of this property is that for every extension MMM\hookrightarrow M' of models of ACF, the theory of MM' is equal to the theory of MM—one says that every extension of models is elementary.

Let pp be either 00 or a prime number. Observe that every algebraically closed field of characteristic pp is an extension of Q\overline{\mathbf Q} if p=0p=0, or of Fp\overline{\mathbf F_p} if pp is a prime number. As a consequence, for every characteristic p0p\geq0, the theory ACFp\mathrm{ACF}_p of algebraically closed fields of characteristic pp (defined by the axioms of ACF\mathrm{ACF}, and  the axiom 1+1++1=01+1+\dots+1=0 that the characteristic is pp if pp is a prime number, or the infinite list of axioms that assert that the characteristic is \neq \ell, if p=0p=0) is complete: this list of axioms determines everything that can be said about algebraically closed fields of characteristic pp.

Definition. — Let aMna\in M^n and let AA be a subset of MM. The type of aa (with parameters in AA) is the set tp(a/A)\mathrm{tp}(a/A) of all formulas ϕ(x;b)\phi(x;b) in nn free variables with parameters in AA such that ϕ(a;b)\phi(a;b) holds in the model MM.

Definition. — Let AA be a subset of MM. For every integer n0n\geq 0, the set Sn(A)S_n(A) of types (with parameters in AA) is the set of all types tp(a/A)\mathrm{tp}(a/A), where NN is an extension of MM which is a model of TT and aNna\in N^n. One then says that this type is realized in NN.

Gödel's completeness theorem allows us to give an alternative description of Sn(A)S_n(A). Namely, let pp be a set of formulas in nn free variables and parameters in AA which contains the diagram of AA (that is, all formulas which involve only elements of AA and are true in MM). Assume that pp is consistent (there exists a model NN which is an extension of MM and and element aMna\in M^n such that ϕ(a)\phi(a) holds in NN for every ϕp\phi\in p) and maximal (for every formula ϕ∉p\phi\not\in p, then for every model NN and every aNna\in N^n such that ptp(a/A)p\subset \mathrm{tp}(a/A), then ϕ(a)\phi(a) does not hold). Then pSn(A)p\in S_n(A).

For every formula ϕL(A)\phi\in L(A) in nn free variables and parameters in AA, let VϕV_\phi be the set of types pSn(A)p\in S_n(A) such that ϕp\phi\in p. Then the subsets VϕV_\phi of Sn(A)S_n(A) consistute a basis of open sets for a natural topology on Sn(A)S_n(A).

Theorem. — The topological space Sn(A)S_n(A) is compact and totally discontinuous.

Let us detail the case of the theory ACF in the langage of rings. I claim that if KK is a field, then Sn(K)S_n(K) is homeomorphic to the spectrum Spec(K[T1,,Tn])\mathop{\rm Spec}(K[T_1,\dots,T_n]) endowed with its constructible topology. Concretely, for every algebraically closed extension MM of KK and every aMna\in M^n, the homeomorphism jj maps tp(a/K)\mathrm{tp}(a/K) to the prime ideal pa\mathfrak p_a consisting of all polynomials PK[T1,,Tn]P\in K[T_1,\dots,T_n] such that P(a)=0P(a)=0.

A type p=tp(a/K)p=\mathrm{tp}(a/K) is isolated if and only if the prime ideal pa\mathfrak p_a is maximal. Consequently, if n=1n=1, there is exactly one non-isolated type in S1(K)S_1(K), corresponding to the generic point of the spectrum Spec(K[T])\mathop{\rm Spec}(K[T]).

As for any compact topological space, a space of types can be studied via its Cantor-Bendixson analysis, which is a decreasing sequence of subspaces, indexed by ordinals, defined by transfinite induction. First of all, for every topological space XX, one denotes by D(X)D(X) the set of all non-isolated points of XX. One then defines X0=XX_0=X, Xα=D(Xβ)X_{\alpha}=D(X_\beta) if α=β+1\alpha=\beta+1 is a successor-ordinal, and Xα=β<αXβX_\alpha=\bigcap_{\beta<\alpha} X_\beta if α\alpha is a limit-ordinal. For xXx\in X, the Cantor-Bendixson rank of xx is defined by rCB(x)=αr_{CB}(x)=\alpha if xXαx\in X_\alpha and x∉Xβx\not\in X_\beta for β>α\beta>\alpha, and rCB(x)=r_{CB}(x)=\infty if xXαx\in X_\alpha for every ordinal α\alpha. The set of points of infinite rank is the largest perfect subset of XX.

Let us return to the example of the theory ACF. If a type pSn(K)p\in S_n(K) corresponds to a prime ideal p=j(p)\mathfrak p=j(p) of Spec(K[T1,,Tn])\mathop{\rm Spec}(K[T_1,\dots,T_n]), its Cantor-Bendixson rank is the Zariski dimension of V(I)V(I). More generally, if FF is a constructible subset of Spec(K[T1,,Tn])\mathop{\rm Spec}(K[T_1,\dots,T_n]), then rCB(F)r_{CB}(F) is the Zariski-dimension of the Zariski-closure of FF. Moreover, the points of maximal Cantor-Bendixson rank correspond to the generic points of the irreducible components of maximal dimension; in particular, there are only finitely many of them.

Definition. — One says that a theory TT is ω\omega-stable if for every finite or countable set of parameters AA, the space of 1-types S1(A)S_1(A) is finite or countable.

The theory ACF is ω\omega-stable. Indeed, if KK is the field generated by AA, then K[T]K[T] being
a countable noetherian ring, it has only countably many prime ideals.

Since any non-empty perfect set is uncountable, one has the following lemma.

Lemma. — Let TT be an ω\omega-stable theory and let MM be a model of TT. Then the Cantor-Bendixson rank of every type xSn(M)x\in S_n(M) is finite.

Let us assume that TT is ω\omega-stable and let FF be a closed subset of Sn(M)S_n(M). Then rCB(F)=sup{rCB(x);xF}r_{CB}(F)=\sup \{ r_{CB}(x)\,;\, x\in F\} is finite, and the set of points xFx\in F such that rCB(x)=rCB(F)r_{CB}(x)=r_{CB}(F) is finite and non-empty.

This example gives a strong indication that the model theory approach may be extremly fruitful for the study of algebraic theories whose geometry is not as well developed than algebraic geometry.

Link to Part 3 — Real closed fields and o-minimality