This post is inspired by a
paper of Azé and Hiriart-Urruty published in a French high school math journal; in fact, it is mostly a paraphrase of that paper with the hope that it be of some interest to young university students, or to students preparing Agrégation. The topic is Rolle's theorem.
1. The one-dimensional theorem, a generalization and two other proofs
Let us first quote the theorem, in a nonstandard form.
Theorem. —
Let I=]a;b[ be a nonempty but possibly unbounded interval of R and let f:I→R be a continuous function. Assume that f has limits at a and b, equal to some element ℓ∈R∪{+∞}. Then f is bounded from below.
-
If infI(f)<ℓ, then there exists a point c∈I such that f(c)=infI(f). If, moreover, f has a right derivative and a left derivative at c, then fl′(c)≤0 and fr′(c)≥0.
-
If infI(f)≥ℓ, then f is bounded on I and there exists a point c∈I such that f(c)=supI(f). If, moreover, f has a right derivative and a left derivative at c, then fl′(c)≥0 and fr′(c)≤0.
Three ingredients make this version slightly nonstandard:
- The interval I may be taken to be infinite;
- The function f may tend to +∞ at the endpoints of I;
- Only left and right derivatives are assumed.
Of course, if f has a derivative at each point, then the statement implies that f′(c)=fl′(c)=fr′(c)=0.
a) As stated in this way, the proof is however quite standard and proceeds in two steps.
- Using that f has a limit ℓ which is not −∞ at a and b, it follows that there exists a′ and b′ in I such that a<a′<b′<b such that f is bounded from below on ]a;a′] and on [b′;b[. Since f is continuous on the compact interval [a′;b′], it is then bounded from below on I.
If infI(f)<ℓ, then we can choose ℓ′∈R such that infI(f)<ℓ′<ℓ and a′, b′ such that f(x)>ℓ′ outside of [a′;b′]. Then, let c∈[a′;b′] such that f(c)=inf[a′;b′](f); then f(c)=infI(f).
If supI(f)>ℓ, then we have in particular ℓ=+∞, and we apply the preceding analysis to −f.
In the remaining case, infI(f)=supI(f)=ℓ and f is constant.
- For x>c, one has f(x)≥f(c), hence fr′(c)≥0; for x<c, one has f(x)≥f(c), hence fl′(c)≤0.
The interest of the given formulation can be understood by looking at the following two examples.
- If f(x)=∣x∣, on R, then f attains its lower bound at x=0 only, where one has fr′(0)=1 and fl′(0)=−1.
- Take f(x)=e−x2. Then there exists c∈R such that f′(c)=0. Of course, one has f′(x)=−2xe−x2, so that c=0. However, it is readily seen by induction that for any integer n, the nth derivative of f is of the form Pn(x)e−x2, where Pn has degree n. In particular, f(n) tends to 0 at infinity. And, by induction again, the theorem implies that Pn has n distinct roots in R, one between any two consecutive roots of Pn−1, one larger than the largest root of Pn, and one smaller than the smallest root of Pn.
b) In a 1959 paper, the Rumanian mathematician Pompeiu proposed an alternative proof of Rolle's theorem, when the interval I is bounded, and which works completely differently. Here is how it works, following the 1979
paper published in
American Math. Monthly by Hans Samelson.
First of all, one uses the particular case n=2 of the Levi chord lemma :
Lemma. —
Let f:[a;b]→R be a continuous function such that f(a)=f(b). For every integer n≥2, there exists a′,b′∈[a;b] such that f(a′)=f(b′) and b′−a′=(b−a)/n.
Let h=(b−a)/n. From the equality
0=f(b)−f(a)=(f(a+h)−f(a))+(f(a+2h)−f(a+h))+⋯+(f(a+nh)−f(a+(n−1)h),
one sees that the function x↦f(x+h)−f(x) from [a;b−h] to R does not have constant sign. By the intermediate value theorem, it vanishes at some point a′∈[a;b−h]. If b′=a′+h, then b′∈[a;b], b′−a′=(b−a)/n and f(a′)=f(b′).
Then, it follows by induction that there exists a sequence of nested intervals ([an;bn]) in [a;b] with f(an)=f(bn) and bn−an=(b−a)/2n for all n. The sequences (an) and (bn) converge to a same limit c∈[a;b]. Since f(bn)=f(c)+(bn−c)(f′(c)+o(1)), f(an)=f(c)+(an−c)(f′(c)+o(1)), one has
f′(c)=limbn−anf(bn)−f(an)=0.
What makes this proof genuinely distinct from the classical one is that the obtained point c may not be a local minimum or maximum of f, also I don't have an example to offer now.
c) In 1979, Abian furnished yet
another proof, which he termed as the “ultimate” one. Here it is:
It focuses on functions f:[a;b]→R on a bounded interval of R which are not monotone and, precisely, which are up-down, in the sense that f(a)≤f(c) and f(c)≥f(b), where c=(a+b)/2 is the midpoint of f. If f(a)=f(b), then either f or −f is up-down.
Then divide the interval [a;b] in four equal parts: [a;p], [p;c], [c;q] and [q;b]. If f(p)≥f(c), the f∣[a;c] is up-down. Otherwise, one has f(p)≤f(c). In this case, if f(c)≥f(q), we see that f∣[p;q] is up-down. And otherwise, we observe that f(q)≤f(c) and f(c)≥f(b), so that f∣[c;b] is up-down. Conclusion: we have isolated within the interval [a;b] a subinterval [a′;b′] of length (b−a)/2 such that f∣[a′;b′] is still up-down.
Iterating the procedure, we construct a sequence ([an;bn]) of nested intervals, with (bn−an)=(b−a)/2n such that the restriction of f to each of them is up-down. Set cn=(an+bn)/2.
The sequences (an),(bn),(cn) satisfy have a common limit c∈[a;b]. From the inequalities f(an)≤f(cn) and an≤cn, we obtain f′(c)≥0; from the inequalities f(cn)≥f(bn) and cn≤bn, we obtain f′(c)≤0. In conclusion, f′(c)=0.
2. Rolle's theorem in normed vector spaces
Theorem. —
Let E be a normed vector space, let U be an open subset of E and let f:U→R be a differentiable function. Assume that there exists ℓ∈R∪{+∞} such that f(x)→ℓ when x tends to the “boundary” of U — for every ℓ′<ℓ, there exists a compact subset K of U such that f(x)≥ℓ′ for all x∈U but x∈K. Then f is bounded below on U, there exists a∈U such that f(a)=infU(f) and Df(a)=0.
The proof is essentially the same as the one we gave in dimension 1. I skip it here.
If E is finite dimensional, then this theorem applies in a vast class of examples : for example, bounded open subsets U of E, and continuous functions f:U→R which are constant on the boundary ∂(U)=U−U of U and differentiable on U.
However, if E is infinite dimensional, the closure of a bounded open set is no more compact, and it does not suffice that f extends to a function on U with a constant value on the boundary.
Example. — Let E be an infinite dimensional Hilbert space, let U be the open unit ball and B be the closed unit ball. Let g(x)=21⟨Ax,x⟩+⟨b,x⟩+c be a quadratic function, where A∈L(E), b∈E and c∈R, and let f(x)=(1−∥x∥2)g(x). The function f is differentiable on E and one has
∇f(x)= (1−∥x∥2)(Ax+b)−2(21⟨Ax,x⟩+⟨b,x⟩+c)x.
Assume that there exists x∈U such that ∇f(x)=0. Then Ax+b=λx, with
λ=1−∥x∥22(21⟨Ax,x⟩+⟨b,x⟩+c).
Azé and Hiriart-Urruty take E=L2([0;1]), for A the operator of multiplication by the function t, b(t)=t(1−t), and c=4/27. Then, one has g(x)>0, hence λ>0, and x(t)=λ−t1b(t) for t∈[0;1]. This implies that λ≥1, for, otherwise, the function x(t) would not belong to E. This allows to compute λ in terms of μ, obtaining λ≤3/4, which contradicts the inequality λ≥1. (I refer to the paper of Azé and Hiriart-Urruty for more details.)
3. An approximate version of Rolle's theorem
Theorem. —
Let B the closed euclidean unit ball in Rn, let U be its interior, let f:B→R be a continuous function on B. Assume that ∣f∣≤ϵ on the boundary ∂(U) and that f is differentiable on U. Then there exists x∈U such that ∥Df(x)∥≤ϵ.
In fact, replacing f by f/ϵ, one sees that it suffices to treat the case ϵ=1.
Let g(x)=∥x∥2−f(x)2. This is a continuous function on B; it is differentiable on U, with ∇g(x)=2(x−f(x)∇f(x)). Let μ=infB(g). Since g(0)=−f(0)2≤0, one has μ≤0. We distinguish two cases:
- If μ=0, then ∣f(x)∣≤∥x∥ for all x∈B. This implies that ∥∇f(0)∥≤1.
- If μ<0, let x∈B be such that g(x)=μ; in particular, f(x)2≥∥x∥2−μ>0, which implies that f(x)=0. Since g≥0 on ∂(U), we have x∈B, hence ∇g(x)=0. Then x=f(x)∇f(x), hence ∇f(x)=x/f(x). Consequently,
∥∇f(x)∥≤f(x)∥x∥≤(∥x∥2−μ)1/2∥x∥<1.
This concludes the proof.
Thanks to the
Twitter users @AntoineTeutsch, @paulbroussous and @apauthie for having indicated me
some misprints and incorrections.