Let
T1,…,Tn be indeterminates. For all
m≥0, denote by
Sm the
mth
elementary symmetric polynomial, given by
Sm=i1<⋯<im∑Ti1…Tim.
These are polynomials in
T1,…,Tn, and as their names suggest, these polynomials are symmetric, meaning that
Sm(Tσ(1),…,Tσ(n))=Sm(T1,…,Tn)
for any permutation
σ of
{1,…,n}.
By definition, one has
S0=1 (there is exactly one family of integers, the empty one, and the empty product is equal to~
1) and
Sm=0 for
m>n (there are no families of integers
1≤i1<⋯<im≤n).
A well-known theorem asserts that any symmetric polynomial in
T1,…,Tn
(with coefficients in a commutative ring
R) can be written uniquely as a polynomial in
S1,…,Sn (with coefficients in
R).
It is in particular the case for the Newton polynomials, defined for
p≥0 by
Np=T1p+⋯+Tnp.
In particular,
N0=n and
N1=S1. The next Newton polynomial can be computed
by “completing the square”:
N2=T12+⋯+Tn2=(T1+⋯+Tn)2−2S1=S12−2S1.
These are the first two (or three) of a family of relations,
called the
Newton identities, that relate the polynomials
Np and the polynomials
Sm:
m=0∑p−1(−1)mNp−m⋅Sm+(−1)pp⋅Sp=0.
Using that the coefficient of
Np is given by
S0=1, they allow to express inductively the polynomials
Np in terms of
S1,…,Sp.
If
p>n, all terms with
m>n vanish.
Using that the coefficient of
Sp is
N0=p, these formulas also show, conversely, that if
p! is invertible in the commutative ring
R, then
S1,…,Sp can be expressed in terms of
N1,…,Np.
There are various proofs of these identities. Most of the one I have seen are algebraic in nature, in so that they rely on the relations between the roots of a polynomial and its coefficients, and/or on expansion in power series.
The following proof, due to Doron Zeilberger (1984) (
Discrete Math. 49, p. 319, doi:
10.1016/0012-365X(84)90171-7), is purely combinatorial.
I have been made aware of it because it is the one that is formalized in the proof assistant system Lean. (See
there for the source code.)
We consider the set
S={1,…,n} and define a set
X whose elements are the pairs
(A,k) satisfying the following properties:
- A is a subset of S;
- k is an element of S;
- the cardinality of A is less or equal than p;
- if Card(A)=p, then k∈A.
Define a map
f:X→X by the following rule: for
(A,k) in
X, set
- f(A,k)=(A∖{k},k) if k∈A;
- f(A,k)=(A∪{k},k) if k∈/A.
Write
f(A,k)=(A′,k); observe that this is again an element of
X. The first two conditions are obvious. In the first case, when
k∈A, then the cardinality of
A′ is one less than the cardinality of
A, hence is at most
p−1, and the last condition is vacuous. In the second case, when
k∈/A, the cardinality of
A
is strictly smaller than
p, because
(A,k)∈X, so that the cardinality of
A′ is at most
p; moreover,
k∈A′ hence the last condition holds.
Observe that
f:X→X is an involution:
f∘f=idX.
Indeed, with the same notation, one has
f(A,k)=(A′,k). In the first case,
k∈A, hence
k∈/A′, so that
f(A′,k)=(A′∪{k},k)=(A,k);
in the second case, one has
m∈A′, hence
f(A′,k)=(A′∖{k},k)=(A,k)
since
A′=A∪{k} and
k∈/A.
Moreover,
f has no fixed point because it switches pairs
(A,k) such that
k∈A
and pairs such that
k∈/A.
Zeilberger's proof of the Newton identities build on a function
w:X→R[T1,…,Tn] such that
w(f(A,k))=−f(A,k) for all
(A,k)∈X.
Since
f has no fixed point, the expression vanishes
(A,k)∈X∑w(A,k)
since one can cancel a term
(A,k) with its image
f(A,k).
Zeilberger's definition of the function
w is
w(A,k)=(−1)Card(A)Tkp−Card(A)⋅a∈A∏Ta.
It satisfies the desired relation: for
(A,k)∈X such that
k∈/A, one has
f(A,k)=(A∪{k},k), so that
w(f(A,k))=(−1)Card(A)+1Tkk−Card(A)−1a∈A∪{k}∏Ta=−(−1)Card(A)Tkk−Card(A)a∈A∏Ta=w(A,k).
When
k∈A, one has
f(A,k)=(A′,k) with
k∈/A′, and the formula applied to
(A′,k) implies the one for
(A,k), because
f is an involution.
Now, in the relation
∑(A,k)w(A,k)=0, we first sum on
A and then on
k:
A⊂S∑k∣(A,k)∈X∑(−1)Card(A)Tkp−Card(A)a∈A∏Ta=0
and put together all those sets
A which have the same cardinality, say,
m. This gives
m=0∑n(−1)mCard(A)=m∑a∈A∏Ta⋅k∣ (A,k)∈X∑Tkp−m=0.
Let us compute separately the contribution to the sum of each
m.
When
m>p, no subset
A is authorized so that the corresponding sum is zero.
On the opposite, when
m<p, for any subset
A such that
Card(A)=m,
all pairs
(A,k) belong to
X, so that the inner sum is
Np−m,
and the whole term is
(−1)mSmNp−m.
Finally, when
m=p, the only
k that appear in the inner sum are the elements of
A, and the corresponding term is
∑k∈ATk0=Card(A)=p;
this term contributes to
(−1)ppSp.