I. Wigner’s theorem
Definition
Let . The probability distribution on
defined by
is called the semi-circular distribution. We denote

Wigner’s Theorem
– Let i.i.d. centered, complex valued and with
.
– Let i.i.d. centered, real valued with
.
– The ‘s and the
‘s (
) are independent.
– Consider and
the
hermitian matrices defined by
Then almost surely,
where




Proof
- Show that semi circular law is characterized by its moments (using a corollary of Carleman theorem). Compute the moments of the smi-circular law (and find the Catalan numbers).
- Compute the moments of the spectral density of the Wigner matrix.
- Show that the moments convergences to the Catalan numbers.
- Convergence of moments + limit distribution characterized by moments -> weak convergence
Additional results
- If
, then
In particular, - If
, then
Remark.
We refer to the paper “The eigenvalues of random symmetric matrices” to get an extension of the Wigner’s Theorem.
II. Marcenko pastur
1. Preliminaries
a) Key results for the proof of the Marcenko-Pastur Theorem
Definition
Let us consider a probability measure on
. The Stieltjes transform of
denoted
is defined by
Theorem (Properties of the Stieltjes Transform)
Let be some measure on
with Stieltjes Transform
.
is analytic on
.
- If Supp(
)
, then
.
.
.
- For any
is bounded and continuous,
- For all
points of continuity of
,
Remark. The previous Theorem states that the Stieltjes Transform is analytic on and when
goes to the real axis, the holomorph property is lost but this allows to recover the measure
(see properties 6, 7 and 8).
Theorem (Weak convergence and pointwise convergence of Stieltjes Transform)
Let us consider probability measures on
.
- If
, then
- a) Let us consider
with an accumulation point. If
for all
, then
– there exists a measuresatisfying
such that
–
b) If it also holds that(i.e.
from point 4) of the previous Theorem), then
is a probability measure and
.
The previous Theorem can be seen as the counterpart of the famous Levy Theorem. The Levy Theorem states the link between the weak convergence and the pointwise convergence of the characteristic function.
Definition
The characteristic function of a real-valued random variable is defined by
Theorem (Levy)
Let us consider real-valued random variables and
.
- If
, then
- If there exists some function
such that
is continuous at
, then
– there existsa real valued random variable such that
.
–.
Just for the sake of beauty, let me mention the following result that can allow to identify functions that can be written as the Stieltjes Transform of some measure on
.
Theorem (Recognize a Stieltjes Transform)
If satisfies
is analytic
- There exists
such that
Then there exists a unique measure on
satisfying
such that
If additionally it holds
4. ,
Then .
Example: If is the Stieltjes Transform of some measure
then
is the Stieltjes Transform of some probability measure
on
.
b) Proof of the key preliminary result
We will need the following additional properties in the proof.
Theorem (Helly’s selection theorem)
From every sequence of proba measures , one can extract a subsequence that converges vaguely to a measure
. (Note that
is not necessary a probability measure).
Remark. vaguely implies
for every
where
Proof.
1) To prove that for any ,
, we show that for any
,
and
. Let us consider some
. Since
and since





Using an analogous approach, one can show that
2a) We consider some set with an accumulation point such that
.
By the Helly’s selection Theorem, there exists a subsequence of the sequence of probabilities
that converges vaguely to some measure
on
. Let us consider some
. Using the previous remark and since
belongs to
, we get that
Since



(1)
We know from the properties of the Stieltjes Transform that



We also get that




Using again the analytic continuation, we obtain that
leading to


2b) This directly follows from the equivalece between points and
of the following Lemma.
Lemma
Let be probability measures and
be a measure on
. The following statements are equivalent.
.
and
.
and
is tight, namely for any
, there exists some compact set
such that
2. Marcenko-Pastur Theorem
Theorem (Marcenko-Pastur)
Let us consider a matrix
with i.i.d. entries such that
with and
of the same order and
the spectral measure of
:
Then, almost surely (i.e. for almost every realization),
where is the Marcenko-Pastur distribution
with
In what follows, we denote the Stieltjes Transform of the measure
.
Remark
- The behavior of the spectral measure brings information about the vast majority of the eigenvalues but is not affected by some individual eigenvalues’ behavior. For example, one may have
without affecting the behaviour of the whole sum.
- The Dirac measure at zero is an artifact due to the dimensions of the matrix if
.
- If
, that is
, then typical from the usual regime “small dimensional data vs large samples”. The support of Marcenko-Pastur distribution
concentrates around
and
(2)
This will allow to show that there exists some set

Indeed, suppose that we know that (2) holds et let us consider some sequence








(3)
holds for any





(note that we used the continuity of the functions




We will then conclude the proof of the Theorem using the Theorem of the previous section (called Weak convergence and pointwise convergence of Stieltjes Transform).
To prove (2), we use the decomposition
- To deal with the term
, we use the Efron-Stein inequality to show that
which allows to prove that almost surelytends to
using Borel-Cantelli Lemma.
- We then show that
satisfies
(4)
and thatsatisfies the equation
(5)
- To conclude the proof (i.e. to get (2)), we use some “stability” result. More precisely, equations (4) and (5) are close and we need to show that solutions of these equations are as a consequence, close to each other. We formalize this result with the following Lemma.
Lemma
Let . We assume that there exists two Stieltjes Transform of probability measures on
, denoted
, that are respectively the solutions of the equation
where

Let
and
Theorem (convergence of extremal eigenvalues)
- If
, then
- If
, then
Remark.
Exactly like with Wigner matrices, when the 4th moment of the random varaibles are not finite,
goes to
. However, contrary to the Wigner case, for Wishart matrices
still converges to a finite value.
Theorem (Fluctuations of and Tracy-Widom distribution)
We can fully describe the fluctuations of :
where