Suppose the assumptions to guarantee that our Markov chain converges to its invariant distribution. We now ask
How fast does it converges?
To formalize this we need a notion of distance, we use the total variation:
For a finite markov chain there exists such that
where the constants depend on the second largest eigenvector of the transition matrix (think of the Power method).
This could depend on the starting distribution, so we define (finite markov chains)
the distance at time . With this we define
The following lemma is usefull for establishing bounds on the mix time
Lemma Let and and let be a coupling of the markov chains. Then
Proof Given any we have
We can use this when has initial distribution (stationary) to bound the mixing time.
Lemma (coupling lemma) Let be a coupling, if for any there exists such that
then .
Proof We use the previous lemma and the definition of :
since .
Card shuffling, r.w. lazy on hypercube