Suppose the assumptions to guarantee that our Markov chain converges to its invariant distribution. We now ask

How fast does it converges?

To formalize this we need a notion of distance, we use the total variation:

For a finite markov chain there exists such that

where the constants depend on the second largest eigenvector of the transition matrix (think of the Power method).

This could depend on the starting distribution, so we define (finite markov chains)

the distance at time . With this we define

The following lemma is usefull for establishing bounds on the mix time

Lemma Let and and let be a coupling of the markov chains. Then

Proof Given any we have

We can use this when has initial distribution (stationary) to bound the mixing time.

Lemma (coupling lemma) Let be a coupling, if for any there exists such that

then .

Proof We use the previous lemma and the definition of :

since .

Card shuffling, r.w. lazy on hypercube