A review and proof of the mathematics of concentration of volume to near the surface in high-dimensional spheres. In particular, an intuitive probabilistic perspective based on collections of dice.
Introduction
The Dice Thrower’s Perspective
Let’s explore this concept from a dice thrower’s perspective, the probabilistic interpretation of rolling pools of dice and examining the highest result: roll n six-sided dice nd6 and consider the roll a success if any die shows the high possible value: a six. We intuitively know that rolling more dice, increasing n. Mathematically we write this as:
p(success;n)=1−(65)n.
Extension to n dice with k sides
More generally roll nk-sided dice and consider the roll a success if at least one die shows a k.
p(success;n)=1−(kk−1)n,
which can be envisioned as an n-dimensional cube of side length k. Highlight the shell of the cube: where xi=k, for some i∈{1,...,n}. With two parameters we can consider multiple limits:
n fixed and k→∞,
k fixed and n→∞,
the multi-scale limit of k:=k(m) and n:=n(m) while m→∞.
For fixed k as n→∞:
(kk−1)n→0.
For fixed n as k→∞:
(kk−1)n→1.
Next, we can learn a lot about our system by considering the multi-scale limit: k=k(m) and n=n(m) as m→∞. By continuity of the exponential, let’s first transform the problem:
The previously considered limits: constant k and constant n are special cases of this limit and provides an interesting nugget of intuition. For constant k(m):=k∗ and n(m):=m, limm→∞L(k∗,m;m)=∞, but we don’t learn how quickly this limit diverges. Of course, a quick look at the definition of L(k,n;m) shows that it diverges linearly. To show this rigorously, one considers not limm→∞L, but instead:
m→∞limmL(k∗,m;m)=logk∗k∗−1=C (a constant).
Similarly for the constant n case: n(m)=n∗ and k(m)=m, limm→∞L(m,n∗;m)=0, the philosophical counterpart to ∞; again, hiding the convergence rate in the sinkhole of information that is 0 (or ∞):
m→∞limm⋅L(n∗,m;m)=n∗m→∞limmlog(mm−1)=−n∗=C′.
This all guides us to selecting a multi-scale limit procedure for k and n simultaneously, and if we choose the correct relative scaling. Let k(m)=m and n(m)=m. Let’s temporarily detour returning directly to the dice-rolling setting with a numerical experiment. For
a single (n=1) six-sided die (k=6), the probability of success is 61.
n=2, k=6 dice, the probability of a success is ≈0.3
… →1
However, how many sides to the dice would give us approximately the same rate of success? We can answer this by solving
1−(kk−1)2≈61.
This may not have an integer solution, so we will take the integer valued k which gives at least61 rate of success. This gives us