Let
Sn=∑ni=1Xi where
{Xi}∞i=1 is a sequence of independent and identically distributed (i.i.d.) of random variables with
E[X1]=m. According to the classical law of large number (LLN), the sum
Sn/n converges strongly to
m. Moreover, the well-known central limit theorem (CLT) tells us that, with
m=0 and
s2=E[X21], for each bounded and continuous function
j we have
limnE[j(Sn/√n))]=E[j(X)] with
X∼N(0,s2).
These two fundamentally important results are widely used in probability, statistics, data analysis as well as in many practical situation such as financial pricing and risk controls. They provide a strong argument to explain why in practice normal distributions are so widely used. But a serious problem is that the i.i.d. condition is very difficult to be satisfied in practice for the most real-time processes for which the classical trials and samplings becomes impossible and the uncertainty of probabilities and/or distributions cannot be neglected.
In this talk we present a systematical generalization of the above LLN and CLT. Instead of fixing a probability measure P, we only assume that there exists a uncertain subset of probability measures
{Pq:q∈Q}. In this case a robust way to calculate the expectation of a financial loss
X is its upper expectation:
[\^(E)][X]=supq∈QEq[X] where
Eq is the expectation under the probability
Pq. The corresponding distribution uncertainty of
X is given by
Fq(x)=Pq(X≤x),
q∈Q. Our main assumptions are:
- The distributions of Xi are within an abstract subset of distributions {Fq(x):q∈Q}, called the distribution uncertainty of Xi, with [′(m)]=[\^(E)][Xi]=supq∫∞−∞xFq(dx) and m=−[\^(E)][−Xi]=infq∫∞−∞xFq(dx).
- Any realization of X1,…,Xn does not change the distributional uncertainty of Xn+1 (a new type of `independence' ).
Our new LLN is: for each linear growth continuous function
j we have
limn→∞\^E[j(Sn/n)]=supm≤v≤[′(m)]j(v)
Namely, the distribution uncertainty of
Sn/n is, approximately,
{dv:m≤v≤[′(m)]}.
In particular, if
m=[′(m)]=0, then
Sn/n converges strongly to 0. In this case, if we assume furthermore that
[′(s)]2=[\^(E)][X2i] and
s2=−[\^(E)][−X2i],
i=1,2,…. Then we have the following generalization of the CLT:
limn→∞[j(Sn/√n)]=\^E[j(X)],L(X)∈N(0,[s2,¯s2]).
Here
N(0,[s2,[′(s)]2]) stands for a distribution uncertainty subset and
[\^(E)][j(X)] its the corresponding upper expectation. The number
[\^(E)][j(X)] can be calculated by defining
u(t,x):=[(E)][j(x+√tX)] which solves the following PDE
∂tu=G(uxx), with
G(a):=[1/2]([′(s)]2a+−s2a−).
An interesting situation is when
j is a convex function,
[\^(E)][j(X)]=E[j(X0)] with
X0∼N(0,[′(s)]2). But if
j is a concave function, then the above
[′(s)]2 has to be replaced by
s2. This coincidence can be used to explain a well-known puzzle: many practitioners, particularly in finance, use normal distributions with `dirty' data, and often with successes. In fact, this is also a high risky operation if the reasoning is not fully understood. If
s=[′(s)]=s, then
N(0,[s2,[′(s)]2])=N(0,s2) which is a classical normal distribution. The method of the proof is very different from the classical one and a very deep regularity estimate of fully nonlinear PDE plays a crucial role.
A type of combination of LLN and CLT which converges in law to a more general
N([m,[′(m)]],[s2,[′(s)]2])-distributions have been obtained. We also present our systematical research on the continuous-time counterpart of the above `G-normal distribution', called G-Brownian motion and the corresponding stochastic calculus of Itô's type as well as its applications.