Ex. 15.4

Ex. 15.4

Suppose xi,i=1,...,N are iid (μ,σ2). Let x¯1 and x¯2 be two bootstrap realizations of the sample mean. Show that the sampling correlation corr(x¯1,x¯2)=n2n150%. Along the way, derive var(x¯1) and the variance of the bagged mean x¯bag. Here x¯ is a linear statistic; bagging produces no reduction in variance for linear statistics.

Soln. 15.4

Denote

x¯1=1ni=1nx^i,  x¯2=1ni=1nx~i,

where {x^i,i=1,...,n} and {x~i,i=1,...,n} are realizations from the first and the second bootstrap respectively.

Note that both x^i and x~i are sampled from the empirical distribution of {xi,i=1,...,n}.

Therefore, for any 1in, we have

E[x^i]=E[x~i]=μ,var(x^i)=var(x~i)=σ2.

Also, for any 1i,jn, we have

cov(x^i,x~j)=σ2n.

Then, we know

cov(x¯1,x¯2)=1n2(i,jncov(x^i,x~j))=σ2n,

and

var(x¯1)=var(1ni=1nx~i)=1n2(i=1nvar(xi)+jkncov(x^j,x^k))=1n2(nσ2+(n2n)σ2n)=(2n1)σ2n2.

Therefore we know that

corr(x¯1,x¯2)=cov(x¯1,x¯2)var(x¯1)var(x¯2)=n2n1.

We have already derived var(x¯1) above. For x¯bag, assume we have B realizations, then

var(x¯bag)=var(1Bi=1Bx¯i)=1B2i=1Bvar(x¯i)+1B2jkBcov(x¯j,x¯k)=1B(2n1)σ2n2+B1Bσ2n=(2n1)+(B1)nBn2σ2.