Ex. 9.2

Ex. 9.2

Let A be a known k×k matrix, b be a matrix known k-vector, and z be an unknown k-vector. A Gauss-Seidel algorithm for solving the linear system of equations Az=b works by successively solving for element zj in the jth equation, fixing all other zj's at their current guesses. This process is repeated for j=1,2,...,k,1,2,...,k,..., until convergence (Matrix computations).

(a)

Consider an additive model with N observations and p terms, with the jth term to be fit by a linear smoother Sj. Consider the following system of equations:

(IS1S1S1S2IS2S2SpSpSpI)(f1f2fp)=(S1yS2ySpy).

Here each fj is an N-vector of evaluations of the jth function at the data points, and y is an N-vector of the response values. Show that backfitting is a blockwise Gauss-Seidel algorithm for solving system of equations.

(b)

Let S1 and S2 be symmetric smoothing operators (matrices) with eigenvalues in [0,1). Consider a backfitting algorithm with response vector y and smoothers S1, S2. Show that with any starting values, the algorithm converges and give a formula for the final iterates.

Soln. 9.2
(a)

For j-th equation, we have

Sjf1+...Sjfj1+fj+...+Sjfp=Sjy,

so that

fj=Sj(ykjfk)

which has the same as the second step in Algorithm 9.1. Therefore we can regard backfitting as a blockwise Gauss-Seidel algorithm.

(b)

Denote f1(k) and f2(k) as the value for the k-th iteration with initial values f1(0) and f2(0), we have

f1(k)=S1(yf2(k1))f2(k)=S2(yf1(k1)).

Therefore it's easy to derive

f2(k)=S2S1f2(k1)+S2yS2S1y=(S2S1)2f2(k2)+(S2S1+I)(S2yS2S1y)==(S2S1)kf2(0)+[(S2S1)k1++S2S1+I](S2yS2S1y).

Thus, as k, we have f2(k) converges to

(IS2S1)1(S2S2S1)y.

Similarly for f1(k), it converges to

(IS1S2)1(S1S1S2)y.