Ex. 5.12

Ex. 5.12

Characterize the solution to the following problem,

(1)minfRSS(f,λ)=i=1Nωi{yif(xi)}2+λ{f(t)2}dt,

where the ωi0 are observation weights.

Characterize the solution to the smoothing spline problem (5.9) when the training data have ties in X.

Soln. 5.12

Following the same arguments in Ex. 5.7, the solution to (1) is a cubic spline with knots at the unique values of {xi,i=1,...,N} and fitted spline can be represented as

f^(x)=j=1NNj(xj)θ^j

where

θ^=(NTWN+λΩN)1NTWy

and W=diag(ω1,...,ωn).

Suppose we group all the training data into n groups. Each group i contains ni1 training data which have the same xi and let y¯i be their yi's average. So the first summand in (5.9) can be rewritten as

i=1njni(yjf(xi))2=i=1njni(yjy¯i+y¯if(xi))2=i=1njni[(yjy¯i)2+(y¯if(xi))2+2(yjy¯i)(y¯if(xi))]

Note that

i=1njni(y¯if(xi))2=i=1nni(y¯if(xi))2,

and

i=1njni(yjy¯i)(y¯if(xi))=0

and i=1njni(yjy¯i)2 is independent of f. So minimizing (5.9) in text is equivalent to minimizing

i=1nni(y¯if(xi))2+λf(t)2dt,

which is treated by the first half of this problem.