Ex. 14.18
Ex. 14.18
Consider the regularized log-likelihood for the density estimation problem arising in ICA,
The solution \(\hat g\) is a quartic smoothing spline, and can be written as \(\hat g(s) = \hat q(s) + \hat{q}_\bot (s)\), where q is a quadratic function (in the null space of the penalty). Let \(q(s) = \theta_0 + \theta_1s + \theta_2s^2\). By examining the stationarity conditions for \(\theta^k\), \(k = 1, 2, 3\), show that the solution \(\hat f = \phi e^{\hat g}\) is a density, and has mean zero and variance one. If we used a second-derivative penalty \(\int \{g^{''}(t)\}^2(t)dt\) instead, what simple modification could we make to the problem to maintain the three moment conditions?
Soln. 14.18
Considering the stationarity condition, if we differentiate the log-likelihood w.r.t to \(\theta_0\), we should get zero at the optimal value \(\hat\theta_0\). By the assumption of \(\hat g(s)\), the differential is
therefore we know \(\hat f= \phi e^{\hat g}\) is a density.
Similarly for \(\theta_1\), the differential is
since \(S\) is assumed to have zero mean.
At last, for \(\theta_2\), the differential is
Since both \(S\) and \(\hat f\) have zero mean, we know that the variance of \(\hat f\) equals the variance of \(S\), which is assumed to be 1.
If we use a second-derivative penalty instead, then \(\hat g\) becomes a cubic smoothing spline, and the first two conditions (\(\hat f\) is a density and has zero mean) are still preserved via the same arguments above. We can modify the problem by adding a Lagrangian multiplier to accommodate the constraint for the variance.