Abstract
We consider the recovery of low-rank matrices from noisy data by shrinkage of singular values, in which a single, univariate nonlinearity is applied to each of the empirical singular values. We adopt an asymptotic framework, in which the matrix size is much larger than the rank of the signal matrix to be recovered, and the signal-To-noise ratio of the low-rank piece stays constant. For a variety of loss functions, including Mean Square Error (MSE)-(square Frobenius norm), the nuclear norm loss and the operator norm loss, we show that in this framework there is a well-defined asymptotic loss that we evaluate precisely in each case. In fact, each of the loss functions we study admits a unique admissible shrinkage nonlinearity dominating all other nonlinearities. We provide a general method for evaluating these optimal nonlinearities, and demonstrate our framework by working out simple, explicit formulas for the optimal nonlinearities in the Frobenius, nuclear and operator norm cases. For example, for a square low-rank n-by-n matrix observed in white noise with level σ, the optimal nonlinearity for MSE loss simply shrinks each data singular value y to-y2 ? 4nσ2 (or to 0 if y < 2 √ nσ ). This optimal nonlinearity guarantees an asymptotic MSE of 2nrσ2, which compares favorably with optimally tuned hard thresholding and optimally tuned soft thresholding, providing guarantees of 3nrσ2 and 6nrσ 2, respectively. Our general method also allows one to evaluate optimal shrinkers numerically to arbitrary precision. As an example, we compute optimal shrinkers for the Schatten-p norm loss, for any p > 0.
Original language | English |
---|---|
Article number | 7820186 |
Pages (from-to) | 2137-2152 |
Number of pages | 16 |
Journal | IEEE Transactions on Information Theory |
Volume | 63 |
Issue number | 4 |
DOIs | |
State | Published - Apr 2017 |
Bibliographical note
Publisher Copyright:© 1963-2012 IEEE.
Keywords
- Matrix denoising
- Schatten norm loss
- low-rank matrix estimation
- nuclear norm loss
- optimal shrinkage
- singular value shrinkage
- spiked model
- unique admissible