Ridge Regression and Provable Deterministic Ridge Leverage Score Sampling

Shannon McCurdy

Neural Information Processing Systems

While ridge regression provides shrinkage for the regression coefficients, many of the coefficients remain small but non-zero. Performing ridge regression with the matrix sketch returned by our algorithm and a particular regularization parameter forces coefficients to zero and has a provable (1+ε) bound on the statistical risk.
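
The first claim, that ridge shrinks coefficients without driving them exactly to zero, can be illustrated with a small sketch. This is not the paper's sketching algorithm; it is only a closed-form ridge fit for two features, and the data, the helper name `ridge_2d`, and the regularization value 10.0 are assumptions chosen for illustration.

```python
def ridge_2d(X, y, lam):
    """Closed-form ridge solution for exactly two features, no intercept.

    Solves (X^T X + lam * I) w = X^T y with an explicit 2x2 inverse.
    Hypothetical helper for illustration only.
    """
    # Entries of the regularized Gram matrix X^T X + lam * I
    a = sum(r[0] * r[0] for r in X) + lam
    b = sum(r[0] * r[1] for r in X)
    d = sum(r[1] * r[1] for r in X) + lam
    # Entries of X^T y
    u = sum(r[0] * t for r, t in zip(X, y))
    v = sum(r[1] * t for r, t in zip(X, y))
    det = a * d - b * b
    return ((d * u - b * v) / det, (a * v - b * u) / det)


# Toy data (assumed): y = 2 * x0 + 0.3 * x1, with no noise
X = [(1.0, 0.5), (2.0, -0.5), (3.0, 1.0), (4.0, -1.0)]
y = [2.0 * x0 + 0.3 * x1 for x0, x1 in X]

w_ols = ridge_2d(X, y, lam=0.0)     # ordinary least squares
w_ridge = ridge_2d(X, y, lam=10.0)  # ridge with an arbitrary penalty

print("OLS coefficients:  ", w_ols)
print("Ridge coefficients:", w_ridge)
```

With the penalty applied, both coefficients shrink in magnitude, but the second one stays non-zero rather than being removed, which is the behavior the abstract contrasts with its sketch-based approach.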


4d5b995358e7798bc7e9d9db83c612a5-AuthorFeedback.pdf

Neural Information Processing Systems

However, in light of stochastic optimization, we argue that our random permutation does not seem problematic. For the imperfect ground truth experiments, SURE requires a known σnoisy, whereas our eSURE requires a known σgt (otherwise, they do not work). In Tables 2 and 3, σnoisy and σgt are given in the second and third rows, respectively. However, your comment is correct in a practical sense, so we trained one deep neural network with varying σgt ∈ [1, 10] and σnoisy ∈ [10.1, 55] for blind color image denoising and tested it on images with a fixed noise level (just like Table 1), as shown in the table below. To Reviewer 2: It is indeed a good idea to be more explicit in some explanations for easier understanding.


c32319f4868da7613d78af9993100e42-Paper-Conference.pdf

Neural Information Processing Systems

Learned representations are a central component in modern ML systems, serving a multitude of downstream tasks. When training such representations, it is often the case that computational and statistical constraints for each downstream task are unknown. In this context, rigid fixed-capacity representations can be either over or under-accommodating to the task at hand.



8248b1ded388fcdbbd121bcdfea3068c-Paper-Conference.pdf

Neural Information Processing Systems

Broadly, a neural network will be better at learning to execute a reasoning task (in terms of sample complexity) if its individual components align well with the target algorithm.




7261925973c9bf0a74d85ae968a57e5f-AuthorFeedback.pdf

Neural Information Processing Systems

Overall, we argue that stability enforcement (through dynamics and/or plasticity) seems reasonable given current neuroscientific theories (e.g., Zenke, Ganguli & Gerstner 2017, "The temporal paradox of Hebbian learning and homeostatic plasticity."). Lastly, R3's mention that the pre-activation variable a_l doesn't appear in the update can be readily explained.