Goto

Collaborating Authors

 xti


A Smoothed Analysis of the Greedy Algorithm for the Linear Contextual Bandit Problem

Sampath Kannan, Jamie H. Morgenstern, Aaron Roth, Bo Waggoner, Zhiwei Steven Wu

Neural Information Processing Systems

Wegiveasmoothed analysis, showing that evenwhen contexts may be chosen by an adversary, small perturbations of the adversary's choices suffice for the algorithm to achieve "no regret", perhaps (depending on the specifics of the setting) with a constant amount of initial training data.



dececdcbf0ea0162234a8fb4ab051415-Supplemental-Conference.pdf

Neural Information Processing Systems

Thus,γ(ω) (0,1] for ω (0,1], which meets the algorithm design requirement. Algorithm 2 actually performs the gradient descent scheme on the function ˆfti(x) = Eu B[fti(x+ϵu)] restricted to the convex set(1 ζ)K.


OnlineMultitaskLearningwithLong-TermMemory

Neural Information Processing Systems

Associatedwitheach segment is a hypothesis from some hypothesis class. We give algorithms that are designed to exploit the scenario where there are many such segments but significantly fewer associated hypotheses.


All-or-nothingstatisticalandcomputationalphase transitionsinsparsespikedmatrixestimation

Neural Information Processing Systems

Similarly the ISOMAP face database consists ofimages (256levels ofgray)ofsize64 64,i.e.,vectors in R4096, whereas the correct intrinsic dimension is only3 (for the vertical, horizontal pause and lightingdirection). The second approach, is anaverage caseapproach (in the spirit of thestatistical mechanics treatment ofhighdimensional systems), thatmodelsfeaturevectorsby arandom ensemble,taken as aset ofrandom vectors with independently identically distributed (i.i.d.) components, and a small but xed fraction of non-zero components.



OntheConvergenceofStepDecayStep-Sizefor StochasticOptimization

Neural Information Processing Systems

Step decay step-size schedules (constant and then cut) are widely used in practice because of their excellent convergence and generalization qualities, but their theoretical properties are not yet well understood. Weprovide convergence results for step decay in the non-convexregime, ensuring that the gradient norm vanishes at an O(lnT/ T)rate.


Bayesian-guidedLabelMappingforVisual Reprogramming

Neural Information Processing Systems

However, in this paper, we reveal that one-to-one mappings may overlook the complex relationship between pretrained and downstream labels.



HybridRegretBoundsforCombinatorial Semi-BanditsandAdversarialLinearBandits

Neural Information Processing Systems

Theformer means that the algorithm will work nearly optimally in all environments in an adversarial setting, a stochastic setting, or a stochastic setting with adversarial corruptions.