Goto

Collaborating Authors

 urlhttp



LimitstoDepth-EfficienciesofSelf-Attention

Neural Information Processing Systems

Self-attention architectures, which are rapidly pushing the frontier innatural language processing, demonstrate asurprising depth-inefficient behavior: previous works indicate that increasing the internal representation (network width) isjust as useful as increasing the number of self-attention layers (network depth).




1d8dc55c1f6cf124af840ce1d92d1896-Paper-Conference.pdf

Neural Information Processing Systems

As inthe classical problem, weights are fixed by an adversary and elements appear in random order. In contrast to previous variants of predictions, our algorithm only has access toamuch weakerpiece ofinformation: anadditive gapc.


UniBench: VisualReasoningRequiresRethinking Vision-LanguageBeyondScaling

Neural Information Processing Systems

Wefind that while scaling training data ormodel size can boost many vision-language model capabilities, scaling offers little benefit for reasoning or relations. Surprisingly, we also discover today's best VLMs struggle on simple digit recognition and counting tasks, e.g. MNIST, which much simpler networks can solve.




Low-rank Interaction with Sparse Additive Effects Model for Large Data Frames

Geneviève Robin, Hoi-To Wai, Julie Josse, Olga Klopp, Eric Moulines

Neural Information Processing Systems

In this paper, we introduce alowrank interaction and sparse additive effects(LORIS) model which combines matrix regression on a dictionary and low-rank design, to estimate main effects andinteractions simultaneously.


Prior-Free Dynamic Auctions with Low Regret Buyers

Yuan Deng, Jon Schneider, Balasubramanian Sivan

Neural Information Processing Systems

We study the problem of how to repeatedly sell to a buyer running a no-regret,mean-based algorithm. Previous work [Braverman et al., 2018] shows that it ispossible to design effective mechanisms in such a setting that extract almost allof the economic surplus, but these mechanisms require the buyer's values each