Private Everlasting Prediction

Neural Information Processing Systems

We explore prediction as an alternative to learning. A predictor answers a stream of classification queries instead of outputting a hypothesis.


Neural Information Processing Systems

We create GTA (a benchmark for General Tool Agents) to evaluate the general tool-use ability of LLMs in real-world scenarios. Who created the dataset (e.g., which team, research group) and on behalf of which entity (e.g., company, institution, organization)?



Model-Free Active Exploration in Reinforcement Learning

Neural Information Processing Systems

We study the problem of exploration in Reinforcement Learning and present a novel model-free solution. We adopt an information-theoretical viewpoint and start from the instance-specific lower bound of the number of samples that have to be collected to identify a nearly-optimal policy.