Goto

Collaborating Authors

 Education



A Simple and Adaptive Learning Rate for FTRL in Online Learning with Minimax Regret of Θ(T2 / 3) and its Application to Best-of-Both-Worlds

Neural Information Processing Systems

Follow-the-Regularized-Leader (FTRL) is a powerful framework for various online learning problems. By designing its regularizer and learning rate to be adaptive to past observations, FTRL is known to work adaptively to various properties of an underlying environment.





Understanding the Gains from Repeated Self-Distillation

Neural Information Processing Systems

Self-Distillation is a special type of knowledge distillation where the student model has the same architecture as the teacher model.