Goto

Collaborating Authors

 Education






SAGDA: AchievingO(2)Communication ComplexityinFederatedMin-MaxLearning

Neural Information Processing Systems

Compared with conventional minimization problems (e.g., empirical risk minimization), min-max optimization has aricher mathematical structure, thus being able tomodel more sophisticated learning problems thatemergefrom ever-emerging applications.



LearningPhysicalConstraintswith NeuralProjections

Neural Information Processing Systems

How does a human being distinguish the motions of a piece of paper and a piece of cloth? A high-school physics teacher might answer that they are both tangentially inextensible but cloth cannot resist any bending force from the normal direction.


12d286282e1be5431ea05262a21f415c-Paper-Conference.pdf

Neural Information Processing Systems

Knowledge distillation (KD) has been widely used to improve the test accuracy of a "student" network, by training it to mimic the soft probabilities of a trained