Understanding and Improving Layer Normalization
Jingjing Xu, Xu Sun, Zhiyuan Zhang, Guangxiang Zhao, Junyang Lin
–Neural Information Processing Systems
Neural Information Processing Systems
Mar-23-2025, 01:29:21 GMT
Jingjing Xu, Xu Sun, Zhiyuan Zhang, Guangxiang Zhao, Junyang Lin
–Neural Information Processing Systems
Neural Information Processing Systems
Mar-23-2025, 01:29:21 GMT