XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Russ R. Salakhutdinov, Quoc V. Le
–Neural Information Processing Systems
Neural Information Processing Systems
Mar-27-2025, 01:07:19 GMT