ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation
Neural Information Processing Systems
We also empirically demonstrate that the knowledge of large ViTPose models can be easily transferred to small ones via a simple knowledge token.
Aug-19-2025, 21:47:39 GMT