ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation

Neural Information Processing Systems 

We also empirically demonstrate that the knowledge of large ViTPose models can be easily transferred to small ones via a simple knowledge token.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found