MMeViT: Multi-Modal ensemble ViT for Post-Stroke Rehabilitation Action Recognition