Robust Pose Estimation in Crowded Scenes with Direct Pose-Level Inference Supplementary Materials