Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for Sampling