Do Neural Optimal Transport Solvers Work? A Continuous Wasserstein-2 Benchmark