The Neural Testbed: Evaluating Joint Predictions