EvalAI: Towards Better Evaluation Systems for AI Agents