MLLM-Bench, Evaluating Multi-modal LLMs using GPT-4V