ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models