Perceive, Ground, Reason, and Act: A Benchmark for General-purpose Visual Representation