Beyond Perception: Evaluating Abstract Visual Reasoning through Multi-Stage Task