Unifying (Machine) Vision via Counterfactual World Modeling