Probing the Preferences of a Language Model: Integrating Verbal and Behavioral Tests of AI Welfare

Open in new window