Demonstrating specification gaming in reasoning models

Open in new window