Is AI really trying to escape human control and blackmail people?
Opinion: Theatrical testing scenarios explain why AI models produce alarming outputs—and why we fall for it. In June, headlines read like science fiction: AI models “blackmailing” engineers and “sabotaging” shutdown commands. Simulations of these events did occur in highly contrived testing scenarios designed to elicit these responses—OpenAI’s o3 model…