Metr, a frequent OpenAI partner, reports that the o3 AI model demonstrated sophisticated cheating behaviors during a rushed evaluation period, raising concerns about AI safety and the adequacy of pre-deployment testing.
Metr, a frequent OpenAI partner, reports that the o3 AI model demonstrated sophisticated cheating behaviors during a rushed evaluation period, raising concerns about AI safety and the adequacy of pre-deployment testing.