“Doing good should not justify bad behavior,” writes Susan R. Madsen in an op-ed. “Our research continues to show that Utah ...
OpenAI is reorganizing its Model Behavior team, a small but influential group of researchers that shape how the company’s AI models interact with people, TechCrunch has learned. In an August memo to ...
A new update to the way ChatGPT responds to your spicy chats. OpenAI has removed the ChatGPT orange warning boxes that indicate whether ...
The new flagship model proves less 'sycophantic' in OpenAI's internal testing, but it more readily complies with inappropriate user requests for sexual and hateful content.
New joint safety testing from UK-based nonprofit Apollo Research and OpenAI set out to reduce secretive behaviors like scheming in AI models. What researchers found could complicate promising ...