In a proposal released today, more than 30 prominent AI researchers, including some who found the GPT-3.5 flaw, say that many other vulnerabilities affecting popular models are reported in problematic ways. They propose a new scheme, supported by AI companies, that would give outsiders permission to probe their models and provide a way to disclose flaws publicly.
“Right now it's a little bit of the Wild West,” says Shayne Longpre, a PhD candidate at MIT and the lead author of the proposal. Longpre says that some so-called jailbreakers share their methods of breaking AI safeguards on the social media platform X, leaving models and users at risk. Other jailbreaks are shared with only one company even though they might affect many. And some flaws, he says, are kept secret for fear of getting banned or facing prosecution for breaking terms of use. “It is clear that there are chilling effects and uncertainty,” he says.