OpenAI and Anthropic are making tweaks to their chatbots that they say will make them safer for teens. OpenAI has updated its guidelines on how ChatGPT should interact with users between the ages of 13 and 17, while Anthropic is working on a new way to identify whether someone might be underage.
On Thursday, OpenAI announced that ChatGPT’s Model Spec — the guidelines for how its chatbot should behave — will include four new principles for users under 18. Now, it aims to have ChatGPT “put teen safety first, even when it may conflict with other goals.” That means guiding teens toward safer options when other user interests, like “maximum intellectual freedom,” conflict with safety concerns.
The Model Spec also says ChatGPT should “promote real-world support,” including by encouraging offline relationships, and lays out how the chatbot should set clear expectations when interacting with younger users. It says ChatGPT should “treat teens like teens” by offering “warmth and respect” instead of giving condescending answers or treating teens like adults.
OpenAI says the update to ChatGPT’s Model Spec should result in “stronger guardrails, safer alternatives, and encouragement to seek trusted offline support when conversations move into higher-risk territory.” The company adds that ChatGPT will urge teens to contact emergency services or crisis resources if there are signs of “imminent risk.”
Along with this change, OpenAI says it’s in the “early stages” of launching an age prediction model designed to estimate how old a user is. If the system detects that someone may be under 18, OpenAI will automatically apply teen safeguards, and adults who are incorrectly flagged will have the chance to verify their age.
Anthropic is rolling out similar measures: it’s developing a new system capable of detecting “subtle conversational signs that a user might be underage” during conversations with its AI chatbot, Claude. The company will disable accounts that are confirmed to belong to users under 18, and it already flags users who self-identify as minors during chats.
Anthropic also outlines how it trains Claude to respond to prompts about suicide and self-harm, as well as its progress in reducing sycophancy, which can reinforce harmful thinking. The company says its latest models “are the least sycophantic of any to date,” with Haiku 4.5 performing best: it corrected its sycophantic behavior 37 percent of the time.
“On face value, this evaluation shows there is significant room for improvement for all of our models,” Anthropic says. “We think the results reflect a trade-off between model warmth or friendliness on the one hand, and sycophancy on the other.”
