HighPolicy

Anthropic Releases Version 3.0 of Responsible Scaling Policy

AI Impact Summary

Anthropic is releasing version 3.0 of its Responsible Scaling Policy, a voluntary framework designed to mitigate catastrophic risks from AI systems. This policy update reflects learnings from the past two years, including the increasing capabilities of models and the slow pace of government action on AI safety. The updated policy reinforces existing safeguards, improves upon previous versions, and introduces new measures for transparency and accountability, particularly around capability thresholds.

Business Impact

Anthropic's updated Responsible Scaling Policy aims to proactively manage AI risks and influence industry standards, but faces challenges due to slow government action and ambiguous capability thresholds.

Models affected

new
Claude 3.5 Sonnet
model
active

Date: 24 Feb 2026
Change type: policy
Severity: high

Anthropic Releases Version 3.0 of Responsible Scaling Policy

More from Anthropic

Get alerts for Anthropic