Anthropic Releases Version 3.0 of Responsible Scaling Policy
AI Impact Summary
Anthropic is releasing version 3.0 of its Responsible Scaling Policy, a voluntary framework designed to mitigate catastrophic risks from AI systems. This policy update reflects learnings from the past two years, including the increasing capabilities of models and the slow pace of government action on AI safety. The updated policy reinforces existing safeguards, improves upon previous versions, and introduces new measures for transparency and accountability, particularly around capability thresholds.
Business Impact
Anthropic's updated Responsible Scaling Policy aims to proactively manage AI risks and influence industry standards, but faces challenges due to slow government action and ambiguous capability thresholds.
Models affected
- newmodel
Claude 3.5 Sonnet
- active
- Date
- 24 Feb 2026
- Change type
- policy
- Severity
- high