The CASX Framework
An integrative framework for AI existential risk assessment. (Leiss & Smith, 2026)
Capability
Demonstrable ability to solve novel problems across diverse domains without specific training.
Autonomy
The degree of goal-pursuit and multi-step planning without human-in-the-loop (HITL).
Scale
Breadth of deployment (instances) and absolute computational resources utilized.
Access
Connectivity to critical infrastructure, financial networks, or physical actuators.
OpenAI o3-High
Verified Ph.D. level reasoning. Outperforms human experts on the GPQA Diamond benchmark.
Claude 4.5 + SDK
Demonstrated "Computer Use" autonomy; solves 80%+ of complex software engineering bugs without oversight.
DeepSeek-V3 / R1
Proved that "Infinite Scale" is possible via high-efficiency hardware co-design and open-weight distribution.
The Existential Cliff
"We have no idea whether we can stay in control of digital beings more intelligent than ourselves."
Intl. AI Safety Report
A global synthesis of risks from "Agentic AI" and the urgency of technical safeguards for autonomy.
Non-Agentic Paths
Bengio's proposal to maximize Capability while deliberately minimizing Autonomy and Access to ensure safety.