Stanford HAI's annual index chapter on technical performance, tracking benchmark progress for reasoning, coding, and tool-using agents. Covers capability jumps on SWE-bench, GAIA, and WebArena plus compute and cost trends across frontier model families.
Tags
agentic AIfoundations
At a glance
Published
2026
Jurisdiction
International
Category
Foundations
Access
Public access
Build your AI governance program
VerifyWise helps you implement AI governance frameworks, track compliance, and manage risk across your AI systems.