Stanford HAI's annual index chapter on technical performance, tracking benchmark progress for reasoning, coding, and tool-using agents. Covers capability jumps on SWE-bench, GAIA, and WebArena plus compute and cost trends across frontier model families.
Published: 2026
Jurisdiction: International
Category: Foundations
Access: Public access