Korbak et al., UK AI Security Institute
researchactive

How to Evaluate Control Measures for LLM Agents? A Trajectory from Today to Superintelligence

Korbak et al., UK AI Security Institute

View original resource

Korbak et al. (UK AISI) propose a methodology for evaluating AI-control measures against increasingly capable LLM agents, using red-team protocols and capability elicitation. Introduces a trajectory from current models to hypothetical superintelligent agents.

Tags

agentic AIevaluation

At a glance

Published

2025

Jurisdiction

United Kingdom

Category

Evaluation and benchmarks

Access

Public access

Build your AI governance program

VerifyWise helps you implement AI governance frameworks, track compliance, and manage risk across your AI systems.

How to Evaluate Control Measures for LLM Agents? A Trajectory from Today to Superintelligence | VerifyWise AI Governance Library