Arize AI
toolactive

Agent Evaluation

Arize AI

View original resource

Arize AI documentation covering trajectory evaluation, tool-call correctness, and outcome scoring for production agents. Walks through LLM-as-judge evals, custom metrics, and linking evaluation runs back to specific spans in OpenTelemetry traces.

Tags

agentic AIevaluation

At a glance

Published

2025

Jurisdiction

Global

Category

Evaluation and benchmarks

Access

Public access

Build your AI governance program

VerifyWise helps you implement AI governance frameworks, track compliance, and manage risk across your AI systems.

Agent Evaluation | VerifyWise AI Governance Library