MLCommons
datasetactive

MLCommons AILuminate AI Safety Benchmark

View original resource

MLCommons’ AILuminate benchmark assesses the safety of general-purpose chat models across a broad set of hazard categories, providing standardized, third-party safety grades to complement capability-focused benchmarks.

Tags

benchmarksafetyevaluationMLCommons

At a glance

Published

2025

Jurisdiction

Global

Category

Datasets and benchmarks

Access

Public access

Build your AI governance program

VerifyWise helps you implement AI governance frameworks, track compliance, and manage risk across your AI systems.

MLCommons AILuminate AI Safety Benchmark | VerifyWise AI Governance Library