Evaluate and benchmark your LLM applications for quality, safety, and performance.
4 articles
Introduction to the LLM evaluation platform and key concepts.
Create and run evaluation experiments to test your models.
Upload, browse, and manage evaluation datasets.
Set up evaluation metrics and scoring thresholds.