🔬 Evaluators

Evaluation is critical for scoring the performance of AI applications and curating well-labeled datasets.

Ownlayer offers 17 kinds of evaluators, covering basic search, JSON validation, and custom evaluators that use an LLM as the judge. If you're not sure which evaluator fits best, we recommend starting in the Playground, where you can try evaluators against your own data format to find the most suitable ones.
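To make two of these categories concrete, here is a minimal sketch of what a basic search check and a JSON validation check might look like. This is not Ownlayer's API: the function names `contains_evaluator` and `json_valid_evaluator` are hypothetical and written in plain Python for illustration only.

```python
import json


def contains_evaluator(prediction: str, keyword: str) -> bool:
    """Basic search (hypothetical): pass if the keyword appears in the
    prediction, ignoring case."""
    return keyword.lower() in prediction.lower()


def json_valid_evaluator(prediction: str) -> bool:
    """JSON validation (hypothetical): pass if the prediction parses as JSON."""
    try:
        json.loads(prediction)
    except json.JSONDecodeError:
        return False
    return True
```

Both checks return a plain pass/fail value, which keeps them cheap to run over a whole dataset before reaching for an LLM-based judge.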

Variables

Attribute      Description
Input          Model input
Prediction     Model output
Prediction_b   A different model output from the same input
Reference      Ground truth
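As a rough illustration of how these four variables relate to one another, the sketch below models one evaluation record and two simple checks over it. The `EvalSample` class and both functions are assumptions made for this example, not part of Ownlayer; only the field names mirror the variables listed above.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class EvalSample:
    """One evaluation record (hypothetical container, not an Ownlayer type)."""
    input: str                          # Input: model input
    prediction: str                     # Prediction: model output
    prediction_b: Optional[str] = None  # Prediction_b: a different output for the same input
    reference: Optional[str] = None     # Reference: ground truth


def exact_match(sample: EvalSample) -> bool:
    """Compare the prediction with the reference, ignoring surrounding whitespace."""
    if sample.reference is None:
        raise ValueError("exact_match needs a reference")
    return sample.prediction.strip() == sample.reference.strip()


def pairwise_exact(sample: EvalSample) -> str:
    """Report which of the two outputs matches the reference, or 'tie'."""
    if sample.reference is None or sample.prediction_b is None:
        raise ValueError("pairwise_exact needs prediction_b and a reference")
    ref = sample.reference.strip()
    a_ok = sample.prediction.strip() == ref
    b_ok = sample.prediction_b.strip() == ref
    if a_ok == b_ok:
        return "tie"
    return "prediction" if a_ok else "prediction_b"
```

A reference-based check only needs Prediction and Reference, while a pairwise comparison also needs Prediction_b, which is why the optional fields default to `None` in this sketch.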
