🔬Evaluators
Evaluation is a critical to score the performance of AI applications and curat well-labeled datasets
Last updated
Evaluation is a critical to score the performance of AI applications and curat well-labeled datasets
Last updated
Ownlayer offers 17 different kinds of evaluators covering basic search, JSON validation, and custom evaluators that leverage LLM as the judge. If you're not sure about what is the best, we recommend you start from Playground where you can test for the most suitable ones based on your data format.
Attribute | Description |
---|---|
Input
Model input
Prediction
Model output
Prediction_b
A different model output from the same input
Reference
Ground truth