Ownlayer
  • 👋Welcome
  • Getting Started
    • 💫Quickstart
  • Iterate
    • 💡Prompt
    • 📃Dataset
      • Dataset Creation
      • Data point Evaluation
      • Dataset Evaluation
  • Monitor
    • 🔬Evaluators
      • 🖌️Create Evaluators
      • 🔌Smart Trigger an online Evaluator
      • 🎛️Evaluator Playground
    • 🧐Analyzers
  • Integrations
    • Segment
    • Amplitude
    • Posthog
  • REST API
  • Python SDK
Powered by GitBook
On this page
  • Evaluate Dataset As A Whole
  • Evaluate Dataset
  • Review Evaluator Description
  1. Iterate
  2. Dataset

Dataset Evaluation

To measure the performance of a model on a dataset, it can be done by comparing the model's predictions to some ground truth references. We currently offer BLEU, ROUGE, WER, METEOR, and Accuracy.

PreviousData point EvaluationNextEvaluators

Last updated 7 months ago

Evaluate Dataset As A Whole

Evaluate Dataset

  • Click on 'Evaluate dataset'.

  • Choose the desired dataset evaluators from the list.

Review Evaluator Description

  • Upon selection, a description of the evaluator will appear. Review this to ensure it matches your needs.

  • If everything looks right, click 'Create'.

📃