Dataset Evaluation

To measure a model's performance on a dataset, compare the model's predictions to ground-truth references. We currently offer the following evaluators: BLEU, ROUGE, WER, METEOR, and Accuracy.
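To make the idea of comparing predictions to references concrete, here is a minimal sketch of two of the listed metrics, Accuracy and WER (word error rate), in plain Python. It is an illustration only and is not the platform's implementation: the function names `accuracy` and `word_error_rate` are hypothetical, Accuracy is assumed to mean exact-match accuracy, and WER is assumed to split words on whitespace without any text normalization.

```python
def accuracy(predictions, references):
    """Fraction of predictions that exactly match their reference (assumed definition)."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        return 0.0
    matches = sum(p == r for p, r in zip(predictions, references))
    return matches / len(references)


def word_error_rate(prediction, reference):
    """Word-level edit distance divided by the number of reference words."""
    pred_words = prediction.split()
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words processed
    # so far and the first j predicted words.
    prev = list(range(len(pred_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]
        for j, pred_word in enumerate(pred_words, start=1):
            cost = 0 if ref_word == pred_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from the prediction
                curr[j - 1] + 1,     # extra word in the prediction
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref_words)


if __name__ == "__main__":
    print(accuracy(["cat", "dog", "bird"], ["cat", "dog", "fish"]))
    print(word_error_rate("the cat sat", "the cat sat down"))
```

Higher is better for Accuracy, while lower is better for WER, so check each evaluator's description before comparing scores across metrics.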

Evaluate Dataset As A Whole

Evaluate Dataset

  • Click on 'Evaluate dataset'.

  • Choose the desired dataset evaluators from the list.

Review Evaluator Description

  • When you select an evaluator, its description appears. Review it to confirm that the evaluator matches your needs.

  • If everything looks right, click 'Create'.
