Dataset Evaluation
To measure a model's performance on a dataset, compare the model's predictions to ground-truth references. We currently offer the following evaluators: BLEU, ROUGE, WER, METEOR, and Accuracy.
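To make "comparing predictions to references" concrete, here is a minimal sketch of two of the listed metrics, Accuracy and WER, in plain Python. It is an illustration only and not necessarily how the built-in evaluators are implemented: the function names `accuracy` and `word_error_rate` are hypothetical, accuracy is assumed to mean exact match, and WER is assumed to use whitespace tokenization.

```python
def accuracy(predictions, references):
    """Fraction of predictions that exactly match their reference (assumed definition)."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        raise ValueError("cannot score an empty dataset")
    matches = sum(p == r for p, r in zip(predictions, references))
    return matches / len(references)


def word_error_rate(prediction, reference):
    """Word-level edit distance divided by the number of reference words."""
    hyp, ref = prediction.split(), reference.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # previous[j] = edit distance between the reference words seen so far and hyp[:j]
    previous = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = previous[j - 1] + (ref_word != hyp_word)
            deletion = previous[j] + 1      # reference word missing from the prediction
            insertion = current[j - 1] + 1  # extra word in the prediction
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1] / len(ref)


if __name__ == "__main__":
    preds = ["yes", "no", "yes", "no"]
    refs = ["yes", "no", "no", "no"]
    print("accuracy:", accuracy(preds, refs))
    print("WER:", word_error_rate("the cat sat", "the cat sat down"))
```

Higher accuracy is better, while lower WER is better, so the two scores should not be read on the same scale.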
Evaluate Dataset As A Whole
Evaluate Dataset
Click on 'Evaluate dataset'.
Choose the desired dataset evaluators from the list.
Review Evaluator Description
Upon selection, a description of the evaluator will appear. Review this to ensure it matches your needs.
If everything looks right, click 'Create'.
