Dataset Evaluation
To measure the performance of a model on a dataset, it can be done by comparing the model's predictions to some ground truth references. We currently offer BLEU, ROUGE, WER, METEOR, and Accuracy.
Last updated
To measure the performance of a model on a dataset, it can be done by comparing the model's predictions to some ground truth references. We currently offer BLEU, ROUGE, WER, METEOR, and Accuracy.
Click on 'Evaluate dataset'.
Choose the desired dataset evaluators from the list.
Upon selection, a description of the evaluator will appear. Review this to ensure it matches your needs.
If everything looks right, click 'Create'.

Last updated