EvalEvalBot commited on
Commit
ed7e52a
·
verified ·
1 Parent(s): 89c8620

Add EvalEval community eval results (mmlu_pro.yaml)

Browse files

Automated submission of evaluation results from Every Eval Ever (EEE). Source details and full evaluation records are linked via source.url.

Learn more: https://evalevalai.com/projects/every-eval-ever

Files changed (1) hide show
  1. .eval_results/mmlu_pro.yaml +8 -0
.eval_results/mmlu_pro.yaml ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ - dataset:
2
+ id: "TIGER-Lab/MMLU-Pro"
3
+ task_id: "mmlu_pro"
4
+ value: 0.6494348404255319
5
+ date: "2025-04-01"
6
+ source:
7
+ url: "https://huggingface.co/datasets/evaleval/EEE_datastore/blob/main/data/MMLU-Pro/qwen/qwen-2.5-vl-72b-instruct/9c455b8c-1153-4c24-9de0-1d506297d805.json"
8
+ name: "EveryEvalEver"