Text Generation
Transformers
Safetensors
step3p5
conversational
custom_code
Eval Results

YAML Metadata Error:Invalid content in Eval Result file .eval_results/mmlu_pro.yaml

Check out the documentation for more information.

Show details
Task ID "mmlu_pro" does not match any task in dataset "TIGER-Lab/MMLU-Pro". Available: none
Step-3.5-Flash / .eval_results /mmlu_pro.yaml
hzwer's picture
Add evaluation results from Step 3.5 Flash paper
ab446a3
raw
history blame contribute delete
200 Bytes
- dataset:
id: TIGER-Lab/MMLU-Pro
task_id: mmlu_pro
value: 84.4
date: '2026-02-11'
source:
url: https://arxiv.org/abs/2602.10604
name: Step 3.5 Flash Paper
user: SaylorTwift