Text Generation
Transformers
Safetensors
step3p5
conversational
custom_code
Eval Results

YAML Metadata Error:Invalid content in Eval Result file .eval_results/hle.yaml

Check out the documentation for more information.

Show details
Task ID "hle" does not match any task in dataset "cais/hle". Available: none
hzwer's picture
Add evaluation results from Step 3.5 Flash paper
ab446a3
raw
history blame contribute delete
206 Bytes
- dataset:
id: cais/hle
task_id: hle
value: 23.1
date: '2026-02-11'
source:
url: https://arxiv.org/abs/2602.10604
name: Step 3.5 Flash Paper
user: SaylorTwift
notes: "Text Only"