Jacob Helwig

jacob-helwig

AI & ML interests

None yet

Recent Activity

updated a collection about 1 month ago

OPDLM

updated a collection about 1 month ago

OPDLM

updated a collection about 1 month ago

OPDLM

View all activity

Organizations

updated a collection about 1 month ago

OPDLM

Collection

Data and checkpoints for Data-Efficient Autoregressive-to-Diffusion Language Models via On-Policy Distillation • 15 items • Updated 14 days ago • 2

updated 2 datasets about 1 month ago

divelab/opdlm_eval_data

Viewer • Updated 17 days ago • 8.52k • 223

divelab/opdlm_train_data

Viewer • Updated 19 days ago • 61.8k • 115

updated 2 datasets 2 months ago

divelab/ShockCast

Updated Apr 19 • 100

divelab/Hybrid_train_new_code

Viewer • Updated Apr 18 • 21.6k • 12

published a dataset 2 months ago

divelab/Hybrid_train_new_code

Viewer • Updated Apr 18 • 21.6k • 12

updated a dataset 3 months ago

divelab/combined_gsm8k_math_dataset_dapo_math_17k_Qwen3-4B_ntokens2048_sft

Viewer • Updated Mar 21 • 186k • 31

published a dataset 3 months ago

divelab/combined_gsm8k_math_dataset_dapo_math_17k_Qwen3-4B_ntokens2048_sft

Viewer • Updated Mar 21 • 186k • 31

updated a model 4 months ago

jacob-helwig/SDAR-1.7B-Chat_kd1epoch_Qwen3-4B-Instruct-2507-GRPO-MATH-1024

2B • Updated Mar 10 • 1

published a model 4 months ago

jacob-helwig/SDAR-1.7B-Chat_kd1epoch_Qwen3-4B-Instruct-2507-GRPO-MATH-1024

2B • Updated Mar 10 • 1

updated a model 4 months ago

jacob-helwig/Qwen3-4B-Instruct-2507-GRPO-MATH-1024

4B • Updated Mar 5 • 8

published a model 4 months ago

jacob-helwig/Qwen3-4B-Instruct-2507-GRPO-MATH-1024

4B • Updated Mar 5 • 8

published a model 9 months ago

jacob-helwig/trainer_output

Updated Sep 15, 2025

published 3 models 10 months ago

jacob-helwig/Qwen2.5-0.5B-Instruct_countdown2345_grpo_balanced_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600

Updated Sep 13, 2025

jacob-helwig/Qwen2.5-1.5B-Instruct_countdown2345_grpo_balanced_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600

Updated Sep 13, 2025

jacob-helwig/Qwen2.5-7B-Instruct_countdown2345_grpo_balanced_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600

Updated Sep 13, 2025

updated a model 10 months ago

jacob-helwig/dive7_Qwen2.5-3B-Instruct_countdown2345_grpo_balanced_0.5_0.5_SEC0.3DRO1.0G0.0_minpTrue_1600

Text Generation • 242k • Updated Sep 13, 2025 • 1

Jacob Helwig

AI & ML interests

Recent Activity

Organizations

jacob-helwig's activity