AI & ML interests
None defined yet.
Recent Activity
uiuc-kang-lab/Qwen3-235B-A22B-Instruct-BIRD-Plat-2k-CISPO
Updated
uiuc-kang-lab/test-lora-upload
Updated
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.1-epoch-3
8B
•
Updated
•
5
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-random-epoch-2
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-epoch-3
8B
•
Updated
•
1
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.5-epoch-3
8B
•
Updated
•
4
uiuc-kang-lab/Qwen2.5-Math-7B-TIS-noise-0.5-epoch-3
8B
•
Updated
•
5
uiuc-kang-lab/Qwen2.5-Math-7B-SAPO-noise-0.5-epoch-3
8B
•
Updated
•
6
uiuc-kang-lab/Qwen2.5-Math-7B-DrGRPO-noise-0.5-epoch-3
8B
•
Updated
•
2
uiuc-kang-lab/Qwen2.5-Math-7B-DAPO-noise-0.5-epoch-3
8B
•
Updated
•
5
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-format-epoch-3
8B
•
Updated
•
4
uiuc-kang-lab/Qwen2.5-Math-7B-PGFC-noise-0.5-epoch-3
8B
•
Updated
•
6
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.4-epoch-3
8B
•
Updated
•
71
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.3-epoch-3
8B
•
Updated
•
49
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-noise-0.2-epoch-3
8B
•
Updated
•
70
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-clean-epoch-4
8B
•
Updated
•
37
uiuc-kang-lab/Qwen2.5-Math-7B-GRPO-clean-epoch-3
8B
•
Updated
•
32
uiuc-kang-lab/R1-Distill-Qwen-1.5B-mixed
2B
•
Updated
uiuc-kang-lab/Llama3.2-3B-Instruct-math
3B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-12-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-11-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-10-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-9-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-8-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-7-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-6-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-5-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-4-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-3-6
2B
•
Updated
uiuc-kang-lab/R1-Distill-Qwen-1.5B-math-epoch-2-6
2B
•
Updated