The models trained with EVOL-RL
Yujun Zhou
yujunzhou
Models: 251
yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B • Text Generation • 4B • 6
yujunzhou/SFT_Advanced_Risk_Self_Grading_llama • Text Generation • 8B • 5
yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B-Base • Text Generation • 4B • 3
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B • Text Generation • 4B • 11
yujunzhou/Advanced_Risk_Self_Grading_llama • 8B
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base • Text Generation • 4B • 6
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_llama • Text Generation • 8B • 1
yujunzhou/SFT_Advanced_Risk_Situation_Aware_Qwen3-4B-Base • Text Generation • 4B • 6
yujunzhou/SFT_Advanced_Risk_Situation_Aware_Qwen3-4B • Text Generation • 4B • 1
yujunzhou/SFT_Advanced_Risk_Situation_Aware_llama • Text Generation • 8B • 5