PSFT+RL models
SII-Wenhong
wh-zhu
models 57
wh-zhu/Qwen2.5-7B-Instruct-SFT-lr-5e6
8B
wh-zhu/Qwen2.5-7B-Instruct-16-1300
8B
wh-zhu/Qwen2.5-7B-Instruct-ref-1300
8B
wh-zhu/Qwen2.5-7B-Instruct-update4-600
8B
wh-zhu/Qwen2.5-7B-Instruct-VL-SFT-RL120
8B
wh-zhu/Qwen2.5-7B-Instruct-VL-SFT-RL165
8B
wh-zhu/Qwen2.5-7B-Instruct-VL-PSFT-RL165
8B
wh-zhu/Qwen2.5-7B-Instruct-VL-ORI-RL140
8B
wh-zhu/Qwen2.5-7B-Instruct-edit-ruilin400
8B
wh-zhu/Qwen2.5-7B-Instruct-VL-RL100
8B