Sumiokashi/qwen3-4b-structured-3k-mix-sft_lora-dpo-qwen-cot-merged Text Generation • 4B • Updated Mar 1 • 11