ketchup123/rebuttal_helpsteer3_everything_llama
ketchup123/rebuttal_helpsteer3_rip_qwen
ketchup123/rebuttal_helpsteer3_pref_qwen
ketchup123/rebuttal_helpsteer3_rip_llama
ketchup123/rebuttal_helpsteer3_qwen
ketchup123/rebuttal_helpsteer3_pref_llama
ketchup123/rebuttal_helpsteer3_llama
ketchup123/ARR_qwen_2.5_pubmedqa
ketchup123/ARR_qwen_gsm8k
ketchup123/DPO_ablations_qwen_tuludpo_everything
ketchup123/DPO_ablations_qwen_tuludpo_quality_and_difficulty
ketchup123/DPO_ablations_qwen_tuludpo_quality_only
ketchup123/DPO_ablations_llama_tuludpo_everything
ketchup123/DPO_ablations_qwen_ultrafeedback_everything
ketchup123/DPO_ablations_qwen_ultrafeedback_quality_and_difficulty
ketchup123/DPO_ablations_qwen_ultrafeedback_quality_only
ketchup123/DPO_ablations_llama_tuluDPO_quality_and_difficulty_only
ketchup123/DPO_ablations_llama_tuluDPO_quality_only
ketchup123/DPO_ablations_qwen_ultramix_pref_only
ketchup123/DPO_ablations_qwen_ultramix_quality_only
ketchup123/DPO_ablations_llama_ultramix_pref_only
ketchup123/DPO_ablations_llama_ultramix_quality_only
ketchup123/llama-3.1-8B-pubmedqa-LF
ketchup123/DPO_ablations_qwen_ultramix_no_pref_filter
ketchup123/DPO_ablations_llama_ultramix_no_pref_filter
ketchup123/DPO_ablations_qwen_orpo_pref_filter_only
ketchup123/DPO_ablations_qwen_helpsteer_pref_filter_only
ketchup123/DPO_ablations_qwen_codepreferences_pref_filter_only
ketchup123/DPO_ablations_qwen_tulu_pref_filter_only
ketchup123/DPO_ablations_qwen_ultrafeedback_pref_filter_only