The Appeal and Reality of Recycling LoRAs with Adaptive Merging Paper • 2602.12323 • Published Feb 12 • 1
A Gradient Perspective on RLVR Stability and Winner Advantage Policy Optimization Paper • 2606.16154 • Published 4 days ago • 6
supergoose/flan_combined_task1558_jfleg_incorrect_answer_generation Viewer • Updated Mar 10, 2025 • 2.26k • 8
supergoose/flan_combined_task1063_pib_translation_gujarati_tamil Viewer • Updated Mar 10, 2025 • 4.48k • 7
supergoose/flan_combined_task1724_civil_comments_insult_classification Viewer • Updated Mar 10, 2025 • 2.99k • 5
supergoose/flan_combined_task951_wiki_cloze_or_multiple_choice_question_answering Viewer • Updated Mar 10, 2025 • 5.91k • 132
supergoose/flan_combined_task520_aquamuse_answer_given_in_passage Viewer • Updated Mar 10, 2025 • 2.98k • 13
supergoose/flan_combined_task1604_ethos_text_classification Viewer • Updated Mar 10, 2025 • 2.98k • 12
supergoose/flan_combined_task1380_quarel_correct_option_generation Viewer • Updated Mar 10, 2025 • 2.39k • 51