rubricreward/mmr3-synthalign
Viewer
• Updated
• 12.4k • 15
Viewer
• Updated
• 14.4k • 16
rubricreward/mR3-Dataset-100K-EasyToHard
Viewer
• Updated
• 100k • 94
• 1
rubricreward/m-ArenaHard-v2.0
Viewer
• Updated
• 11.5k • 15
rubricreward/reward-bench
Viewer
• Updated
• 2.99k • 21
rubricreward/mR3-Dataset-100K-EasyToHard-Truncated
Viewer
• Updated
• 99.5k • 13
• 1
rubricreward/PPE-Human-Preference
Viewer
• Updated
• 15.5k • 3
rubricreward/mR3-Dataset-100K-StartEng-EasyToHard
Viewer
• Updated
• 100k • 26
• 1
rubricreward/mR3-Dataset-100K-StartEng-HardToEasy
Viewer
• Updated
• 100k • 20
rubricreward/mR3-Dataset-100K-HardToEasy
Viewer
• Updated
• 100k • 26
rubricreward/mR3-Dataset-100K-StartEng
Viewer
• Updated
• 100k • 52
rubricreward/mR3-Dataset-100K
Viewer
• Updated
• 100k • 15
rubricreward/mR3-Dataset-100K-Truncated
rubricreward/mR3-Dataset-Cleaned
Viewer
• Updated
• 100k • 6
rubricreward/mR3-Dataset-Filtered3
Viewer
• Updated
• 441k • 9
rubricreward/mR3-Dataset-Filtered2
Viewer
• Updated
• 645k • 103
rubricreward/PolyGuard-Filtered2
Viewer
• Updated
• 518k • 80
rubricreward/mR3-Dataset-Filtered2-no-PolyGuard
Viewer
• Updated
• 128k • 2
rubricreward/mR3-Dataset-Filtered1-no-PolyGuard
Viewer
• Updated
• 208k • 7
rubricreward/HelpSteer3-Filtered1
Viewer
• Updated
• 16.9k • 6
rubricreward/HelpSteer3-tgt_prompt_tgt_thinking-filtered_correct
Viewer
• Updated
• 26.6k • 4
rubricreward/HelpSteer3-tgt_prompt_en_thinking-filtered_correct
Viewer
• Updated
• 26.3k • 6
rubricreward/HelpSteer3-en_prompt_en_thinking-filtered_correct
Viewer
• Updated
• 26.8k • 7
• 1
rubricreward/HelpSteer3-tgt_prompt_tgt_thinking
Viewer
• Updated
• 38.5k • 7
rubricreward/HelpSteer3-tgt_prompt_en_thinking
Viewer
• Updated
• 38.5k • 2
rubricreward/HelpSteer3-en_prompt_en_thinking
Viewer
• Updated
• 38.5k • 13
rubricreward/PolyGuardMix-tgt_prompt_tgt_thinking-filtered_correct
Viewer
• Updated
• 2.57M • 5
rubricreward/PolyGuardMix-tgt_prompt_en_thinking-filtered_correct
Viewer
• Updated
• 2.62M • 2
rubricreward/PolyGuardMix-en_prompt_en_thinking-filtered_correct
Viewer
• Updated
• 2.63M • 7
rubricreward/PolyGuardMix-tgt_prompt_tgt_thinking
Viewer
• Updated
• 2.88M • 2