zecanard/Qwen3.6-27B-uncensored-abliterix-MLX-4bit-mixed_4_6 Image-Text-to-Text • 5B • Updated 8 days ago • 409 • 2
RUBRIC-ARROW: Alternating Pointwise Rubric Reward Modeling for LLM Post-training in Non-verifiable Domains Paper • 2605.29156 • Published 24 days ago • 14
Towards Evaluation Engineering: An Empirical Study of ML Evaluation Harnesses in the Wild Paper • 2605.24213 • Published 29 days ago • 14
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information Paper • 2605.11609 • Published May 12 • 196
IntentGrasp: A Comprehensive Benchmark for Intent Understanding Paper • 2605.06832 • Published May 7 • 8
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published May 7 • 235
openai/whisper-large-v3-turbo Automatic Speech Recognition • 0.8B • Updated Oct 4, 2024 • 7.96M • • 3.1k
DCAgent/e1_embedding_d1_original_sandboxes_glm_4.7_traces_jupiter Viewer • Updated Apr 12 • 12.1k • 47
Adam's Law: Textual Frequency Law on Large Language Models Paper • 2604.02176 • Published Apr 2 • 507
When Numbers Speak: Aligning Textual Numerals and Visual Instances in Text-to-Video Diffusion Models Paper • 2604.08546 • Published Apr 9 • 115