view article Article Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo ariG23498, aerdem4 • Dec 23, 2024 • 51
view article Article Exploring Direct Tensor Manipulation in Language Models: A Case Study in Binary-Level Model Enhancement TensorSlay • Nov 7, 2025 • 4
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels drbh, danieldk • Aug 18, 2025 • 98
view article Article You could have designed state of the art positional encoding FL33TW00D-HF • Nov 25, 2024 • 480
view article Article Understanding Gemma 3n: How MatFormer Gives You Many Models in One rishiraj • Jun 26, 2025 • 50
view article Article State of open video generation models in Diffusers +1 sayakpaul, a-r-r-o-w, dn6 • Jan 27, 2025 • 70
view article Article How Long Prompts Block Other Requests - Optimizing LLM Performance tngtech • Jun 12, 2025 • 13
view article Article Prefill and Decode for Concurrent Requests - Optimizing LLM Performance tngtech • Apr 16, 2025 • 78
Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning Paper • 2506.01939 • Published Jun 2, 2025 • 190
view article Article Enabling Long Context Training with Sequence Parallelism in Axolotl axolotl-ai-co • Apr 4, 2025 • 17
view article Article SigLIP 2: A better multilingual vision language encoder +1 ariG23498, merve, qubvel-hf • Feb 21, 2025 • 213
view article Article The case for specialized pre-training: ultra-fast foundation models for dedicated tasks Pclanglais • Aug 4, 2024 • 30
Scotch & SOTA 🥃 Pt. 7: Human Feedback Datasets 🫣 Collection The elusive “human” feedback • 1 item • Updated Sep 13, 2023 • 1
Scotch & SOTA 🥃 Pt. 6: Dialogue Tuning Datasets 💬 Collection Conversations, turn-based dialog, and things that can be turned into that. • 4 items • Updated Sep 13, 2023 • 1
Scotch & SOTA 🥃 Pt. 5: Instruction Tuning Datasets 👩🏫 Collection Question & answer, task completion, general SFT and otherwise finetuney data. • 6 items • Updated Mar 2 • 1