PritamcodesAGI (Pritam Kumar Ravi)

upvoted an article 4 months ago

Article

We Got Claude to Build CUDA Kernels and teach open models!

+2

burtenshaw, evalstate, merve, pcuenq

•

Jan 28

• 156

upvoted an article 5 months ago

Article

Mixture of Experts Explained

+4

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 1.13k

upvoted a collection 5 months ago

📝 Research & Long-Form Blog Posts

Collection

In-depth technical articles and research pieces published by Hugging Face • 16 items • Updated about 16 hours ago • 22

upvoted 2 articles 5 months ago

Article

Deriving the PPO Loss from First Principles

garg-aayush

•

Dec 25, 2025

• 44

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

+5

ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez

•

Sep 11, 2025

• 188

upvoted an article 6 months ago

Article

The Annotated Diffusion Model

nielsr, kashif

•

Jun 7, 2022

• 358

upvoted a collection 7 months ago

📐 FineMath

Collection

FineMath datasets and ablation models • 14 items • Updated May 5, 2025 • 26

upvoted a paper 8 months ago

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 515

upvoted an article 9 months ago

Article

Making any LLM model "reasoning"

Metal3d

•

Mar 23, 2025

• 25

upvoted 3 papers 10 months ago

Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

Paper • 2508.04825 • Published Aug 6, 2025 • 60

MolmoAct: Action Reasoning Models that can Reason in Space

Paper • 2508.07917 • Published Aug 11, 2025 • 45

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320

upvoted 3 papers 11 months ago

AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs

Paper • 2507.05687 • Published Jul 8, 2025 • 31

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Paper • 2507.01006 • Published Jul 1, 2025 • 256

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Paper • 2506.14429 • Published Jun 17, 2025 • 44

upvoted a paper 12 months ago

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9, 2025 • 265

upvoted an article 12 months ago

Article

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes

nvidia

•

Jun 4, 2025

• 23

upvoted a paper about 1 year ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12, 2025 • 86

upvoted an article about 1 year ago

Article

Vision Language Models (Better, faster, stronger)

+3

merve, sergiopaniego, ariG23498, pcuenq, andito

•

May 12, 2025

• 613

upvoted a paper about 1 year ago

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published May 7, 2025 • 65

Pritam Kumar Ravi

AI & ML interests

Organizations

We Got Claude to Build CUDA Kernels and teach open models!

Mixture of Experts Explained

📝 Research & Long-Form Blog Posts

Deriving the PPO Loss from First Principles

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

The Annotated Diffusion Model

📐 FineMath

Less is More: Recursive Reasoning with Tiny Networks

Making any LLM model "reasoning"

Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

MolmoAct: Action Reasoning Models that can Reason in Space

Group Sequence Policy Optimization

AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Reinforcement Pre-Training

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Vision Language Models (Better, faster, stronger)

ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Pritam Kumar Ravi

AI & ML interests

Organizations

PritamcodesAGI's activity

We Got Claude to Build CUDA Kernels and teach open models!

Mixture of Experts Explained

Deriving the PPO Loss from First Principles

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

The Annotated Diffusion Model

Making any LLM model "reasoning"

Explore, Build, and Innovate AI Reasoning with NVIDIA’s Open Models and Recipes

Vision Language Models (Better, faster, stronger)