Kashif Rasul's picture

Kashif Rasul

kashif

·

AI & ML interests

Time Series Forecasting, Denoising Diffusion, Generative Modeling, Reinforcement Learning

Recent Activity

updated a Space about 21 hours ago

kashif/xtoken-distillation-static-1ca43a

published a Space about 21 hours ago

kashif/xtoken-distillation-static-1ca43a

updated a bucket about 21 hours ago

kashif/xtoken-distillation-static-1ca43a-bucket

View all activity

Organizations

published an article 4 days ago

Article

Beyond LoRA: Can you beat the most popular fine-tuning technique?

+2

BenjaminB, sayakpaul, hubnemo, kashif

•

4 days ago

• 35

published an article 23 days ago

Article

Agentic RL: Token-In, Token-Out Done Right

huggingface

•

23 days ago

• 14

published an article 26 days ago

Article

Shipping a Trillion Parameters With a Hub Bucket: Delta Weight Sync in TRL

+6

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, lvwerra, sergiopaniego

•

26 days ago

• 42

published an article 3 months ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

+7

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego

•

Mar 10

• 164

published an article 3 months ago

Article

Ulysses Sequence Parallelism: Training with Million-Token Contexts

kashif, stas

•

Mar 9

• 30

published an article 11 months ago

Article

Vision Language Model Alignment in TRL ⚡️

+3

sergiopaniego, merve, qgallouedec, kashif, ariG23498

•

Aug 7, 2025

• 112

published an article 12 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

+21

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf

•

Jul 8, 2025

• 780

published an article 12 months ago

Article

Gemma 3n fully available in the open-source ecosystem!

+6

ariG23498, pcuenq, sergiopaniego, reach-vb, FL33TW00D-HF, Xenova, Steveeeeeeen, kashif

•

Jun 26, 2025

• 121

published an article about 1 year ago

Article

KV Cache from scratch in nanoVLM

+3

ariG23498, kashif, lusxvr, andito, pcuenq

•

Jun 4, 2025

• 120

published an article about 1 year ago

Article

🐯 Liger GRPO meets TRL

+4

shisahni, kashif, smohammadi, ShirinYamani, m0m0chen, liberty4321

•

May 25, 2025

• 54

published an article almost 2 years ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

+6

yfleureau, liyongsea, edbeeching, lewtun, benlipkin, romansoletskyi, vwxyzjn, kashif

•

Jul 11, 2024

• 128

published an article almost 2 years ago

Article

Preference Optimization for Vision Language Models

+2

qgallouedec, vwxyzjn, merve, kashif

•

Jul 10, 2024

• 93

published an article about 2 years ago

Article

Diffusers welcomes Stable Diffusion 3

+4

dn6, YiYiXu, sayakpaul, OzzyGT, kashif, multimodalart

•

Jun 12, 2024

• 99

published an article about 2 years ago

Article

Diffusers welcomes Stable Diffusion 3

+4

dn6, YiYiXu, sayakpaul, OzzyGT, kashif, multimodalart

•

Jun 12, 2024

• 99

published an article over 2 years ago

Article

Constitutional AI with Open LLMs

+5

vwxyzjn, lewtun, edbeeching, lvwerra, osanseviero, kashif, thomwolf

•

Feb 1, 2024

• 17

published an article over 2 years ago

Article

Patch Time Series Transformer in Hugging Face

+3

namctin, wmgifford, ajati, vijaye12, kashif

•

Feb 1, 2024

• 14

published an article over 2 years ago

Article

PatchTSMixer in HuggingFace

+4

ajati, vijaye12, namctin, wmgifford, kashif, nielsr

•

Jan 19, 2024

• 10

published an article over 2 years ago

Article

PatchTSMixer in HuggingFace

+4

ajati, vijaye12, namctin, wmgifford, kashif, nielsr

•

Jan 19, 2024

• 10

published an article over 2 years ago

Article

Preference Tuning LLMs with Direct Preference Optimization Methods

+3

kashif, edbeeching, lewtun, lvwerra, osanseviero

•

Jan 18, 2024

• 84

published an article over 2 years ago

Article

Finetune Stable Diffusion Models with DDPO via TRL

+2

metric-space, sayakpaul, kashif, lvwerra

•

Sep 29, 2023

• 20