Roxanna

borntobeignored

AI & ML interests

None yet

Recent Activity

upvoted a collection 1 day ago

OpenEnv India Hackathon top 100

Collection

Top 100 Space submissions from the OpenEnv India Hackathon. • 98 items • Updated about 16 hours ago • 7

liked a dataset 3 days ago

open-thoughts/TaskTrove

Viewer • Updated 14 days ago • 211k • 2.39k • 17

liked a dataset 7 days ago

BytedTsinghua-SIA/CUDA-Agent-Ops-6K

Viewer • Updated Feb 27 • 6k • 457 • 62

upvoted an article 10 days ago

Article

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 151

liked a Space about 1 month ago

puzzle

👁

108

Get a numeric score for any text input

upvoted an article about 2 months ago

Article

The Large Language Model Course

mlabonne • Jan 16, 2025 • 229

upvoted 4 articles 5 months ago

Article

20x Faster TRL Fine-tuning with RapidFire AI

kbigdelysh, arunkk09, qgallouedec • Nov 21, 2025 • 27

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

wenhuach, Haihao, weiweiz1, n1ck-guo, isaacmac, kding1, IlyasMoutawwakil, marcsun13, medmekk • Apr 29, 2025 • 44

Article

Open-R1: a fully open reproduction of DeepSeek-R1

eliebak, lvwerra, lewtun • Jan 28, 2025 • 889

Article

🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker!

ariG23498 • Jan 29, 2025 • 21

upvoted a paper 5 months ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published Nov 20, 2025 • 110

published a Space 6 months ago

RLagents

🚀

yoyoyo

upvoted 3 articles 7 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

ariG23498, sergiopaniego, reach-vb, pcuenq, ArthurZ, SaylorTwift, cyrilvallez • Sep 11, 2025 • 187

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

drbh, danieldk • Aug 18, 2025 • 97

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

smohammadi, siro1, winglian, marcsun13, djsaunde • Aug 8, 2025 • 98

liked a Space 8 months ago

AgentSeer

🔍

Visualize AI agent workflows and security risks

upvoted a paper 8 months ago

Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?

Paper • 2508.19827 • Published Aug 27, 2025 • 33

liked a dataset 9 months ago

ai-safety-institute/AgentHarm

Viewer • Updated Dec 19, 2024 • 468 • 4.28k • 56

commented a paper 9 months ago

Memp: Exploring Agent Procedural Memory

Paper • 2508.06433 • Published Aug 8, 2025 • 36

upvoted a paper 9 months ago

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5, 2025 • 141
