
Ricardo Corso Fernandes Jr

g4ry

G-4-R-Y

AI & ML interests

NLP, NLU, Textless NLP, Multimodal Modelling and Speech Processing.

Recent Activity

upvoted a paper 4 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

upvoted an article about 2 months ago

TRL v1.0: Post-Training Library Built to Move with the Field

liked a model 2 months ago

Jackrong/Qwen3.5-4B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF


Organizations

None yet

upvoted a paper 4 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published 10 days ago • 207

upvoted an article about 2 months ago

TRL v1.0: Post-Training Library Built to Move with the Field

Article • qgallouedec, stevhliu, pcuenq, sergiopaniego • Mar 31 • 53

upvoted 6 articles 3 months ago

Introducing Legal RAG Bench

Article • isaacus • Feb 20 • 14

Uncensor any LLM with abliteration

Article • mlabonne • Jun 13, 2024 • 862

DenseR: Dense Rewards For Free in LLM Reasoning

Article • hbXNov • Feb 18 • 21

FINAL Bench: The Real Bottleneck to AGI Is Self-Correction

Article • FINAL-Bench • Feb 21 • 20

Introducing SyGra Studio

Article • ServiceNow-AI • Feb 5 • 26

One-Shot Any Web App with Gradio's gr.HTML

Article • ysharma, hysts, freddyaboulton • Feb 18 • 33

upvoted a paper 4 months ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published Jan 18 • 204

upvoted 3 articles 10 months ago

SmolVLM2: Bringing Video Understanding to Every Device

Article • orrzohar, mfarre, andito, merve, pcuenq, cyrilzakka, Xenova • Feb 20, 2025 • 340

VideoMamba: State Space Model for Efficient Video Understanding

Article • vladbogo • Mar 16, 2024 • 2

Diffusion Language Models: The New Paradigm

Article • ProCreations • Jun 10, 2025 • 48

upvoted an article 12 months ago

Introduction to State Space Models (SSM)

Article • lbourdois • Jul 19, 2024 • 226

upvoted a paper about 1 year ago

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published May 21, 2025 • 98

upvoted 2 articles about 1 year ago

Vision Language Models (Better, faster, stronger)

Article • merve, sergiopaniego, ariG23498, pcuenq, andito • May 12, 2025 • 613

Probabilistic Fractal Activation Function (P-FAF) and Its Advantages Over Traditional Word Vectorization

Article • TuringsSolutions • Feb 8, 2024 • 14

upvoted a paper about 1 year ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6, 2025 • 191

upvoted 3 articles about 1 year ago

KV Caching Explained: Optimizing Transformer Inference Efficiency

Article • not-lain • Jan 30, 2025 • 342

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Article • edbeeching, ybelkada, lvwerra, smangrul, lewtun, kashif • Mar 9, 2023 • 72

Putting RL back in RLHF

Article • vwxyzjn, ArashAhmadian • Jun 12, 2024 • 111
