robbie

robb-0

AI & ML interests

I like semiotics and hermeneutics, and it so happens that I train image LoRAs (and secretly fine-tune LLMs). Billy is my teddy doggy 🐶🐕🦊🧸

Organizations

upvoted a collection 6 months ago

Falcon-H1

Collection

Falcon-H1 Family of Hybrid-Head Language Models (Transformer-SSM), including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained & instruction-tuned). • 39 items • Updated Jan 9 • 59

upvoted a paper 6 months ago

Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance

Paper • 2507.22448 • Published Jul 30, 2025 • 70

upvoted a paper 10 months ago

Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models

Paper • 2504.10615 • Published Apr 14, 2025 • 2

upvoted a collection 10 months ago

Granite 4.0 Language Models

Collection

13 items • Updated Nov 17, 2025 • 207

upvoted a paper 10 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7, 2025 • 152

upvoted a collection 11 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29, 2025 • 695

upvoted a paper 11 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 53

upvoted a collection 11 months ago

— UI is a good thing 💅 —

Collection

cool spaces with a cool UI, what could be better? • 5 items • Updated May 5, 2025 • 30

upvoted an article 11 months ago

Article

I Clicked “I Agree”, But What Am I Really Consenting To?

Mar 26, 2025

upvoted 2 collections 11 months ago

My Bookmarks

Collection

163 items • Updated Sep 23, 2025 • 4

Spaces for LLM / VLM / NLP

Collection

1294 items • Updated 41 minutes ago • 12

upvoted 4 papers 11 months ago

Model Hubs and Beyond: Analyzing Model Popularity, Performance, and Documentation

Paper • 2503.15222 • Published Mar 19, 2025 • 1

The AI Community Building the Future? A Quantitative Analysis of Development Activity on Hugging Face Hub

Paper • 2405.13058 • Published May 20, 2024 • 2

SpaceByte: Towards Deleting Tokenization from Large Language Modeling

Paper • 2404.14408 • Published Apr 22, 2024 • 7

T-FREE: Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings

Paper • 2406.19223 • Published Jun 27, 2024 • 11

upvoted a paper 12 months ago

Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information

Paper • 2502.14258 • Published Feb 20, 2025 • 26

upvoted a collection 12 months ago

Foundation Text-Generation Models Below 360M Parameters

Collection

Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 42 items • Updated about 1 month ago • 41

upvoted a paper 12 months ago

Finch: Prompt-guided Key-Value Cache Compression

Paper • 2408.00167 • Published Jul 31, 2024 • 17

upvoted a collection 12 months ago

Hallucination

Collection

14 items • Updated Jun 10, 2024 • 8

upvoted a paper 12 months ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13, 2025 • 170
