- The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
  Paper • 2402.17764 • Published • 627
- Hierarchical Reasoning Model
  Paper • 2506.21734 • Published • 48
- Less is More: Recursive Reasoning with Tiny Networks
  Paper • 2510.04871 • Published • 509
- Training language models to follow instructions with human feedback
  Paper • 2203.02155 • Published • 24
Gabriel
stoksweet
AI & ML interests
None yet
Recent Activity
liked a dataset 1 day ago
karpathy/tinystories-gpt4-clean
updated a collection 16 days ago
Papers
liked a Space 24 days ago
nanochat-students/transformers
Organizations
None yet