Cool Papers
BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation
Paper
• 2401.17053
• Published
• 33
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks
Paper
• 2402.04248
• Published
• 32
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper
• 2402.03300
• Published
• 141
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Paper
• 2402.05930
• Published
• 39
Animated Stickers: Bringing Stickers to Life with Video Diffusion
Paper
• 2402.06088
• Published
• 11
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting
Paper
• 2402.06149
• Published
• 18
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data
Paper
• 2402.08093
• Published
• 62
Paper
• 2402.13144
• Published
• 100
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
Paper
• 2402.14905
• Published
• 134
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper
• 2402.17764
• Published
• 627