Kyu Song

kyunocap

1 133 36

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Stitched Value Model for Diffusion Alignment

upvoted a paper about 1 month ago

OcclusionFormer: Arranging Z-Order for Layout-Grounded Image Generation

upvoted a paper about 2 months ago

Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

View all activity

Organizations

None yet

upvoted 2 papers about 1 month ago

Stitched Value Model for Diffusion Alignment

Paper • 2605.19804 • Published May 19 • 12

OcclusionFormer: Arranging Z-Order for Layout-Grounded Image Generation

Paper • 2605.21343 • Published May 20 • 8

upvoted 7 papers about 2 months ago

Warp-as-History: Generalizable Camera-Controlled Video Generation from One Training Video

Paper • 2605.15182 • Published May 14 • 39

SANA-WM: Efficient Minute-Scale World Modeling with Hybrid Linear Diffusion Transformer

Paper • 2605.15178 • Published May 14 • 91

Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation

Paper • 2605.15141 • Published May 14 • 96

Soohak: A Mathematician-Curated Benchmark for Evaluating Research-level Math Capabilities of LLMs

Paper • 2605.09063 • Published May 9 • 82

liked a Space about 2 months ago

Nemotron 3 Nano Omni

⚡

Chat with AI using text, images, video, and audio

upvoted an article 2 months ago

Article

How to Ground a Korean AI Agent in Real Demographics with Synthetic Personas

nvidia

•

Apr 21

• 26

upvoted 9 papers 3 months ago

Seedance 2.0: Advancing Video Generation for World Complexity

Paper • 2604.14148 • Published Apr 15 • 168

Matrix-Game 3.0: Real-Time and Streaming Interactive World Model with Long-Horizon Memory

Paper • 2604.08995 • Published Apr 10 • 51

LPM 1.0: Video-based Character Performance Model

Paper • 2604.07823 • Published Apr 9 • 82

SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

Paper • 2604.08377 • Published Apr 9 • 295

AvatarPointillist: AutoRegressive 4D Gaussian Avatarization

Paper • 2604.04787 • Published Apr 6 • 12

Token Warping Helps MLLMs Look from Nearby Viewpoints

Paper • 2604.02870 • Published Apr 3 • 34

Out of Sight but Not Out of Mind: Hybrid Memory for Dynamic Video World Models

Paper • 2603.25716 • Published Mar 26 • 157

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Paper • 2603.25746 • Published Mar 26 • 155

PackForcing: Short Video Training Suffices for Long Video Sampling and Long Context Inference

Paper • 2603.25730 • Published Mar 26 • 53