4 12 6

Yusuf Dalva PRO

ydalva

https://yusufdalva.github.io

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago

VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion

authored a paper 6 days ago

AdaState: Self-Evolving Anchors for Streaming Video Generation

submitted a paper 6 days ago

AdaState: Self-Evolving Anchors for Streaming Video Generation

View all activity

Organizations

None yet

upvoted a paper 2 days ago

VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion

Paper • 2605.30351 • Published 7 days ago • 25

authored a paper 6 days ago

AdaState: Self-Evolving Anchors for Streaming Video Generation

Paper • 2605.30349 • Published 7 days ago • 12

submitted a paper to Daily Papers 6 days ago

AdaState: Self-Evolving Anchors for Streaming Video Generation

Paper • 2605.30349 • Published 7 days ago • 12

upvoted a paper 3 months ago

Unified Latents (UL): How to train your latents

Paper • 2602.17270 • Published Feb 19 • 60

upvoted 2 papers 6 months ago

Infinity-RoPE: Action-Controllable Infinite Video Generation Emerges From Autoregressive Self-Rollout

Paper • 2511.20649 • Published Nov 25, 2025 • 52

LayerComposer: Interactive Personalized T2I via Spatially-Aware Layered Canvas

Paper • 2510.20820 • Published Oct 23, 2025 • 11

authored 2 papers 6 months ago

LayerComposer: Interactive Personalized T2I via Spatially-Aware Layered Canvas

Paper • 2510.20820 • Published Oct 23, 2025 • 11

Canvas-to-Image: Compositional Image Generation with Multimodal Controls

Paper • 2511.21691 • Published Nov 26, 2025 • 36

upvoted a paper 6 months ago

Canvas-to-Image: Compositional Image Generation with Multimodal Controls

Paper • 2511.21691 • Published Nov 26, 2025 • 36

commented a paper 6 months ago

Canvas-to-Image: Compositional Image Generation with Multimodal Controls

Paper • 2511.21691 • Published Nov 26, 2025 • 36 •

upvoted a paper 11 months ago

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

Paper • 2507.01955 • Published Jul 2, 2025 • 36

upvoted a paper 12 months ago

Dynamic View Synthesis as an Inverse Problem

Paper • 2506.08004 • Published Jun 9, 2025 • 5

authored a paper about 1 year ago

LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers

Paper • 2505.23758 • Published May 29, 2025 • 22

upvoted a paper about 1 year ago

LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers

Paper • 2505.23758 • Published May 29, 2025 • 22

commented a paper about 1 year ago

LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers

Paper • 2505.23758 • Published May 29, 2025 • 22 •

upvoted a paper about 1 year ago

RoPECraft: Training-Free Motion Transfer with Trajectory-Guided RoPE Optimization on Diffusion Transformers

Paper • 2505.13344 • Published May 19, 2025 • 7

liked a model about 1 year ago

l3xx/ronaldo

Text-to-Image • Updated Oct 24, 2024 • 7 • • 3

upvoted a paper about 1 year ago

Gaussian Mixture Flow Matching Models

Paper • 2504.05304 • Published Apr 7, 2025 • 11

authored a paper about 1 year ago

Image-to-Image Translation with Disentangled Latent Vectors for Face Editing

Paper • 2301.04628 • Published Jan 11, 2023

liked a Space about 1 year ago

CFG Zero Star

🐠

Demo for CFG-Zero*

Yusuf Dalva PRO

AI & ML interests

Recent Activity

Organizations

ydalva's activity

CFG Zero Star