Training and evaluation datasets collected for Solaris: Building a Multiplayer Video World Model in Minecraft
Papers
Solaris: Building a Multiplayer Video World Model in Minecraft
Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders
Collection for "Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders"
- nyu-visionx/Scale-RAE-Qwen1.5B_DiT2.4B
  Text Generation • 4B • Updated • 4.57k
- nyu-visionx/Scale-RAE-Qwen1.5B_DiT2.4B-64ep
  Text Generation • 4B • Updated • 441
- nyu-visionx/Scale-RAE-Qwen7B_DiT9.8B
  Text Generation • 17B • Updated • 37 • 1
- nyu-visionx/Scale-RAE-Qwen7B_DiT9.8B-64ep
  Text Generation • 17B • Updated • 12
VSI-SUPER benchmark proposed in Cambrian-S
Collection for Diffusion Transformers with Representation Autoencoders
- Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs
  Paper • 2406.16860 • Published • 63
- nyu-visionx/cambrian-8b
  Text Generation • Updated • 1.06k • 63
- nyu-visionx/cambrian-13b
  Text Generation • 13B • Updated • 7 • 19
- nyu-visionx/cambrian-34b
  Text Generation • 35B • Updated • 15 • 27
Model weights for Solaris: Building a Multiplayer Video World Model in Minecraft
Data used during Cambrian-S's 4-stage training