SimToolReal: An Object-Centric Policy for Zero-Shot Dexterous Tool Manipulation Paper • 2602.16863 • Published 6 days ago • 12
The Unreasonable Effectiveness of Scaling Agents for Computer Use Paper • 2510.02250 • Published Oct 2, 2025 • 25
Multi-Turn Code Generation Through Single-Step Rewards Paper • 2502.20380 • Published Feb 27, 2025 • 32
Multi-Turn Code Generation Through Single-Step Rewards Paper • 2502.20380 • Published Feb 27, 2025 • 32 • 2
Multi-Turn Code Generation Through Single-Step Rewards Paper • 2502.20380 • Published Feb 27, 2025 • 32
Robotouille: An Asynchronous Planning Benchmark for LLM Agents Paper • 2502.05227 • Published Feb 6, 2025
leap-llm-chalo2000/Meta-Llama-3-8B-Instruct-sft-alfworld-iter1-gpt-4o-mini Text Generation • 8B • Updated Nov 5, 2024 • 1
leap-llm-chalo2000/Meta-Llama-3-8B-Instruct-sft-alfworld-iter1-sanjiban Text Generation • 8B • Updated Nov 5, 2024 • 2
leap-llm-chalo2000/Meta-Llama-3-8B-Instruct-sft-alfworld-iter0 Text Generation • 8B • Updated Nov 5, 2024
MOSAIC: A Modular System for Assistive and Interactive Cooking Paper • 2402.18796 • Published Feb 29, 2024 • 25
Demo2Code: From Summarizing Demonstrations to Synthesizing Code via Extended Chain-of-Thought Paper • 2305.16744 • Published May 26, 2023 • 1
Demo2Code: From Summarizing Demonstrations to Synthesizing Code via Extended Chain-of-Thought Paper • 2305.16744 • Published May 26, 2023 • 1
MOSAIC: A Modular System for Assistive and Interactive Cooking Paper • 2402.18796 • Published Feb 29, 2024 • 25