oaimli/longtune_scitrek_grounding_reinforcement_gemma_5 Image-Text-to-Text • 4B • Updated about 19 hours ago • 8
oaimli/longtune_scitrek_grounding_reinforcement_gemma_5 Image-Text-to-Text • 4B • Updated about 19 hours ago • 8
oaimli/longtune_scitrek_grounding_reinforcement_gemma_0 Image-Text-to-Text • 4B • Updated about 19 hours ago • 9
oaimli/longtune_scitrek_grounding_reinforcement_gemma_0 Image-Text-to-Text • 4B • Updated about 19 hours ago • 9
oaimli/longtune_scitrek_direct_grounding_gemma Image-Text-to-Text • 4B • Updated about 19 hours ago • 9
oaimli/longtune_scitrek_direct_grounding_gemma Image-Text-to-Text • 4B • Updated about 19 hours ago • 9
EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience Paper • 2601.15876 • Published 27 days ago • 90
oaimli/longtune_scitrek_grounding_reinforcement_gemma_5_alex Image-Text-to-Text • 4B • Updated Dec 22, 2025
oaimli/longtune_scitrek_grounding_reinforcement_gemma_5_alex Image-Text-to-Text • 4B • Updated Dec 22, 2025
oaimli/longtune_hotpotqa_grounding_reinforcement_qwen_5_225 Text Generation • 4B • Updated Dec 22, 2025 • 1
oaimli/longtune_hotpotqa_grounding_reinforcement_qwen_5_225 Text Generation • 4B • Updated Dec 22, 2025 • 1