MolmoMotion: Forecasting Point Trajectories in 3D with Language Instruction Paper • 2606.18558 • Published 4 days ago • 44
MAOAM: Unified Object and Material Selection with Vision-Language Models Paper • 2606.04880 • Published 19 days ago • 10
From Plans to Pixels: Learning to Plan and Orchestrate for Open-Ended Image Editing Paper • 2605.15181 • Published May 14 • 12