Communicating about Space: Language-Mediated Spatial Integration Across Partial Views Paper • 2603.27183 • Published 23 days ago • 20
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published Nov 10, 2025 • 107
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published Nov 10, 2025 • 107
WebMMU Collection WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation • 2 items • Updated Sep 16, 2025 • 2
WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation Paper • 2508.16763 • Published Aug 22, 2025 • 2
WebMMU Collection WebMMU: A Benchmark for Multimodal Multilingual Website Understanding and Code Generation • 2 items • Updated Sep 16, 2025 • 2
EARL Collection Official artifacts for the paper, The Promise of RL for Autoregressive Image Editing (EARL). • 7 items • Updated Mar 2
EARL Collection Official artifacts for the paper, The Promise of RL for Autoregressive Image Editing (EARL). • 7 items • Updated Mar 2
EARL Collection Official artifacts for the paper, The Promise of RL for Autoregressive Image Editing (EARL). • 7 items • Updated Mar 2