ComboStoc: Combinatorial Stochasticity for Diffusion Generative Models Paper • 2405.13729 • Published 6 days ago • 2
Prox-E: Fine-Grained 3D Shape Editing via Primitive-Based Abstractions Paper • 2604.23774 • Published 6 days ago • 13
World2Minecraft: Occupancy-Driven Simulated Scenes Construction Paper • 2604.27578 • Published 5 days ago • 3
Representation Fréchet Loss for Visual Generation Paper • 2604.28190 • Published 5 days ago • 21
Synthetic Computers at Scale for Long-Horizon Productivity Simulation Paper • 2604.28181 • Published 5 days ago • 15
Co-Director: Agentic Generative Video Storytelling Paper • 2604.24842 • Published 8 days ago • 16
IAM: Identity-Aware Human Motion and Shape Joint Generation Paper • 2604.25164 • Published 7 days ago • 2
From Skills to Talent: Organising Heterogeneous Agents as a Real-World Company Paper • 2604.22446 • Published 11 days ago • 117
SketchVLM: Vision language models can annotate images to explain thoughts and guide users Paper • 2604.22875 • Published 12 days ago • 33
Video Analysis and Generation via a Semantic Progress Function Paper • 2604.22554 • Published 11 days ago • 63
Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond Paper • 2604.22748 • Published 11 days ago • 222
Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items Paper • 2604.19748 • Published 14 days ago • 249
UniMesh: Unifying 3D Mesh Understanding and Generation Paper • 2604.17472 • Published 16 days ago • 11
Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence Paper • 2604.18292 • Published 15 days ago • 83
OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation Paper • 2604.18486 • Published 15 days ago • 90
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds Paper • 2604.14268 • Published 20 days ago • 117