3D-VCD: Hallucination Mitigation in 3D-LLM Embodied Agents through Visual Contrastive Decoding
Paper • 2604.08645 • Published • 1
None defined yet.
RewardFlow: Generate Images by Optimizing What You Reward
Phantom: Physics-Infused Video Generation via Joint Modeling of Visual and Latent Physical Dynamics