video - a e-tuanzi Collection

e-tuanzi 's Collections

3d

agent

light

video

game

video

updated 25 days ago

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Paper • 2512.24271 • Published Dec 30, 2025 • 62
FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation

Paper • 2512.24724 • Published Dec 31, 2025 • 7
Pretraining Frame Preservation in Autoregressive Video Memory Compression

Paper • 2512.23851 • Published Dec 29, 2025 • 24
PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation

Paper • 2512.24551 • Published Dec 31, 2025 • 19
JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation

Paper • 2512.22905 • Published Dec 28, 2025 • 20
Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems

Paper • 2512.24385 • Published Dec 30, 2025 • 8
Factorized Learning for Temporally Grounded Video-Language Models

Paper • 2512.24097 • Published Dec 30, 2025 • 7
SurgWorld: Learning Surgical Robot Policies from Videos via World Modeling

Paper • 2512.23162 • Published Dec 29, 2025 • 11
Video-BrowseComp: Benchmarking Agentic Video Research on Open Web

Paper • 2512.23044 • Published Dec 28, 2025 • 10
Knot Forcing: Taming Autoregressive Video Diffusion Models for Real-time Infinite Interactive Portrait Animation

Paper • 2512.21734 • Published Dec 25, 2025 • 5