- Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation
  Paper • 2512.24271 • Published • 63
- FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation
  Paper • 2512.24724 • Published • 7
- Pretraining Frame Preservation in Autoregressive Video Memory Compression
  Paper • 2512.23851 • Published • 25
- PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation
  Paper • 2512.24551 • Published • 20
Collections including paper arxiv:2512.24551
- BroRL: Scaling Reinforcement Learning via Broadened Exploration
  Paper • 2510.01180 • Published • 19
- MITS: Enhanced Tree Search Reasoning for LLMs via Pointwise Mutual Information
  Paper • 2510.03632 • Published • 42
- Knapsack RL: Unlocking Exploration of LLMs via Optimizing Budget Allocation
  Paper • 2509.25849 • Published • 48
- Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR
  Paper • 2509.23808 • Published • 47

- Towards Scalable Pre-training of Visual Tokenizers for Generation
  Paper • 2512.13687 • Published • 105
- MMGR: Multi-Modal Generative Reasoning
  Paper • 2512.14691 • Published • 119
- Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss
  Paper • 2512.23447 • Published • 98
- LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation
  Paper • 2512.23576 • Published • 65