Filter, Then Reweight: Rethinking Optimization Granularity in On-Policy Distillation Paper • 2606.02684 • Published 18 days ago • 16
Unveiling Fine-Grained Visual Traces: Evaluating Multimodal Interleaved Reasoning Chains in Multimodal STEM Tasks Paper • 2604.19697 • Published May 8 • 1
LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV Paper • 2605.26244 • Published 25 days ago • 38
Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos Paper • 2605.18984 • Published May 18 • 22
MMSkills: Towards Multimodal Skills for General Visual Agents Paper • 2605.13527 • Published May 14 • 120
Edit-Compass & EditReward-Compass: A Unified Benchmark for Image Editing and Reward Modeling Paper • 2605.13062 • Published May 13 • 33
Beyond the Last Layer: Multi-Layer Representation Fusion for Visual Tokenization Paper • 2605.10780 • Published May 12 • 33
AJ-Bench: Benchmarking Agent-as-a-Judge for Environment-Aware Evaluation Paper • 2604.18240 • Published Apr 20 • 16
LongCat-Next: Lexicalizing Modalities as Discrete Tokens Paper • 2603.27538 • Published Mar 29 • 147
RealChart2Code: Advancing Chart-to-Code Generation with Real Data and Multi-Task Evaluation Paper • 2603.25804 • Published Mar 26 • 30
VTC-Bench: Evaluating Agentic Multimodal Models via Compositional Visual Tool Chaining Paper • 2603.15030 • Published Mar 16 • 21
OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing Paper • 2509.24900 • Published Sep 29, 2025 • 54
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark Paper • 2509.24897 • Published Sep 29, 2025 • 46
Cosmos-Predict1 Collection • This collection is archived. See https://huggingface.co/collections/nvidia/cosmos3 • 14 items • Updated 7 days ago • 304