BLIP3o-NEXT: Next Frontier of Native Image Generation Paper • 2510.15857 • Published Oct 17, 2025 • 26
VideoNSA: Native Sparse Attention Scales Video Understanding Paper • 2510.02295 • Published Oct 2, 2025 • 10
Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts Paper • 2511.04655 • Published Nov 6, 2025 • 10
Benchmarking Visual State Tracking in Multimodal Video Understanding Paper • 2606.03920 • Published 29 days ago • 52
Benchmarking Visual State Tracking in Multimodal Video Understanding Paper • 2606.03920 • Published 29 days ago • 52