EASI Leaderboard

EASI: Holistic Evaluation of Multimodal LLMs on Spatial Intelligence

EASI defines a comprehensive taxonomy of spatial tasks that unifies existing benchmarks, together with a standardized protocol for fairly evaluating state-of-the-art proprietary and open-source models.

Score  Model
65.2   SenseNova-SI-1.3-InternVL3-8B
64.5   SenseNova-SI-1.2-InternVL3-8B
63.8   Gemini 3 Pro
61.5   SenseNova-SI-1.1-InternVL3-8B
58.8   GPT-5
58.1   SenseNova-SI-1.1-Qwen3-VL-8B
58.0   Gemini 2.5 Pro
54.2   Seed 1.6
53.3   Grok 4
51.0   VST-7B-SFT
51.0   SenseNova-SI-1.1-Qwen2.5-VL-7B
50.6   Qwen3-VL-8B-Instruct
49.4   SenseNova-SI-1.1-InternVL3-2B
49.0   InternVL3_5-8B
48.6   SenseNova-SI-1.1-BAGEL-7B-MoT
48.4   VST-3B-SFT
46.6   vlm-3r-llava-qwen2-lora
46.4   Cambrian-S-7B
45.7   InternVL3-8B
45.7   SenseNova-SI-1.1-Qwen2.5-VL-3B
45.3   BAGEL-7B-MoT
44.6   Qwen3-VL-2B-Instruct
43.7   ViLaSR
43.2   Cambrian-S-3B
42.6   Qwen2.5-VL-7B-Instruct
41.8   SpaceR-SFT-7B
40.9   SpatialLadder-3B
40.4   Qwen2.5-VL-3B-Instruct
39.8   InternVL3-2B
35.6   Spatial-MLLM-subset-sft
22.0   MindCube-Qwen2.5VL-RawQA-SFT

Last updated: 2026-01-19 09:32:13 UTC