MINER: Mining Multimodal Internal Representation for Efficient Retrieval Paper • 2605.06460 • Published May 7 • 3
Grounded Chess Reasoning in Language Models via Master Distillation Paper • 2603.20510 • Published Mar 20 • 1
A Gradient Perspective on RLVR Stability and Winner Advantage Policy Optimization Paper • 2606.16154 • Published 18 days ago • 8
A Gradient Perspective on RLVR Stability and Winner Advantage Policy Optimization Paper • 2606.16154 • Published 18 days ago • 8
Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models Paper • 2306.04675 • Published Jun 7, 2023 • 1
Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents Paper • 2606.05296 • Published about 1 month ago • 10
Agentic Monte Carlo: Simulating Reinforcement Learning for Black-Box Agents Paper • 2606.05296 • Published about 1 month ago • 10
Response Quality Assessment for Retrieval-Augmented Generation via Conditional Conformal Factuality Paper • 2506.20978 • Published Jun 26, 2025 • 1
RankJudge: A Multi-Turn LLM-as-a-Judge Synthetic Benchmark Generator Paper • 2605.21748 • Published May 20 • 17
Chartographer: Counterfactual Chart Generation for Evaluating Vision-Language Models Paper • 2605.27311 • Published May 26 • 3
Beyond Procedure: Substantive Fairness in Conformal Prediction Paper • 2602.16794 • Published Feb 18 • 1
On the Burden of Achieving Fairness in Conformal Prediction Paper • 2605.14260 • Published May 15 • 1
RankJudge: A Multi-Turn LLM-as-a-Judge Synthetic Benchmark Generator Paper • 2605.21748 • Published May 20 • 17