sashaboguraev/pythia-1b-ppt-control_nca_steps250_1b-seed324 • Text Generation • 1B • Updated about 1 month ago • 2 • 1
How LoRA Remembers? A Parametric Memory Law for LLM Finetuning • Paper • 2605.30260 • Published May 28 • 44
OR-Space: A Full-Lifecycle Workspace Benchmark for Industrial Optimization Agents • Paper • 2605.28158 • Published May 27 • 6
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards • Paper • 2605.21467 • Published May 20 • 207
Omni-DuplexEval: Evaluating Real-time Duplex Omni-modal Interaction • Paper • 2605.17360 • Published May 17 • 4
kairawal/Gemma-3-1B-IT-EL-SynthDolly-r16alpha128-E5-S73 • Text Generation • 1.0B • Updated May 22 • 3 • 1
Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information • Paper • 2605.11609 • Published May 12 • 196
IndusAgent: Reinforcing Open-Vocabulary Industrial Anomaly Detection with Agentic Tools • Paper • 2605.20682 • Published May 20 • 85
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence • Paper • 2605.12882 • Published May 13 • 274
HERMES++: Toward a Unified Driving World Model for 3D Scene Understanding and Generation • Paper • 2604.28196 • Published Apr 30 • 74
Online Self-Calibration Against Hallucination in Vision-Language Models • Paper • 2605.00323 • Published May 1 • 3