TanyuNvidia/expand-qwen2.5-3b_squeezer_llam4_maverick_top_5_formatreward_0.2 3B • Updated Sep 15, 2025
TanyuNvidia/expand-qwen2.5-3b_squeezer_llam4_maverick_top_5_formatreward_0.2 3B • Updated Sep 15, 2025
ParallelSearch Collection Checkpoints for our paper "ParallelSearch: Train your LLMs to Decompose Query and Search Sub-queries in Parallel with Reinforcement Learning" • 3 items • Updated Aug 8, 2025 • 5