Papers
The Art of Scaling Reinforcement Learning Compute for LLMs
Paper
•
2510.13786
•
Published
•
32
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper
•
2510.14973
•
Published
•
42
Paper
•
2510.13998
•
Published
•
58
GigaBrain-0: A World Model-Powered Vision-Language-Action Model
Paper
•
2510.19430
•
Published
•
51
Every Question Has Its Own Value: Reinforcement Learning with Explicit Human Values
Paper
•
2510.20187
•
Published
•
19
LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts
Paper
•
2510.19363
•
Published
•
62
Qwen3-Omni Technical Report
Paper
•
2509.17765
•
Published
•
146
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground
Paper
•
2512.10430
•
Published
•
114
Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed
Paper
•
2512.14067
•
Published
•
15
Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers
Paper
•
2512.17351
•
Published
•
27
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Paper
•
2512.16676
•
Published
•
212
Reinforcement Learning for Self-Improving Agent with Skill Library
Paper
•
2512.17102
•
Published
•
34
mHC: Manifold-Constrained Hyper-Connections
Paper
•
2512.24880
•
Published
•
281
TransMLA: Multi-head Latent Attention Is All You Need
Paper
•
2502.07864
•
Published
•
57