Shaobai Jiang
shaobaij
AI & ML interests
None yet
Recent Activity
upvoted a paper about 15 hours ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation upvoted a paper about 18 hours ago
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger upvoted a paper about 19 hours ago
LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut ModulationOrganizations
None yet