The models trained with EVOL-RL
Yujun Zhou
yujunzhou
AI & ML interests
None yet
Recent Activity
new activity 13 days ago
yujunzhou/AIME-TTT-OctoThinker-8B-Hybrid-Base-TTRL: Running in MSTY Studio
submitted a paper 3 months ago
Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning
Organizations
None yet