arxiv:2508.15763
Zhouqi Hua
ZhouqiHUA
AI & ML interests
reasoning LLM
Recent Activity
liked
a dataset
13 days ago
openai/gsm8k
upvoted
a
paper
about 1 month ago
Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning
upvoted
a
paper
about 1 month ago
OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification
Organizations
None yet