Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty
Mehul Damani PRO
mehuldamani
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
published
a model
1 day ago
mehuldamani/rerun_rlcr_single_from_rlvr_chkpt480
published
a model
1 day ago
mehuldamani/qwen3_8b_medical_rlvr_multi_k_5
published
a model
4 days ago
mehuldamani/math_rlcr_single_try1_cl4096
Organizations
None yet