Mehul Damani PRO

mehuldamani

https://damanimehul.github.io

AI & ML interests

Reinforcement Learning, Large Language Models

Recent Activity

published a model 1 day ago

mehuldamani/rerun_rlcr_single_from_rlvr_chkpt480

published a model 1 day ago

mehuldamani/qwen3_8b_medical_rlvr_multi_k_5

published a model 4 days ago

mehuldamani/math_rlcr_single_try1_cl4096

Organizations

None yet

Collections 1

Papers 4

Models 206

Datasets 50

mehuldamani/big-math-tough

Viewer • Updated 5 days ago • 18.5k • 65

mehuldamani/medTroubleshootig-rlvr-220-evaled-on-rlcr

Viewer • Updated 10 days ago • 5k • 12

mehuldamani/medTroubleshootig-rlvr-220-evaled-on-rlvr

Viewer • Updated 10 days ago • 5k • 12

mehuldamani/medDataset_25k

Viewer • Updated 27 days ago • 75k • 371

mehuldamani/medDataset

Viewer • Updated 27 days ago • 1.29M • 102

mehuldamani/qwen3_8b_ambigQA_rlcr_multi_analysis

Viewer • Updated 29 days ago • 2k • 10

mehuldamani/qwen3_8b_ambigQA_rlcr_single_passk_tryAgain

Viewer • Updated about 1 month ago • 2k • 4

mehuldamani/ambigQA

Viewer • Updated Dec 22, 2025 • 12k • 16

mehuldamani/judge-new-sft-instruct

Viewer • Updated Dec 10, 2025 • 100 • 3

mehuldamani/judge-new-sft-base

Viewer • Updated Dec 10, 2025 • 100 • 7

Collections 1

mehuldamani/big-math-digits-v2-correctness

mehuldamani/hotpot-v2-correctness-7b

mehuldamani/orm-big-math-digits-v2-correctness

mehuldamani/big-math-digits-v2-brier

Models 206

mehuldamani/rerun_rlcr_single_from_rlvr_chkpt480

mehuldamani/qwen3_8b_medical_rlvr_multi_k_5

mehuldamani/math_rlcr_single_try1_cl4096

mehuldamani/math_rlvr_single_try1_cl4096

mehuldamani/math_rlvr_multi_try1_cl4096

mehuldamani/math_rlcr_multi_try1_cl4096

mehuldamani/math_rlcr_single_try1

mehuldamani/math_rlcr_multi_try1

mehuldamani/math_rlvr_single_try1

mehuldamani/math_rlvr_multi_try1
