·
AI & ML interests
RL for LLMs/CodeLLMs
Organizations
datasets 13
reshinthadith/math12k-stage3
Viewer
• Updated
• 6k • 34
reshinthadith/math12k-stage2
Viewer
• Updated
• 4k • 31
reshinthadith/math12k-stage1
Viewer
• Updated
• 2k • 42
reshinthadith/the-stack-mujoco-xml
Viewer
• Updated
• 48.3k • 14
• 1
reshinthadith/WizardLM_evol_instruct_V2_code_filtered
Viewer
• Updated
• 138k • 17
• 1
reshinthadith/basic_code_ppl_eval
Viewer
• Updated
• 8.73k • 269
• 4
Updated
• 10
reshinthadith/2048_has_code_filtered_base_code_review_python_based_on_property
Viewer
• Updated
• 6.4k • 18
reshinthadith/2048_has_code_filtered_base_code_review_python
Viewer
• Updated
• 6.4k • 13
reshinthadith/dfg_augmented_mbpp
Viewer
• Updated
• 95 • 33