AI & ML interests
NLP, RL
• 12.5k • 30
• 361k • 12
Dahoas/aimo-validation-aime • 90 • 12
Dahoas/qwen-1.5-4B-default-positives-epoch-1-100 • 290k • 11
Dahoas/qwen-1.5-4B-tree-positives-epoch-2-100 • 491k • 8
Dahoas/qwen-1.5-4B-tree-positives-epoch-1-100 • 477k • 10
Dahoas/qwen-1.5-4B-epoch-1-test-100 • 498k • 7
Dahoas/qwen-1.5-4B-K-100-test • 500k • 41
Dahoas/MATH_train_K_100_qwen_1.5_4B_outputs • 750k • 23
• 750k • 19 • 2
• 8.79k • 28
Dahoas/MATH_full_chat_format • 12.5k • 5 • 1
• 7.91k • 5
• 4.01k • 18
• 1k • 9 • 1
• 1k • 117
Dahoas/prompted_hf_cot_gsm8k • 8.79k • 14 • 7
• 8.79k • 10 • 1
Dahoas/cot_gsm8k_three_step • 741 • 9
Dahoas/no_nl_cot_gsm8k_three_step • 2.09k • 20
Dahoas/no_nl_cot_gsm8k_toy • 2.42k • 81
• 578 • 11
• 32.2k • 13
Dahoas/split_no_nl_cot_gsm8k • 28k • 9 • 1
• 8.68k • 13 • 2
Dahoas/gsm_socratic_conditional • 52.4k • 11 • 1
Dahoas/cot_gsm8k_socratic • 8.79k • 16 • 4
• 8.79k • 55 • 6
• 20k • 15 • 4
• 207k • 23 • 3