AI & ML interests
None yet
Organizations
jlpang888/ultrafeedback_easy_only
Viewer
• Updated • 55.5k • 94
jlpang888/ultrainteract_math_cot_sorted_20k
Viewer
• Updated • 22k • 19
jlpang888/ultrainteract_math_cot_unsorted_20k
Viewer
• Updated • 22k • 3
jlpang888/ultrainteract_math_cot_sorted
Viewer
• Updated • 57.2k • 2
jlpang888/ultrainteract_math_cot_unsorted
Viewer
• Updated • 57.2k • 4
jlpang888/ultrafeedback_sorted_rejected_score
Viewer
• Updated • 63.1k • 8
jlpang888/ultrafeedback_sorted_chosen_score
Viewer
• Updated • 63.1k • 4
jlpang888/ultrafeedback_sorted_score_diff_difficult_7387
Viewer
• Updated • 7.49k • 3
jlpang888/tulu_300k_google
Viewer
• Updated • 301k • 8
jlpang888/tulu_30k_google
Viewer
• Updated • 20k • 5
Viewer
• Updated • 301k • 82
jlpang888/ultrafeedback-sft-chosen-and-rejected
Viewer
• Updated • 123k • 9
jlpang888/ultrafeedback-sft-rejected
Viewer
• Updated • 62.1k • 28
jlpang888/ultrafeedback-sft-chosen
Viewer
• Updated • 62.1k • 7
jlpang888/ultrafeedback_sorted_llama_learning_order_new_0923
Viewer
• Updated • 63.1k • 5
jlpang888/ultrafeedback_sorted_external_reward_new_0923
Viewer
• Updated • 63.1k • 2
jlpang888/ultrafeedback_sorted_embedding_distance_new
Viewer
• Updated • 63.1k • 5
jlpang888/ultrafeedback_with_learning_order
Viewer
• Updated • 61.1k • 4
jlpang888/ultrafeedback_ches_score_0.9prop
Viewer
• Updated • 57k • 3
jlpang888/ultrafeedback_ches_score_0.5prop
Viewer
• Updated • 32.6k • 4
jlpang888/ultrafeedback_sorted_score_diff_5perc_swap
Viewer
• Updated • 63.1k • 3
jlpang888/ultrafeedback_sorted_score_diff_10perc_swap
Viewer
• Updated • 63.1k • 3
jlpang888/mistral-instruct-ultrafeedback_sorted_dpo_loss
Viewer
• Updated • 62.7k • 6
jlpang888/llama3-ultrafeedback-armorm_sorted_dpo_loss
Viewer
• Updated • 61.8k • 5
jlpang888/mistral-instruct-ultrafeedback_sorted_rm_score
Viewer
• Updated • 62.7k • 10
jlpang888/llama3-ultrafeedback-armorm_sorted_rm_score
Viewer
• Updated • 61.8k • 5
jlpang888/llama3-ultrafeedback-armorm_sorted_rating_score
Viewer
• Updated • 61.8k • 7
jlpang888/mistral-instruct-ultrafeedback_sorted_rating_score
Viewer
• Updated • 62.7k • 6
jlpang888/ultrafeedback_sorted_llama_learning_order_new
Viewer
• Updated • 63.1k • 2
jlpang888/arigilla_mix_7k
Viewer
• Updated • 7.5k • 2