Resources for hybrid preferences research where we learn how to route preference instances for human vs. AI feedback
Lj V. Miranda
ljvmiranda921
AI & ML interests
NLP - multilinguality, data-centric AI
Recent Activity
updated a dataset about 3 hours ago: ljvmiranda921/rl-kalamang-R1
published a dataset about 4 hours ago: ljvmiranda921/rl-kalamang-R1
updated a dataset 3 days ago: ljvmiranda921/details_msde-google_gemma-3-4b-pt-lora-4bit-tgl_25k-Gemma3