Mingyu Derek Ma's picture

5 4

Mingyu Derek Ma

derekma

·

https://derek.ma

AI & ML interests

Generative Language Model, Scientific LM, Clinical LM, Decoding

Recent Activity

liked a model about 2 hours ago

karina-zadorozhny/ume

upvoted an article about 2 hours ago

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

liked a model 12 months ago

deepseek-ai/DeepSeek-R1

View all activity

Organizations

Papers 9

arxiv:2407.01231

arxiv:2406.09923

arxiv:2406.09411

arxiv:2401.12255

models 1

derekma/extreme-heat-relevancy

Text Classification • Updated May 5, 2023 • 3

datasets 0

None public yet