Mark's picture

Mark

Makrrr

·

AI & ML interests

NLP, RLHF, IR

Recent Activity

upvoted a paper 3 days ago

Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

upvoted a paper 28 days ago

SkillOS: Learning Skill Curation for Self-Evolving Agents

updated a model about 1 month ago

CL-From-Nothing/Qwen3-4B-SSD-RLVE-Eval20-N20-global-step-500

View all activity

Organizations

liked a model 11 months ago

Makrrr/Qwen3-1.7B-GSM8K-GRPO-verl

Reinforcement Learning • 2B • Updated Jul 5, 2025 • 93 • 3

liked 3 Spaces about 1 year ago

Check My Progress Deep RL Course

Check your progress in a Deep RL course

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

FineWeb: decanting the web for the finest text data at scale

Explore and download the FineWeb web‑scale text dataset