Running 174 The ultimate guide to RL environments: building and scaling them in the LLM era 📝 174 Building and scaling RL environments for LLM training
Running on CPU Upgrade Featured 3.19k The Smol Training Playbook 📚 3.19k The secrets to building world-class LLMs
NetherlandsForensicInstitute/ARM64BERT-embedding Sentence Similarity • 87.8M • Updated about 1 month ago • 291 • 8
Menlo/ReZero-v0.1-llama-3.2-3b-it-grpo-250404 Text Generation • 3B • Updated Apr 17, 2025 • 135 • • 63