Running Agents 230 BigCodeBench Leaderboard 🥇 230 Explore code-generation model leaderboards and task details
Running on CPU Upgrade 20 BigCodeBench Evaluator 🥇 20 Evaluate code samples using specified parameters