EM-RAFT

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

FlippyDora authored a paper 10 days ago

EscapeBench: Towards Advancing Creative Intelligence of Language Model Agents

FlippyDora authored a paper 10 days ago

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

FlippyDora authored a paper 10 days ago

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

View all activity

FlippyDora

authored 5 papers 10 days ago

EscapeBench: Towards Advancing Creative Intelligence of Language Model Agents

Paper • 2412.13549 • Published Dec 18, 2024

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving

Paper • 2510.11769 • Published Oct 13, 2025 • 26

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Paper • 2510.12693 • Published Oct 14, 2025 • 28

Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models

Paper • 2603.13985 • Published Mar 14 • 10

AgentSPEX: An Agent SPecification and EXecution Language

Paper • 2604.13346 • Published 18 days ago • 162

FlippyDora

submitted a paper to Daily Papers about 2 months ago

Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models

Paper • 2603.13985 • Published Mar 14 • 10

FlippyDora

authored a paper 4 months ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

Paper • 2601.10201 • Published Jan 15 • 9

FlippyDora

submitted a paper to Daily Papers 4 months ago

PRL: Process Reward Learning Improves LLMs' Reasoning Ability and Broadens the Reasoning Boundary

Paper • 2601.10201 • Published Jan 15 • 9

FlippyDora

updated a model 9 months ago

ScaleML-RLHF/Llama-1B-em-raftpp-iter4

1B • Updated Jul 29, 2025 • 2

FlippyDora

published a model 9 months ago

ScaleML-RLHF/Llama-1B-em-raftpp-iter4

1B • Updated Jul 29, 2025 • 2

FlippyDora

updated a model 9 months ago

ScaleML-RLHF/Llama-1B-em-raftpp-iter10

1B • Updated Jul 29, 2025 • 2

FlippyDora

published a model 9 months ago

ScaleML-RLHF/Llama-1B-em-raftpp-iter10

1B • Updated Jul 29, 2025 • 2

FlippyDora

updated 2 models 9 months ago

ScaleML-RLHF/Llama-3B-em-raftpp-iter6

4B • Updated Jul 29, 2025 • 1

ScaleML-RLHF/Llama-3B-em-raftpp-iter5

4B • Updated Jul 29, 2025 • 2

FlippyDora

published a model 9 months ago

ScaleML-RLHF/Llama-3B-em-raftpp-iter6

4B • Updated Jul 29, 2025 • 1

FlippyDora

updated a model 9 months ago

ScaleML-RLHF/Llama-3B-em-grpo-iter8

4B • Updated Jul 29, 2025 • 1

FlippyDora

published 2 models 9 months ago

ScaleML-RLHF/Llama-3B-em-raftpp-iter5

4B • Updated Jul 29, 2025 • 2

ScaleML-RLHF/Llama-3B-em-grpo-iter8

4B • Updated Jul 29, 2025 • 1

FlippyDora

updated a model 9 months ago

ScaleML-RLHF/Llama-3B-em-raftpp-iter4

4B • Updated Jul 29, 2025 • 2

FlippyDora

published a model 9 months ago

ScaleML-RLHF/Llama-3B-em-raftpp-iter4

4B • Updated Jul 29, 2025 • 2

AI & ML interests

Recent Activity

Team members 1

ScaleML-RLHF's activity