arxiv:2504.15477
Rohan Surana
rohan2810
AI & ML interests
None yet
Recent Activity
upvoted a paper 6 minutes ago
MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization
submitted a paper about 19 hours ago
F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking
Organizations
None yet