Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

Tencent-Hunyuan-Multimodal-RL

company
https://TODO
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

cheese1  authored a paper about 21 hours ago
Reinforcing Few-step Generators via Reward-Tilted Distribution Matching
cheese1  authored a paper about 21 hours ago
Few-Step Diffusion Sampling Through Instance-Aware Discretizations
cheese1  authored a paper about 21 hours ago
Improving Diffusion Generalization with Weak-to-Strong Segmented Guidance
View all activity

Papers

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

View all Papers

Xiangxin Zhou's profile pictureLazy Beaver's profile pictureBoye Niu's profile pictureRuoyu's profile pictureJiarui Yao's profile pictureJiaqi Tang's profile pictureTianyu Pang's profile picturePU JIAN's profile picturesumail's profile pictureLvfang Tao's profile pictureHuanjinYao's profile picture
Tencent-Hunyuan-Multimodal-RL 's papers 3
Submitted by
Tianyu Pang
41

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

Tencent-Hunyuan-Multimodal-RL Tencent-Hunyuan-Multimodal-RL
3
Submitted by
Xiangxin Zhou
42

Beyond Uniform Token-Level Trust Region in LLM Reinforcement Learning

Tencent-Hunyuan-Multimodal-RL Tencent-Hunyuan-Multimodal-RL
3
Submitted by
Xiangxin Zhou
33

Rethinking the Divergence Regularization in LLM RL

Tencent-Hunyuan-Multimodal-RL Tencent-Hunyuan-Multimodal-RL
718 4
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs