qwen3-reversed / README.md
akseljoonas's picture
akseljoonas HF Staff
Upload README.md
bbfb9b6 verified

A newer version of the Gradio SDK is available: 6.8.0

Upgrade
metadata
title: Qwen3 Reversed (DPO)
emoji: 🤖
colorFrom: indigo
colorTo: purple
sdk: gradio
python_version: '3.10'
sdk_version: 4.44.1
app_file: app.py
models:
  - akseljoonas/qwen3-4b-dpo-hh-rlhf-reversed
short_description: Qwen3-4B DPO demo on ZeroGPU
startup_duration_timeout: 45m

Qwen3-4B DPO (HH RLHF Reversed)

This Space serves akseljoonas/qwen3-4b-dpo-hh-rlhf-reversed with a simple Gradio UI.

Notes

  • ZeroGPU requires @spaces.GPU for GPU allocation.
  • Default GPU duration set to 120s in app.py.

Model: https://huggingface.co/akseljoonas/qwen3-4b-dpo-hh-rlhf-reversed