---
title: Qwen3 Reversed (DPO)
emoji: "🤖"
colorFrom: indigo
colorTo: purple
sdk: gradio
python_version: "3.10"
sdk_version: 4.44.1
app_file: app.py
models:
  - akseljoonas/qwen3-4b-dpo-hh-rlhf-reversed
short_description: Qwen3-4B DPO demo on ZeroGPU
startup_duration_timeout: 45m
---

# Qwen3-4B DPO (HH RLHF Reversed)

This Space serves **akseljoonas/qwen3-4b-dpo-hh-rlhf-reversed** with a simple Gradio UI.

## Notes
- On ZeroGPU, a GPU is allocated only while a function decorated with `@spaces.GPU` is running.
- `app.py` sets the default GPU duration to 120 seconds.
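
The notes above can be sketched roughly as follows. This is a minimal illustration, not the actual contents of `app.py`: the `generate` function and its echo body are hypothetical placeholders, and the `ImportError` fallback is an assumption added so the snippet also runs outside of Spaces, where the `spaces` package may not be installed.

```python
# Sketch of requesting a ZeroGPU slot for up to 120 seconds per call.
# Assumption: `generate` is a hypothetical name; the real app.py may differ.
try:
    import spaces  # provided in the Hugging Face Spaces runtime

    gpu = spaces.GPU(duration=120)  # decorator requesting up to 120 s of GPU time
except ImportError:
    # Local fallback (assumption): no ZeroGPU available, so leave the function unchanged.
    def gpu(fn):
        return fn


@gpu
def generate(prompt: str) -> str:
    # Placeholder body: a real app would run the model's generation here,
    # since GPU work must happen inside the decorated function.
    return f"echo: {prompt}"
```

In a real Space, the decorated function would typically be the callback passed to the Gradio interface, so each user request acquires the GPU only for the duration of that call.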

Model: https://huggingface.co/akseljoonas/qwen3-4b-dpo-hh-rlhf-reversed