Update README.md
Browse files
README.md
CHANGED
|
@@ -132,3 +132,144 @@ You should use `figma` to trigger the image generation.
|
|
| 132 |
Weights for this model are available in Safetensors format.
|
| 133 |
|
| 134 |
[Download](/p1atdev/animagine-xl-4.0-opt-pvc-style-lora/tree/main) them in the Files & versions tab.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 132 |
Weights for this model are available in Safetensors format.
|
| 133 |
|
| 134 |
[Download](/p1atdev/animagine-xl-4.0-opt-pvc-style-lora/tree/main) them in the Files & versions tab.
|
| 135 |
+
|
| 136 |
+
## Dataset
|
| 137 |
+
|
| 138 |
+
About 7000 of pvc figure images.
|
| 139 |
+
|
| 140 |
+
## Training
|
| 141 |
+
|
| 142 |
+
### Training code
|
| 143 |
+
|
| 144 |
+
https://github.com/p1atdev/vision-ft
|
| 145 |
+
|
| 146 |
+
### Training settings
|
| 147 |
+
|
| 148 |
+
|
| 149 |
+
<details>
|
| 150 |
+
<summary>`config.yml`</summary>
|
| 151 |
+
|
| 152 |
+
```yml
|
| 153 |
+
model:
|
| 154 |
+
checkpoint_path: "./models/animagine-xl-4.0-opt.bnb_nf4.safetensors"
|
| 155 |
+
pretrained_model_name_or_path: cagliostrolab/animagine-xl-4.0
|
| 156 |
+
|
| 157 |
+
dtype: bfloat16
|
| 158 |
+
|
| 159 |
+
denoiser:
|
| 160 |
+
attn_implementation: "flash_attention_2"
|
| 161 |
+
|
| 162 |
+
peft:
|
| 163 |
+
config:
|
| 164 |
+
type: lora
|
| 165 |
+
rank: 2
|
| 166 |
+
alpha: 1.0
|
| 167 |
+
dropout: 0.0
|
| 168 |
+
|
| 169 |
+
dtype: bfloat16
|
| 170 |
+
|
| 171 |
+
include_keys:
|
| 172 |
+
- "attn1"
|
| 173 |
+
- "attn2"
|
| 174 |
+
- ".ff."
|
| 175 |
+
exclude_keys: ["text_encoder", "vae"]
|
| 176 |
+
|
| 177 |
+
dataset:
|
| 178 |
+
folder: "data/pvc"
|
| 179 |
+
num_repeats: 4
|
| 180 |
+
batch_size: 4
|
| 181 |
+
|
| 182 |
+
bucket_base_size: 1024
|
| 183 |
+
step: 128
|
| 184 |
+
min_size: 384
|
| 185 |
+
do_upscale: true
|
| 186 |
+
|
| 187 |
+
caption_processors:
|
| 188 |
+
- type: shuffle
|
| 189 |
+
split_separator: ","
|
| 190 |
+
|
| 191 |
+
optimizer:
|
| 192 |
+
name: "schedulefree.RAdamScheduleFree"
|
| 193 |
+
args:
|
| 194 |
+
lr: 0.005
|
| 195 |
+
|
| 196 |
+
scheduler:
|
| 197 |
+
# name: "torch.optim.lr_scheduler.ConstantLR"
|
| 198 |
+
# args: {}
|
| 199 |
+
|
| 200 |
+
tracker:
|
| 201 |
+
project_name: "sdxl-pvc-1"
|
| 202 |
+
loggers:
|
| 203 |
+
- wandb
|
| 204 |
+
|
| 205 |
+
saving:
|
| 206 |
+
strategy:
|
| 207 |
+
per_epochs: 0.25
|
| 208 |
+
per_steps: null
|
| 209 |
+
save_last: true
|
| 210 |
+
|
| 211 |
+
callbacks:
|
| 212 |
+
- type: "safentensors"
|
| 213 |
+
name: "sdxl-pvc"
|
| 214 |
+
save_dir: "./output/sdxl-pvc-2"
|
| 215 |
+
|
| 216 |
+
preview:
|
| 217 |
+
strategy:
|
| 218 |
+
per_epochs: 1
|
| 219 |
+
per_steps: 100
|
| 220 |
+
|
| 221 |
+
callbacks:
|
| 222 |
+
- type: "local"
|
| 223 |
+
save_dir: "./output/sdxl-lora/preview"
|
| 224 |
+
|
| 225 |
+
data:
|
| 226 |
+
path: "./projects/pvc-sdxl/preview.yml"
|
| 227 |
+
|
| 228 |
+
seed: 42
|
| 229 |
+
num_train_epochs: 10
|
| 230 |
+
|
| 231 |
+
trainer:
|
| 232 |
+
# debug_mode: "1step"
|
| 233 |
+
|
| 234 |
+
gradient_checkpointing: true
|
| 235 |
+
|
| 236 |
+
torch_compile: true
|
| 237 |
+
torch_compile_args:
|
| 238 |
+
backend: eager
|
| 239 |
+
mode: default
|
| 240 |
+
fullgraph: false
|
| 241 |
+
|
| 242 |
+
fp32_matmul_precision: "high"
|
| 243 |
+
allow_tf32: true
|
| 244 |
+
```
|
| 245 |
+
|
| 246 |
+
</details>
|
| 247 |
+
|
| 248 |
+
<details>
|
| 249 |
+
<summary>`preview.yml` </summary>
|
| 250 |
+
|
| 251 |
+
```yml
|
| 252 |
+
- prompt: |-
|
| 253 |
+
pvc style, 1girl, aqua eyes, baseball cap, blonde hair, closed mouth, earrings,
|
| 254 |
+
green background, hat, hoop earrings, jewelry, looking at viewer,
|
| 255 |
+
shirt, short hair, simple background, solo, upper body, yellow shirt,
|
| 256 |
+
masterpiece, high score, great score, absurdres
|
| 257 |
+
negative_prompt: |-
|
| 258 |
+
lowres, bad anatomy, bad hands, text, error, missing finger, extra digits,
|
| 259 |
+
fewer digits, cropped, worst quality, low quality, low score, bad score,
|
| 260 |
+
average score, signature, watermark, username, blurry
|
| 261 |
+
height: 1024
|
| 262 |
+
width: 1024
|
| 263 |
+
cfg_scale: 5.0
|
| 264 |
+
num_steps: 20
|
| 265 |
+
|
| 266 |
+
seed: 0
|
| 267 |
+
```
|
| 268 |
+
|
| 269 |
+
</details>
|
| 270 |
+
|
| 271 |
+
The training was stopped in the 9th epoch, because of my room was too hot due to this training.
|
| 272 |
+
|
| 273 |
+
### Device and Hours
|
| 274 |
+
|
| 275 |
+
RTX 4070 Ti Super (VRAM 16GB) for 27 hours.
|