How to use chansung/coding_llamaduo_60k_v0.2 with PEFT:

```python
from peft import PeftModel
from transformers import AutoModelForCausalLM

# Load the base model, then attach the fine-tuned PEFT adapter to it
base_model = AutoModelForCausalLM.from_pretrained("google/gemma-7b")
model = PeftModel.from_pretrained(base_model, "chansung/coding_llamaduo_60k_v0.2")
```

This model is a fine-tuned version of google/gemma-7b on the chansung/merged_ds_coding dataset. On the evaluation set, the validation loss after the final (10th) epoch is 1.3326.
More information needed
Training and validation loss at the end of each epoch:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 0.7499 | 1.0 | 126 | 1.2580 |
| 0.6058 | 2.0 | 252 | 1.1687 |
| 0.5571 | 3.0 | 378 | 1.1492 |
| 0.5118 | 4.0 | 504 | 1.1551 |
| 0.4711 | 5.0 | 630 | 1.1767 |
| 0.4287 | 6.0 | 756 | 1.1948 |
| 0.3943 | 7.0 | 882 | 1.2383 |
| 0.3612 | 8.0 | 1008 | 1.2904 |
| 0.3457 | 9.0 | 1134 | 1.3253 |
| 0.3328 | 10.0 | 1260 | 1.3326 |
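
One way to read the table above: training loss falls at every epoch, while validation loss is lowest at epoch 3 and rises afterward, which is a common sign of overfitting. The snippet below is a minimal sketch for locating that minimum. It is not part of the original training code, the numbers are copied from the table, and the names `results` and `best_epoch` are hypothetical.

```python
# (epoch, step, training_loss, validation_loss), copied from the table above
results = [
    (1, 126, 0.7499, 1.2580),
    (2, 252, 0.6058, 1.1687),
    (3, 378, 0.5571, 1.1492),
    (4, 504, 0.5118, 1.1551),
    (5, 630, 0.4711, 1.1767),
    (6, 756, 0.4287, 1.1948),
    (7, 882, 0.3943, 1.2383),
    (8, 1008, 0.3612, 1.2904),
    (9, 1134, 0.3457, 1.3253),
    (10, 1260, 0.3328, 1.3326),
]


def best_epoch(rows):
    """Return the row with the lowest validation loss (hypothetical helper)."""
    return min(rows, key=lambda row: row[3])


best = best_epoch(results)
final = results[-1]

print(f"lowest validation loss: {best[3]:.4f} at epoch {best[0]} (step {best[1]})")
print(f"final validation loss:  {final[3]:.4f} at epoch {final[0]} (step {final[1]})")
print(f"increase after the best epoch: {final[3] - best[3]:.4f}")
```

The table alone does not say which checkpoint the published adapter corresponds to, so this only describes the loss curve, not the released weights.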