llama-duo/synth_summarize_dataset_dedup
Viewer • Updated • 880k • 945 • 2
How to use llama-duo/gemma7b-summarize-gemini1_5flash-16k with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("google/gemma-7b")
model = PeftModel.from_pretrained(base_model, "llama-duo/gemma7b-summarize-gemini1_5flash-16k")This model is a fine-tuned version of google/gemma-7b on the llama-duo/synth_summarize_dataset_dedup dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 20.3933 | 0.9811 | 26 | 7.9476 |
| 5.2781 | 2.0 | 53 | 4.2830 |
| 1.5712 | 2.9811 | 79 | 2.9235 |
| 1.2696 | 4.0 | 106 | 2.7524 |
| 1.1842 | 4.9811 | 132 | 2.7035 |
| 1.1359 | 6.0 | 159 | 2.6779 |
| 1.1053 | 6.9811 | 185 | 2.6764 |
| 1.0828 | 8.0 | 212 | 2.6683 |
| 1.0853 | 8.9811 | 238 | 2.6656 |
| 1.0794 | 9.8113 | 260 | 2.6632 |
Base model
google/gemma-7b