Paper: [KLUE: Korean Language Understanding Evaluation](https://arxiv.org/abs/2105.09680)
This model is a fine-tuned version of klue/roberta-base on the KLUE dataset. It achieves the results shown in the training results table below on the evaluation set.

klue/roberta-base is a RoBERTa model pretrained on Korean. See the GitHub repository and the paper for more details.
NOTE: Use `BertTokenizer` instead of `RobertaTokenizer` (`AutoTokenizer` will load `BertTokenizer` for this checkpoint).
```python
from transformers import AutoModel, AutoTokenizer

model = AutoModel.from_pretrained("klue/roberta-base")
tokenizer = AutoTokenizer.from_pretrained("klue/roberta-base")
```
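A minimal usage sketch of encoding a sentence with this checkpoint; the example sentence and the use of `torch.no_grad()` are illustrative and not part of the original card:

```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("klue/roberta-base")
model = AutoModel.from_pretrained("klue/roberta-base")

# Encode a Korean sentence; AutoTokenizer resolves to the BERT-style
# tokenizer for this checkpoint, as noted above.
inputs = tokenizer("한국어 문장을 인코딩합니다.", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# Contextual embeddings: (batch_size, sequence_length, hidden_size)
print(outputs.last_hidden_state.shape)
```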
The following training results were obtained on the evaluation set, reported at the end of each epoch:
| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
|---|---|---|---|---|---|---|---|
| 0.0449 | 1.0 | 2626 | 0.0601 | 0.9361 | 0.9176 | 0.9267 | 0.9830 |
| 0.0262 | 2.0 | 5252 | 0.0469 | 0.9484 | 0.9510 | 0.9497 | 0.9874 |
| 0.0144 | 3.0 | 7878 | 0.0487 | 0.9546 | 0.9557 | 0.9552 | 0.9884 |
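The precision, recall, F1, and accuracy columns above are consistent with a token-classification fine-tune. As a sketch of how such a head can be attached to the base checkpoint (the label count, task, and training setup are assumptions, since the card does not state them):

```python
from transformers import AutoModelForTokenClassification, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("klue/roberta-base")

# num_labels is a placeholder; the actual label set of the fine-tuned run
# is not given in this card.
model = AutoModelForTokenClassification.from_pretrained(
    "klue/roberta-base",
    num_labels=13,
)
# Fine-tuning itself (e.g. with transformers.Trainer on a KLUE task) would
# follow the original run's hyperparameters, which are not reproduced here.
```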
Base model: klue/roberta-base