Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
kokolamba
/
moe-mha
like
0
PyTorch
sparse_subspace_decoder
Model card
Files
Files and versions
xet
Community
main
moe-mha
1.19 GB
1 contributor
History:
2 commits
kokolamba
Upload best checkpoint from config
02354e5
verified
3 months ago
.gitattributes
1.52 kB
initial commit
3 months ago
config.json
1.22 kB
Upload best checkpoint from config
3 months ago
merges.txt
456 kB
Upload best checkpoint from config
3 months ago
optimizer.pt
763 MB
xet
Upload best checkpoint from config
3 months ago
pytorch_model.bin
419 MB
xet
Upload best checkpoint from config
3 months ago
rng_state.pth
14.2 kB
xet
Upload best checkpoint from config
3 months ago
scheduler.pt
1.06 kB
xet
Upload best checkpoint from config
3 months ago
special_tokens_map.json
131 Bytes
Upload best checkpoint from config
3 months ago
tokenizer.json
3.56 MB
Upload best checkpoint from config
3 months ago
tokenizer_config.json
507 Bytes
Upload best checkpoint from config
3 months ago
trainer_state.json
13.9 kB
Upload best checkpoint from config
3 months ago
training_args.bin
5.5 kB
xet
Upload best checkpoint from config
3 months ago
vocab.json
798 kB
Upload best checkpoint from config
3 months ago