koutch/short_paper_llama_0.json_train_dpo_v2_dev Text Generation • 8B • Updated about 7 hours ago • 20
koutch/short_paper_qwent_0.json_train_dpo_v2_dev Text Generation • 4B • Updated about 8 hours ago • 11
koutch/short_paper_qwen_0.json_train_dpo_v2_dev Text Generation • 4B • Updated about 8 hours ago • 14
koutch/short_paper_llama_0.json_train_dpo_v1_dev Text Generation • 8B • Updated about 8 hours ago • 11
koutch/short_paper_qwent_0.json_train_dpo_v1_dev Text Generation • 4B • Updated about 8 hours ago • 14
koutch/short_paper_smol_0.json_train_dpo_v2_dev Text Generation • 3B • Updated about 8 hours ago • 31
koutch/short_paper_qwen_0.json_train_dpo_v1_dev Text Generation • 4B • Updated about 8 hours ago • 17
koutch/short_paper_llama_llama3.1-8b_train_sft_all_train_no_think Text Generation • 8B • Updated about 9 hours ago • 40
koutch/short_paper_smol_0.json_train_dpo_v1_dev Text Generation • 3B • Updated about 9 hours ago • 36
koutch/short_paper_qwen_qwen3-instruct-4b_train_sft_all_train_no_think Text Generation • 4B • Updated about 9 hours ago • 38
koutch/short_paper_qwent_qwen3-thinking-4b_train_sft_all_train_no_think Text Generation • 4B • Updated about 9 hours ago • 36
koutch/short_paper_smol_smol3-3B_train_sft_all_train_no_think Text Generation • 3B • Updated about 9 hours ago • 38
koutch/short_paper_qwent_0.json_train_dpo_v2_dev Text Generation • 4B • Updated about 8 hours ago • 11
koutch/short_paper_qwent_0.json_train_dpo_v1_dev Text Generation • 4B • Updated about 8 hours ago • 14
koutch/short_paper_qwen_0.json_train_dpo_v2_dev Text Generation • 4B • Updated about 8 hours ago • 14
koutch/short_paper_qwen_0.json_train_dpo_v1_dev Text Generation • 4B • Updated about 8 hours ago • 17
koutch/short_paper_qwen_0.json_train_dpo_v1_dev Text Generation • 4B • Updated about 8 hours ago • 17
koutch/short_paper_llama_0.json_train_dpo_v1_dev Text Generation • 8B • Updated about 8 hours ago • 11
koutch/short_paper_llama_0.json_train_dpo_v2_dev Text Generation • 8B • Updated about 7 hours ago • 20
koutch/short_paper_llama_0.json_train_dpo_v1_dev Text Generation • 8B • Updated about 8 hours ago • 11