·
AI & ML interests
None yet
Organizations
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-10
Text Generation
• 1B • Updated
• 5
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-5
Text Generation
• 1B • Updated
• 3
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-0.0001_neftune_alpha-0
Text Generation
• 1B • Updated
• 3
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-10
Text Generation
• 1B • Updated
• 3
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-5
Text Generation
• 1B • Updated
• 3
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-3e-05_neftune_alpha-0
Text Generation
• 1B • Updated
• 3
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-10
Text Generation
• 1B • Updated
• 2
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-5
Text Generation
• 1B • Updated
• 5
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-4_max_lr-1e-05_neftune_alpha-0
Text Generation
• 1B • Updated
• 3
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-10
Text Generation
• 1B • Updated
• 5
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-5
Text Generation
• 1B • Updated
• 5
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-0.0001_neftune_alpha-0
Text Generation
• 1B • Updated
• 3
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-10
Text Generation
• 1B • Updated
• 15
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-5
Text Generation
• 1B • Updated
• 5
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-3e-05_neftune_alpha-0
Text Generation
• 1B • Updated
• 3
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-10
Text Generation
• 1B • Updated
• 2
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-5
Text Generation
• 1B • Updated
• 3
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-2_max_lr-1e-05_neftune_alpha-0
Text Generation
• 1B • Updated
• 5
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-0.0001_neftune_alpha-5
Text Generation
• 1B • Updated
• 3
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-0.0001_neftune_alpha-0
Text Generation
• 1B • Updated
• 3
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-3e-05_neftune_alpha-10
Text Generation
• 1B • Updated
• 3
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-3e-05_neftune_alpha-5
Text Generation
• 1B • Updated
• 4
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-3e-05_neftune_alpha-0
Text Generation
• 1B • Updated
• 5
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-1e-05_neftune_alpha-10
Text Generation
• 1B • Updated
• 5
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-1e-05_neftune_alpha-5
Text Generation
• 1B • Updated
• 5
Mlxa/deepseek-coder-1.3B-kexer_num_epochs-1_max_lr-1e-05_neftune_alpha-0
Text Generation
• 1B • Updated
• 3
Text Classification
• 67M • Updated
Text Generation
• 0.4B • Updated
• 2
Mlxa/TinyStories-8M-DPO-2
Text Generation
• 19.7M • Updated
• 2
Text Generation
• 19.7M • Updated
• 2