samitizerxu/debug_worldsize4
samitizerxu/deepseek-r1-distill-lora-arc • Text Generation • 8B • 1
samitizerxu/deepseek-r1-qwen-7b-naive
samitizerxu/convnext_baseline
samitizerxu/openfwi-baseline
samitizerxu/Deepseek-R1-Distil-7B-Qwen-DPO-keep-v2 • 8B • 1
samitizerxu/DS-7B-Qwen-distil-DPO-keep-v2
samitizerxu/DS-7B-Qwen-distil-DPO-keep • Text Generation • 8B • 5
samitizerxu/DS-7B-Qwen-distil-KTO-keep-awq • 8B • 2
samitizerxu/DS-7B-Qwen-distil-KTO-keep-alt • 8B • 1
samitizerxu/DS-7B-Qwen-distil-KTO-keep • 8B • 1
samitizerxu/DS-distil-keep
samitizerxu/DeepSeek-R1-Distill-Qwen-14B-ft • 15B
samitizerxu/DeepSeek-R1-Distill-Qwen-7B-final
samitizerxu/DeepSeek-R1-Distill-Qwen-7B-e
samitizerxu/DeepSeek-R1-Distill-Qwen-14B-e
samitizerxu/DeepSeek-R1-Distill-Qwen-1.5B-GRPO-finetune
samitizerxu/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
samitizerxu/Qwen2.5-R1-Distill-GRPO-final
samitizerxu/Qwen2.5-R1-Distill-GRPO-h • Text Generation • 15B • 1
samitizerxu/Qwen2.5-R1-Distill-GRPO-em • Text Generation • 15B • 1
samitizerxu/Qwen2.5-R1-Distill-GRPO-h-t
samitizerxu/Qwen2.5-R1-Distill-GRPO-em-t • Text Generation • 15B • 1
samitizerxu/Qwen2.5-R1-Distill-GRPO-baseline
samitizerxu/DeepSeek-R1-Distill-Qwen-14B
samitizerxu/Qwen2.5-1.5B-Open-R1-Code-GRPO
samitizerxu/Qwen-2.5-7B-Simple-RL
samitizerxu/longformer_baseline_cer • Token Classification • 0.1B • 2
samitizerxu/segformer-b3-from-scratch-final • Image Segmentation • 47.2M • 14
samitizerxu/kelp-from-scratch-segformer-b1-02-10