Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shaobai Jiang's picture
4 1161

Shaobai Jiang

shaobaij
0xSojalSec's profile picture 21world's profile picture Diluner's profile picture
ยท

AI & ML interests

None yet

Recent Activity

upvoted a paper about 6 hours ago
Evaluating Parameter Efficient Methods for RLVR
upvoted a paper about 21 hours ago
Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL
upvoted a paper 1 day ago
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration
View all activity

Organizations

None yet

New activity in openai/gpt-oss-120b 6 months ago

Fine tune 120b at 8 H100s getting cuda OOM error

๐Ÿ‘€ 1
6
#117 opened 6 months ago by
jinxu88

FlashInfer requires sm75+

7
#48 opened 6 months ago by
hrithiksagar-tih
New activity in mistralai/Mistral-7B-v0.1 over 2 years ago

If I trained a model on mistral already, do I need to start from scratch due to difficulties of fine-tuning?

2
#62 opened over 2 years ago by
brando

Cant run the model with the most basic code

๐Ÿ‘ 6
6
#7 opened over 2 years ago by
masterchop
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs