Checkpoints from my first 124M LLM pre-training project, covering scratch training, continued pre-training, and SFT experiments.
Mrinaal Arora
mrinaalarora
AI & ML interests
None yet
Recent Activity
updated a collection 20 days ago
124M-Base-Experiments updated a model 20 days ago
mrinaalarora/mrinaal-124m-instruct-v3-mathmix-smoltalk-150k published a model 20 days ago
mrinaalarora/mrinaal-124m-instruct-v3-mathmix-smoltalk-150k