VoladorLuYu's Collections: LLM Reports
Nemotron-4 15B Technical Report
Paper
• 2402.16819
• Published
• 46
InternLM2 Technical Report
Paper
• 2403.17297
• Published
• 34
Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model
Paper
• 2404.04167
• Published
• 13
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
Paper
• 2402.14905
• Published
• 134
JetMoE: Reaching Llama2 Performance with 0.1M Dollars
Paper
• 2404.07413
• Published
• 38
Chinchilla Scaling: A replication attempt
Paper
• 2404.10102
• Published
• 1
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Paper
• 2404.14219
• Published
• 259
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Paper
• 2405.00732
• Published
• 122
The Prompt Report: A Systematic Survey of Prompting Techniques
Paper
• 2406.06608
• Published
• 68
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
Paper
• 2406.11931
• Published
• 69
Qwen2 Technical Report
Paper
• 2407.10671
• Published
• 168
Training Language Models to Self-Correct via Reinforcement Learning
Paper
• 2409.12917
• Published
• 140