Papers - Reinforcement Learning
Unleashing the Power of Pre-trained Language Models for Offline Reinforcement Learning
Paper • 2310.20587 • Published • 18
SELF: Language-Driven Self-Evolution for Large Language Model
Paper • 2310.00533 • Published • 2
Bigger, Better, Faster: Human-level Atari with human-level efficiency
Paper • 2305.19452 • Published • 4
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
Paper • 2408.08152 • Published • 60
Natural Language Reinforcement Learning
Paper • 2411.14251 • Published • 31
StarCraft II: A New Challenge for Reinforcement Learning
Paper • 1708.04782 • Published • 1
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper • 2501.12948 • Published • 431
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
Paper • 2402.03300 • Published • 138