Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shijund 's Collections
rpgllm
LLM Alignment
Transformers

LLM Alignment

updated Jan 30, 2024
Upvote
-

  • WARM: On the Benefits of Weight Averaged Reward Models

    Paper • 2401.12187 • Published Jan 22, 2024 • 19

  • Self-Rewarding Language Models

    Paper • 2401.10020 • Published Jan 18, 2024 • 151

  • Secrets of RLHF in Large Language Models Part II: Reward Modeling

    Paper • 2401.06080 • Published Jan 11, 2024 • 28
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs