Shijun Dai

Shijund

AI & ML interests

None yet

Organizations

None yet

Collections 3

rpgllm
  • CoSER: Coordinating LLM-Based Persona Simulation of Established Roles

    Paper • 2502.09082 • Published Feb 13, 2025 • 30
LLM Alignment
  • WARM: On the Benefits of Weight Averaged Reward Models

    Paper • 2401.12187 • Published Jan 22, 2024 • 19
  • Self-Rewarding Language Models

    Paper • 2401.10020 • Published Jan 18, 2024 • 151
  • Secrets of RLHF in Large Language Models Part II: Reward Modeling

    Paper • 2401.06080 • Published Jan 11, 2024 • 28

models 0

None public yet

datasets 0

None public yet