P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling Paper • 2602.12116 • Published 4 days ago • 4
ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking Paper • 2601.06487 • Published Jan 10 • 52
MOA: Multi-Objective Alignment for Role-Playing Agents Paper • 2512.09756 • Published Dec 10, 2025 • 5
ChARM: Character-based Act-adaptive Reward Modeling for Advanced Role-Playing Language Agents Paper • 2505.23923 • Published May 29, 2025 • 8
OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction Paper • 2505.20277 • Published May 26, 2025