Delta Belief RL Collection Collection of the models for our paper "Intrinsic Credit Assignment for Long Horizon Interaction" • 6 items • Updated 8 days ago • 1
Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation Paper • 2502.19414 • Published Feb 26, 2025 • 20
Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs Paper • 2502.19413 • Published Feb 26, 2025 • 22
Great Models Think Alike and this Undermines AI Oversight Paper • 2502.04313 • Published Feb 6, 2025 • 33