MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering Paper • 2601.22859 • Published 6 days ago • 13
Post: Baidu + Transformers + Hugging Face = Pure Magic! ✨ We got this nice gift from Hugging Face. @xianbao
ChessArena: A Chess Testbed for Evaluating Strategic Reasoning Capabilities of Large Language Models Paper • 2509.24239 • Published Sep 29, 2025 • 3