AgentSPEX: An Agent SPecification and EXecution Language Paper • 2604.13346 • Published 19 days ago • 162
AgentSPEX: An Agent SPecification and EXecution Language Paper • 2604.13346 • Published 19 days ago • 162
V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models Paper • 2511.16668 • Published Nov 20, 2025 • 56
GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving Paper • 2510.11769 • Published Oct 13, 2025 • 26