SciEval: A Multi-Level Large Language Model Evaluation Benchmark for Scientific Research • Paper • 2308.13149 • Published Aug 25, 2023
SciDFM: A Large Language Model with Mixture-of-Experts for Science • Paper • 2409.18412 • Published Sep 27, 2024
CATP: Contextually Adaptive Token Pruning for Efficient and Enhanced Multimodal In-Context Learning • Paper • 2508.07871 • Published Aug 11, 2025
OpenAgents: An Open Platform for Language Agents in the Wild • Paper • 2310.10634 • Published Oct 16, 2023 • 9
LayoutReader: Pre-training of Text and Layout for Reading Order Detection • Paper • 2108.11591 • Published Aug 26, 2021 • 1
VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos • Paper • 2510.19488 • Published Oct 22, 2025 • 19
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents • Paper • 2510.24702 • Published Oct 28, 2025 • 28
Grounding Computer Use Agents on Human Demonstrations • Paper • 2511.07332 • Published Nov 10, 2025 • 105