ai2-transfer

non-profit

AI & ML interests

None defined yet.

Recent Activity

sunyiyou authored a paper about 17 hours ago

OpenOOD v1.5: Enhanced Benchmark for Out-of-Distribution Detection

sunyiyou authored a paper about 17 hours ago

Scattered Forest Search: Smarter Code Space Exploration with LLMs

sunyiyou authored a paper about 17 hours ago

Can LLMs Design Good Questions Based on Context?

sunyiyou

authored 9 papers about 17 hours ago

Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?

Paper • 2504.11741 • Published Apr 16, 2025 • 1

OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization

Paper • 2506.18880 • Published Jun 23, 2025 • 4

Can Aha Moments Be Fake? Identifying True and Decorative Thinking Steps in Chain-of-Thought

Paper • 2510.24941 • Published Oct 28, 2025 • 4

Unsafer in Many Turns: Benchmarking and Defending Multi-Turn Safety Risks in Tool-Using Agents

Paper • 2602.13379 • Published Feb 13 • 3

Rethinking Domain Generalization for Face Anti-spoofing: Separability and Alignment

Paper • 2303.13662 • Published Mar 23, 2023

Agents' Last Exam

Paper • 2606.05405 • Published 6 days ago • 2

tairaa

authored a paper 2 months ago

MolmoPoint: Better Pointing for VLMs with Grounding Tokens

Paper • 2603.28069 • Published Mar 30 • 9

soldni

authored 10 papers 4 months ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 22

Organize the Web: Constructing Domains Enhances Pre-Training Data Curation

Paper • 2502.10341 • Published Feb 14, 2025 • 3

olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models

Paper • 2502.18443 • Published Feb 25, 2025 • 12

DataDecide: How to Predict Best Pretraining Data with Small Experiments

Paper • 2504.11393 • Published Apr 15, 2025 • 20

Teaching Models to Understand (but not Generate) High-risk Data

Paper • 2505.03052 • Published May 5, 2025 • 6

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5, 2025 • 61

FlexOlmo: Open Language Models for Flexible Data Use

Paper • 2507.07024 • Published Jul 9, 2025 • 10

olmOCR 2: Unit Test Rewards for Document OCR

Paper • 2510.19817 • Published Oct 22, 2025 • 17

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24, 2025 • 63

Olmo 3

Paper • 2512.13961 • Published Dec 15, 2025 • 35

Team members 3
