LAUNCH Lab

university

https://launch.eecs.umich.edu/

launchnlp

Activity Feed

AI & ML interests

Factuality, reasoning, alignment, LLM applications

Recent Activity

Ayoung01 authored a paper 8 days ago

MET: Theory-Grounded and Culture-Aware Multilingual Moral Reasoning

Ayoung01 authored a paper 8 days ago

LiveOIBench: Can Large Language Models Outperform Human Contestants in Informatics Olympiads?

Ayoung01 authored a paper 8 days ago

Logit Arithmetic Elicits Long Reasoning Capabilities Without Training

View all activity

Papers

MET: Theory-Grounded and Culture-Aware Multilingual Moral Reasoning

Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation

View all Papers

Collections 2

spaces 7

LudoBench

🎲

Multimodal Game Reasoning Benchmark [ICLR 2026]

Answer Convergence Early Stopping

🛑

Demo for EMNLP Paper "Answer Convergence as a Signal..."

FactRBench

🏆

View and analyze long-form factuality leaderboard

ExpertLongBench

🚀

Leaderboard for ExpertLongBench

ManyICLBench

🚀

Leaderboard for ManyICLBench

MLRC-BENCH

📊

Display model performance rankings

View 7 Spaces

models 15

datasets 14

launch/MCLASH

Viewer • Updated 8 days ago • 2.61k • 319

launch/CLASH

Viewer • Updated 20 days ago • 345 • 94 • 3

launch/thinkprm-1K-verification-cots

Viewer • Updated Apr 18 • 1k • 87 • 8

launch/LudoBench

Viewer • Updated Mar 1 • 638 • 39

launch/ExpertLongBench

Preview • Updated Jul 30, 2025 • 109 • 10

launch/ManyICLBench

Viewer • Updated Jun 26, 2025 • 66 • 602 • 1

launch/CMV

Viewer • Updated Jun 26, 2025 • 133 • 20

launch/FactRBench

Viewer • Updated Jun 9, 2025 • 1.06k • 28 • 2

launch/FactBench

Viewer • Updated Jun 9, 2025 • 1k • 65 • 3

launch/gov_report

Viewer • Updated Nov 9, 2022 • 58.4k • 617 • 14

View 14 datasets

AI & ML interests

Recent Activity

Papers

Team members 18

Collections 2

spaces 7 Sort: Recently updated

LudoBench

Answer Convergence Early Stopping

FactRBench

ExpertLongBench

ManyICLBench

MLRC-BENCH

models 15 Sort: Recently updated

datasets 14 Sort: Recently updated

spaces 7

models 15

datasets 14