Joschka Strüber

Klingspor

the-klingspor

AI & ML interests

None yet

Recent Activity

upvoted an article 3 days ago

DenseR: Dense Rewards For Free in LLM Reasoning

updated a model 8 days ago

Klingspor/StarPO-4B

updated a model 8 days ago

Klingspor/StarPO-1.7B

View all activity

Organizations

upvoted an article 3 days ago

Article

DenseR: Dense Rewards For Free in LLM Reasoning

3 days ago

•

upvoted a collection 8 days ago

Delta Belief RL

Collection

Collection of the models for our paper "Intrinsic Credit Assignment for Long Horizon Interaction" • 6 items • Updated 8 days ago • 1

upvoted an article 10 months ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

Apr 18, 2025

•

upvoted an article 11 months ago

Article

Open R1: Update #3

Mar 11, 2025

•

296

upvoted 2 papers 12 months ago

Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation

Paper • 2502.19414 • Published Feb 26, 2025 • 20

Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs

Paper • 2502.19413 • Published Feb 26, 2025 • 22

upvoted a paper about 1 year ago

Great Models Think Alike and this Undermines AI Oversight

Paper • 2502.04313 • Published Feb 6, 2025 • 33

Joschka Strüber

AI & ML interests

Recent Activity

Organizations

Klingspor's activity

DenseR: Dense Rewards For Free in LLM Reasoning

Gotchas in Tokenizer Behavior Every Developer Should Know

Open R1: Update #3