1 10 1

shawnxzhu

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall

upvoted a paper 3 days ago

On Data Engineering for Scaling LLM Terminal Capabilities

authored a paper 4 days ago

CHARM: Calibrating Reward Models With Chatbot Arena Scores

View all activity

Organizations

upvoted a paper 1 day ago

ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall

Paper • 2510.07896 • Published Oct 9, 2025 • 8

upvoted a paper 3 days ago

On Data Engineering for Scaling LLM Terminal Capabilities

Paper • 2602.21193 • Published 3 days ago • 87

authored 2 papers 4 days ago

CHARM: Calibrating Reward Models With Chatbot Arena Scores

Paper • 2504.10045 • Published Apr 14, 2025

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published 23 days ago • 21

updated a collection 5 days ago

CodeScaler

Collection

6 items • Updated 5 days ago • 4

upvoted a paper 5 days ago

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

Paper • 2602.17684 • Published 23 days ago • 21

upvoted a collection 5 days ago

CodeScaler

Collection

6 items • Updated 5 days ago • 4

published 3 models 5 days ago

published a dataset 5 days ago

LARK-Lab/CodeScalerPair-51K

Viewer • Updated 5 days ago • 51.1k • 26 • 1

updated a dataset 5 days ago

LARK-Lab/CodeScalerPair-51K

Viewer • Updated 5 days ago • 51.1k • 26 • 1

updated 3 models 5 days ago

LARK-Lab/CodeScaler-8B

Text Classification • 8B • Updated 5 days ago • 10

LARK-Lab/CodeScaler-4B

Text Classification • 4B • Updated 5 days ago • 12

LARK-Lab/CodeScaler-1.7B

Text Classification • 2B • Updated 5 days ago • 9

upvoted a paper 13 days ago

Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model

Paper • 2602.07422 • Published 21 days ago • 22

upvoted a paper 19 days ago

MSign: An Optimizer Preventing Training Instability in Large Language Models via Stable Rank Restoration

Paper • 2602.01734 • Published 26 days ago • 32

New activity in LARK-Lab/CodeScalerPair-51K 24 days ago

Update README.md

#1 opened 24 days ago by

xinyu04

updated a collection 24 days ago

CodeScaler

Collection

6 items • Updated 5 days ago • 4

shawnxzhu

AI & ML interests

Recent Activity

Organizations

shawnxzhu's activity

Update README.md