5 47 4

TongZheng PRO

TongZheng1999

https://kidzheng.github.io/

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a paper 13 days ago

PhyCritic: Multimodal Critic Models for Physical AI

upvoted a paper 14 days ago

OPE: Overcoming Information Saturation in Parallel Thinking via Outline-Guided Path Exploration

upvoted a paper 14 days ago

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

View all activity

Organizations

upvoted a paper 13 days ago

PhyCritic: Multimodal Critic Models for Physical AI

Paper • 2602.11124 • Published 13 days ago • 51

upvoted 2 papers 14 days ago

OPE: Overcoming Information Saturation in Parallel Thinking via Outline-Guided Path Exploration

Paper • 2602.08344 • Published 16 days ago • 5

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published 16 days ago • 67

upvoted a paper 20 days ago

Training Data Efficiency in Multimodal Process Reward Models

Paper • 2602.04145 • Published 21 days ago • 76

liked a Space 21 days ago

Efficient Reasoning Online Judgement

📉

upvoted 3 papers 21 days ago

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Paper • 2602.03048 • Published 22 days ago • 33

Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation

Paper • 2602.03619 • Published 21 days ago • 26

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Paper • 2602.03845 • Published 21 days ago • 26

submitted a paper to Daily Papers 21 days ago

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Paper • 2602.03845 • Published 21 days ago • 26

upvoted a paper 22 days ago

TTCS: Test-Time Curriculum Synthesis for Self-Evolving

Paper • 2601.22628 • Published 26 days ago • 35

updated a model about 1 month ago

TongZheng1999/HS_Model_4B_17k

4B • Updated Jan 17

published a model about 1 month ago

TongZheng1999/HS_Model_4B_17k

4B • Updated Jan 17

updated a model about 1 month ago

TongZheng1999/HS_Model_4B

4B • Updated Jan 16

published a model about 1 month ago

TongZheng1999/HS_Model_4B

4B • Updated Jan 16

updated a dataset about 1 month ago

TongZheng1999/hmmt_2025_14

Viewer • Updated Jan 15 • 2 • 4

published a dataset about 1 month ago

TongZheng1999/hmmt_2025_14

Viewer • Updated Jan 15 • 2 • 4

updated a dataset about 1 month ago

TongZheng1999/hmmt_2025_13

Viewer • Updated Jan 15 • 2 • 5

published a dataset about 1 month ago

TongZheng1999/hmmt_2025_13

Viewer • Updated Jan 15 • 2 • 5

updated a dataset about 1 month ago

TongZheng1999/hmmt_2025_12

Viewer • Updated Jan 15 • 2 • 6

published a dataset about 1 month ago

TongZheng1999/hmmt_2025_12

Viewer • Updated Jan 15 • 2 • 6

TongZheng PRO

AI & ML interests

Recent Activity

Organizations

TongZheng1999's activity

Efficient Reasoning Online Judgement