Tong Xiao
neupupil
AI & ML interests
NLP & ML & LLM
Recent Activity
upvoted a paper 5 days ago
LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling
upvoted a paper about 1 month ago
GRAM: A Generative Foundation Reward Model for Reward Generalization
liked a Space 2 months ago
EfficientReasoning/efficient_reasoning_online_judgement