2 13 2

Steven

yijunyang

stevenyangyj

AI & ML interests

None yet

Recent Activity

authored a paper 10 days ago

System-2 Mathematical Reasoning via Enriched Instruction Tuning

authored a paper 10 days ago

WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents

authored a paper 10 days ago

SeeNav-Agent: Enhancing Vision-Language Navigation with Visual Prompt and Step-Level Policy Optimization

View all activity

Organizations

authored 6 papers 10 days ago

System-2 Mathematical Reasoning via Enriched Instruction Tuning

Paper • 2412.16964 • Published Dec 22, 2024 • 2

WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents

Paper • 2504.15785 • Published Apr 22, 2025 • 22

SeeNav-Agent: Enhancing Vision-Language Navigation with Visual Prompt and Step-Level Policy Optimization

Paper • 2512.02631 • Published Dec 2, 2025 • 9

GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training

Paper • 2512.13043 • Published Dec 15, 2025 • 6

ProAct: Agentic Lookahead in Interactive Environments

Paper • 2602.05327 • Published Feb 5 • 27

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

Paper • 2603.24533 • Published 11 days ago • 46

liked a model 10 days ago

MarsXL/UI-Voyager

Image-Text-to-Text • 570k • Updated 10 days ago • 258 • 5

upvoted a paper 10 days ago

UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience

Paper • 2603.24533 • Published 11 days ago • 46

upvoted a paper 17 days ago

When AI Navigates the Fog of War

Paper • 2603.16642 • Published 19 days ago • 28

upvoted a paper about 2 months ago

ProAct: Agentic Lookahead in Interactive Environments

Paper • 2602.05327 • Published Feb 5 • 27

upvoted a paper 3 months ago

GTR-Turbo: Merged Checkpoint is Secretly a Free Teacher for Agentic VLM Training

Paper • 2512.13043 • Published Dec 15, 2025 • 6

upvoted 3 papers 4 months ago

SeeNav-Agent: Enhancing Vision-Language Navigation with Visual Prompt and Step-Level Policy Optimization

Paper • 2512.02631 • Published Dec 2, 2025 • 9

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 245

HunyuanVideo 1.5 Technical Report

Paper • 2511.18870 • Published Nov 24, 2025 • 29

liked a dataset 10 months ago

huanqia/MM-IQ

Viewer • Updated Feb 7, 2025 • 2.71k • 86 • 16

upvoted a paper 12 months ago

C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing

Paper • 2504.07964 • Published Apr 10, 2025 • 62

authored a paper about 1 year ago

GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training

Paper • 2503.08525 • Published Mar 11, 2025 • 17

upvoted a paper about 1 year ago

GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training

Paper • 2503.08525 • Published Mar 11, 2025 • 17

commented a paper about 1 year ago

GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training

Paper • 2503.08525 • Published Mar 11, 2025 • 17 •

upvoted a paper about 1 year ago

R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts

Paper • 2502.20395 • Published Feb 27, 2025 • 45

Steven

AI & ML interests

Recent Activity

Organizations

yijunyang's activity