1 15 11

Yuxuan Wang

yxwang1215

https://yxwang1215.github.io/

yxwang1215

AI & ML interests

None yet

Recent Activity

liked a dataset 22 days ago

Hezep/AudioMarathon

upvoted a paper 23 days ago

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

upvoted a paper 25 days ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

View all activity

Organizations

liked a dataset 22 days ago

Hezep/AudioMarathon

Viewer • Updated Nov 12, 2025 • 6.36k • 1.14k • 4

upvoted a paper 23 days ago

Green-VLA: Staged Vision-Language-Action Model for Generalist Robots

Paper • 2602.00919 • Published Jan 31 • 324

upvoted a paper 25 days ago

HopChain: Multi-Hop Data Synthesis for Generalizable Vision-Language Reasoning

Paper • 2603.17024 • Published 28 days ago • 109

upvoted 2 papers 2 months ago

Grounding and Enhancing Informativeness and Utility in Dataset Distillation

Paper • 2601.21296 • Published Jan 29 • 19

Innovator-VL: A Multimodal Large Language Model for Scientific Discovery

Paper • 2601.19325 • Published Jan 27 • 81

New activity in TencentBAC/RoT-Qwen3-VL-4B 3 months ago

Lack of "special_tokens.bin" file

#1 opened 3 months ago by

yxwang1215

liked a dataset 3 months ago

openai/gsm8k

Benchmark • Updated 22 days ago • 17.6k • 775k • 1.25k

liked 2 models 3 months ago

TencentBAC/RoT-Qwen3-VL-4B

Updated Jan 26 • 3

TencentBAC/RoT-Qwen3-VL-2B

Updated Jan 26 • 1

upvoted 2 papers 3 months ago

Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance

Paper • 2601.14171 • Published Jan 20 • 53

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Paper • 2601.15165 • Published Jan 21 • 73

liked a Space 3 months ago

The Ultra-Scale Playbook

🌌

3.78k

The ultimate guide to training LLM on large GPU Clusters

updated a dataset 5 months ago

yxwang1215/wmt_plus

Viewer • Updated Nov 1, 2025 • 27.1M • 6

published a dataset 5 months ago

yxwang1215/wmt_plus

Viewer • Updated Nov 1, 2025 • 27.1M • 6

liked a dataset 5 months ago

nvidia/LongAudio

Preview • Updated 9 days ago • 290 • 20

upvoted a paper 6 months ago

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

Paper • 2510.25760 • Published Oct 29, 2025 • 17

liked a model 6 months ago

maomaocun/dLLM-Var

8B • Updated Oct 29, 2025 • 5 • 4

upvoted 2 papers 6 months ago

AI for Service: Proactive Assistance with AI Glasses

Paper • 2510.14359 • Published Oct 16, 2025 • 78

Efficient Multi-modal Large Language Models via Progressive Consistency Distillation

Paper • 2510.00515 • Published Oct 1, 2025 • 42

upvoted a paper 7 months ago

IMG: Calibrating Diffusion Models via Implicit Multimodal Guidance

Paper • 2509.26231 • Published Sep 30, 2025 • 18