Yuwei Niu

Yuwei-Niu

https://purshow.github.io/

AI & ML interests

None yet

Recent Activity

liked a dataset 22 days ago

Yuwei-Niu/CVM-AAAI

Organizations

upvoted a paper 12 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 12 days ago • 61

upvoted a paper about 1 month ago

Does Understanding Inform Generation in Unified Multimodal Models? From Analysis to Path Forward

Paper • 2511.20561 • Published Nov 25, 2025 • 32

upvoted 3 papers 3 months ago

SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models

Paper • 2510.12784 • Published Oct 14, 2025 • 19

OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing

Paper • 2509.24900 • Published Sep 29, 2025 • 53

RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark

Paper • 2509.24897 • Published Sep 29, 2025 • 46

upvoted a paper 4 months ago

Symbolic Graphics Programming with Large Language Models

Paper • 2509.05208 • Published Sep 5, 2025 • 46

upvoted an article 5 months ago

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

Article • Aug 11, 2025

upvoted 2 papers 5 months ago

Pixels, Patterns, but No Poetry: To See The World like Humans

Paper • 2507.16863 • Published Jul 21, 2025 • 68

Semi-off-Policy Reinforcement Learning for Vision-Language Slow-thinking Reasoning

Paper • 2507.16814 • Published Jul 22, 2025 • 21

upvoted a paper 7 months ago

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published Jun 3, 2025 • 58

upvoted a paper 8 months ago

Step1X-Edit: A Practical Framework for General Image Editing

Paper • 2504.17761 • Published Apr 24, 2025 • 92

upvoted a paper 9 months ago

Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation

Paper • 2503.19622 • Published Mar 25, 2025 • 31

upvoted 2 papers 10 months ago

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

Paper • 2503.07265 • Published Mar 10, 2025 • 4

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Paper • 2503.07365 • Published Mar 10, 2025 • 61

upvoted a paper 11 months ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published Jan 30, 2025 • 88

upvoted 3 papers about 1 year ago

Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate

Paper • 2410.07167 • Published Oct 9, 2024 • 39

Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning

Paper • 2410.06373 • Published Oct 8, 2024 • 36

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

Paper • 2410.05363 • Published Oct 7, 2024 • 45

upvoted a paper over 1 year ago

Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29, 2024 • 95
