9 41 28

bohan zeng

zbhpku

AI & ML interests

None yet

Recent Activity

upvoted a paper about 8 hours ago

GARDO: Reinforcing Diffusion Models without Reward Hacking

upvoted a paper 8 days ago

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

liked a dataset 11 days ago

OpenDCAI/dataflex-selector-MMLUSubset-test

View all activity

Organizations

None yet

upvoted a paper about 8 hours ago

GARDO: Reinforcing Diffusion Models without Reward Hacking

Paper • 2512.24138 • Published 8 days ago • 26

upvoted a paper 8 days ago

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

Paper • 2512.15560 • Published 21 days ago • 24

upvoted a paper 14 days ago

SemanticGen: Video Generation in Semantic Space

Paper • 2512.20619 • Published 15 days ago • 89

upvoted a paper 15 days ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Paper • 2512.16676 • Published 20 days ago • 202

upvoted a paper 19 days ago

Kling-Omni Technical Report

Paper • 2512.16776 • Published 20 days ago • 164

upvoted a paper 20 days ago

VABench: A Comprehensive Benchmark for Audio-Video Generation

Paper • 2512.09299 • Published 28 days ago • 7

upvoted a paper 21 days ago

Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling

Paper • 2512.12675 • Published 24 days ago • 40

upvoted a paper 23 days ago

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Paper • 2512.11749 • Published 26 days ago • 38

upvoted a paper 26 days ago

Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation

Paper • 2512.10949 • Published 27 days ago • 45

upvoted a paper about 1 month ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published Dec 4, 2025 • 167

upvoted 2 papers 2 months ago

When Modalities Conflict: How Unimodal Reasoning Uncertainty Governs Preference Dynamics in MLLMs

Paper • 2511.02243 • Published Nov 4, 2025 • 24

Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks

Paper • 2510.19195 • Published Oct 22, 2025 • 10

upvoted 6 papers 3 months ago

OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing

Paper • 2509.24900 • Published Sep 29, 2025 • 53

RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark

Paper • 2509.24897 • Published Sep 29, 2025 • 46

upvoted 2 papers 4 months ago

Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification

Paper • 2506.07235 • Published Jun 8, 2025 • 3

Multimodal Reasoning for Science: Technical Report and 1st Place Solution to the ICML 2025 SeePhys Challenge

Paper • 2509.06079 • Published Sep 7, 2025 • 6

bohan zeng

AI & ML interests

Recent Activity

Organizations

zbhpku's activity