fulong ye

Alon77777

https://scholar.google.com.hk/citations?hl=zh-CN&user=-BbQ5VgAAAAJ

superhero-7

AI & ML interests

vision and language, diffusion model, text-to-image generation, image-to-text generation, referring expression generation and comprehension

Recent Activity

upvoted a paper about 21 hours ago

DreamStyle: A Unified Framework for Video Stylization

Paper • 2601.02785 • Published 2 days ago • 18

upvoted a paper 2 days ago

DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer

Paper • 2601.01425 • Published 4 days ago • 39

upvoted 3 papers 4 months ago

OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion Transformer Models

Paper • 2509.17627 • Published Sep 22, 2025 • 66

UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward

Paper • 2509.06818 • Published Sep 8, 2025 • 29

USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning

Paper • 2508.18966 • Published Aug 26, 2025 • 56

upvoted 2 papers 7 months ago

Phantom-Data : Towards a General Subject-Consistent Video Generation Dataset

Paper • 2506.18851 • Published Jun 23, 2025 • 30

Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors

Paper • 2505.24625 • Published May 30, 2025 • 9

authored a paper 9 months ago

DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning

Paper • 2504.14509 • Published Apr 20, 2025 • 51

upvoted a paper 9 months ago

DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning

Paper • 2504.14509 • Published Apr 20, 2025 • 51

commented on a paper 9 months ago

DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning

Paper • 2504.14509 • Published Apr 20, 2025 • 51

upvoted a paper 11 months ago

Phantom: Subject-consistent video generation via cross-modal alignment

Paper • 2502.11079 • Published Feb 16, 2025 • 59

updated a collection about 1 year ago

Multimodal

Collection

1 item • Updated Jan 2, 2025

upvoted 2 papers about 1 year ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 106

Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey

Paper • 2412.18619 • Published Dec 16, 2024 • 60

authored a paper about 1 year ago

AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models

Paper • 2412.04146 • Published Dec 5, 2024 • 23

liked a Space over 1 year ago

Image Arena Leaderboard

📊

565

Image Generation and Image Editing Arena & Leaderboard

updated a collection over 1 year ago

T2I-control

Collection

3 items • Updated Sep 14, 2024

upvoted 3 papers over 1 year ago

Aquila2 Technical Report

Paper • 2408.07410 • Published Aug 14, 2024 • 15

IMAGDressing-v1: Customizable Virtual Dressing

Paper • 2407.12705 • Published Jul 17, 2024 • 13

LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models

Paper • 2407.07895 • Published Jul 10, 2024 • 42
