sascha-kirch 's Collections Diffusion Models
updated
Instruct-Imagen: Image Generation with Multi-modal Instruction
Paper
• 2401.01952
• Published
• 32
ODIN: A Single Model for 2D and 3D Perception
Paper
• 2401.02416
• Published
• 13
Bigger is not Always Better: Scaling Properties of Latent Diffusion
Models
Paper
• 2404.01367
• Published
• 22
Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion
Models
Paper
• 2404.02747
• Published
• 13
PointInfinity: Resolution-Invariant Point Diffusion Models
Paper
• 2404.03566
• Published
• 16
ControlNet++: Improving Conditional Controls with Efficient Consistency
Feedback
Paper
• 2404.07987
• Published
• 48
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse
Controls to Any Diffusion Model
Paper
• 2404.09967
• Published
• 21
Align Your Steps: Optimizing Sampling Schedules in Diffusion Models
Paper
• 2404.14507
• Published
• 23
Semantica: An Adaptable Image-Conditioned Diffusion Model
Paper
• 2405.14857
• Published
• 11
Improved Distribution Matching Distillation for Fast Image Synthesis
Paper
• 2405.14867
• Published
• 15
Paper
• 2405.18407
• Published
• 48
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis
Paper
• 2403.03206
• Published
• 71
Kaleido Diffusion: Improving Conditional Diffusion Models with
Autoregressive Latent Modeling
Paper
• 2405.21048
• Published
• 16
4Diffusion: Multi-view Video Diffusion Model for 4D Generation
Paper
• 2405.20674
• Published
• 15
Learning Temporally Consistent Video Depth from Video Diffusion Priors
Paper
• 2406.01493
• Published
• 23
BitsFusion: 1.99 bits Weight Quantization of Diffusion Model
Paper
• 2406.04333
• Published
• 38
Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few
Steps Image Generation
Paper
• 2406.02347
• Published
• 3
Step-aware Preference Optimization: Aligning Preference with Denoising
Performance at Each Step
Paper
• 2406.04314
• Published
• 30
Alleviating Distortion in Image Generation via Multi-Resolution
Diffusion Models
Paper
• 2406.09416
• Published
• 29
Interpreting the Weight Space of Customized Diffusion Models
Paper
• 2406.09413
• Published
• 20
ExVideo: Extending Video Diffusion Models via Parameter-Efficient
Post-Tuning
Paper
• 2406.14130
• Published
• 10
Paper
• 2402.09470
• Published
• 13
ControlNeXt: Powerful and Efficient Control for Image and Video
Generation
Paper
• 2408.06070
• Published
• 55
Paper
• 2408.07009
• Published
• 62
Discrete Diffusion Modeling by Estimating the Ratios of the Data
Distribution
Paper
• 2310.16834
• Published
• 5
Transfusion: Predict the Next Token and Diffuse Images with One
Multi-Modal Model
Paper
• 2408.11039
• Published
• 63
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
Paper
• 2411.18613
• Published
• 59
SNOOPI: Supercharged One-step Diffusion Distillation with Proper
Guidance
Paper
• 2412.02687
• Published
• 113
Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion
Models
Paper
• 2312.09608
• Published
• 16
Audio-visual Controlled Video Diffusion with Masked Selective State
Spaces Modeling for Natural Talking Head Generation
Paper
• 2504.02542
• Published
• 51