Evaluating Parameter Efficient Methods for RLVR Paper ⢠2512.23165 ⢠Published Dec 29, 2025 ⢠26
Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning Paper ⢠2510.23473 ⢠Published Oct 27, 2025 ⢠85
Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding Paper ⢠2505.16990 ⢠Published May 22, 2025 ⢠22
REPAIR: Robust Editing via Progressive Adaptive Intervention and Reintegration Paper ⢠2510.01879 ⢠Published Oct 2, 2025 ⢠8
Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning? Paper ⢠2510.06036 ⢠Published Oct 7, 2025 ⢠7
Interleaving Reasoning for Better Text-to-Image Generation Paper ⢠2509.06945 ⢠Published Sep 8, 2025 ⢠15
OpenCUA: Open Foundations for Computer-Use Agents Paper ⢠2508.09123 ⢠Published Aug 12, 2025 ⢠31
Seedance 1.0: Exploring the Boundaries of Video Generation Models Paper ⢠2506.09113 ⢠Published Jun 10, 2025 ⢠105
Flow-GRPO: Training Flow Matching Models via Online RL Paper ⢠2505.05470 ⢠Published May 8, 2025 ⢠87
Magic 1-For-1: Generating One Minute Video Clips within One Minute Paper ⢠2502.07701 ⢠Published Feb 11, 2025 ⢠36
Direct Preference Optimization Using Sparse Feature-Level Constraints Paper ⢠2411.07618 ⢠Published Nov 12, 2024 ⢠17