Scale RAE Collection Collection for "Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders" • 9 items • Updated 4 days ago • 3
Solaris: Building a Multiplayer Video World Model in Minecraft Paper • 2602.22208 • Published 21 days ago • 28
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published 15 days ago • 93
Beyond Language Modeling: An Exploration of Multimodal Pretraining Paper • 2603.03276 • Published 15 days ago • 93