Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better Paper • 2602.05393 • Published 6 days ago • 6
Protein Autoregressive Modeling via Multiscale Structure Generation Paper • 2602.04883 • Published 6 days ago • 3
SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning Paper • 2602.02472 • Published 8 days ago • 44
ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation Paper • 2601.21420 • Published 13 days ago • 42
Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models Paper • 2601.19834 • Published 14 days ago • 25
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model Paper • 2601.15892 • Published 19 days ago • 53
CryoFM Collection Generative foundation model for cryo-EM density maps. See webpage: https://bytedance-seed.github.io/cryofm/. • 3 items • Updated 26 days ago • 4