stereoplegic's Collections
StableSSM: Alleviating the Curse of Memory in State-space Models through Stable Reparameterization
Paper
• 2311.14495
• Published
• 1
Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Paper
• 2401.09417
• Published
• 62
SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation
Paper
• 2401.13560
• Published
• 1
Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces
Paper
• 2402.00789
• Published
• 2
Convolutional State Space Models for Long-Range Spatiotemporal Modeling
Paper
• 2310.19694
• Published
• 2
Vivim: a Video Vision Mamba for Medical Video Object Segmentation
Paper
• 2401.14168
• Published
• 2
2-D SSM: A General Spatial Layer for Visual Transformers
Paper
• 2306.06635
• Published
• 1
BlackMamba: Mixture of Experts for State-Space Models
Paper
• 2402.01771
• Published
• 25
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks
Paper
• 2402.04248
• Published
• 32
Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Paper
• 2302.06646
• Published
• 2
A Quantitative Review on Language Model Efficiency Research
Paper
• 2306.01768
• Published
• 2
A Unified View of Long-Sequence Models towards Modeling Million-Scale Dependencies
Paper
• 2302.06218
• Published
• 1
Accelerating Toeplitz Neural Network with Constant-time Inference Complexity
Paper
• 2311.08756
• Published
• 1
Graph Mamba: Towards Learning on Graphs with State Space Models
Paper
• 2402.08678
• Published
• 17
DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models
Paper
• 2403.00818
• Published
• 19
Improving Token-Based World Models with Parallel Observation Prediction
Paper
• 2402.05643
• Published
• 1
Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling
Paper
• 2402.10211
• Published
• 13
LOCOST: State-Space Models for Long Document Abstractive Summarization
Paper
• 2401.17919
• Published
Diffusion Models Without Attention
Paper
• 2311.18257
• Published
• 3
ZigMa: Zigzag Mamba Diffusion Model
Paper
• 2403.13802
• Published
• 18
MambaIR: A Simple Baseline for Image Restoration with State-Space Model
Paper
• 2402.15648
• Published
SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces
Paper
• 2403.07711
• Published
• 1
Scalable Diffusion Models with State Space Backbone
Paper
• 2402.05608
• Published
LocalMamba: Visual State Space Model with Windowed Selective Scan
Paper
• 2403.09338
• Published
• 8
VMamba: Visual State Space Model
Paper
• 2401.10166
• Published
• 40
VideoMamba: State Space Model for Efficient Video Understanding
Paper
• 2403.06977
• Published
• 29
MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection
Paper
• 2403.19888
• Published
• 12
MambaByte: Token-free Selective State Space Model
Paper
• 2401.13660
• Published
• 60
Zamba: A Compact 7B SSM Hybrid Model
Paper
• 2405.16712
• Published
• 25
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
Paper
• 2406.07522
• Published
• 40
Longhorn: State Space Models are Amortized Online Learners
Paper
• 2407.14207
• Published
• 18
Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model
Paper
• 2405.14174
• Published
ReMamba: Equip Mamba with Effective Long-Sequence Modeling
Paper
• 2408.15496
• Published
• 12
Mamba Retriever: Utilizing Mamba for Effective and Efficient Dense Retrieval
Paper
• 2408.08066
• Published
GrootVL: Tree Topology is All You Need in State Space Model
Paper
• 2406.02395
• Published
• 1
Paper
• 2507.06204
• Published
• 19