State space model for new-generation network alternative to transformers: A survey
In the post-deep learning era, the Transformer architecture has demonstrated its powerful
performance across pre-trained big models and various downstream tasks. However, the …
performance across pre-trained big models and various downstream tasks. However, the …
Tuning frequency bias of state space models
State space models (SSMs) leverage linear, time-invariant (LTI) systems to effectively learn
sequences with long-range dependencies. By analyzing the transfer functions of LTI …
sequences with long-range dependencies. By analyzing the transfer functions of LTI …
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image Analysis
Sequence modeling plays a vital role across various domains, with recurrent neural
networks being historically the predominant method of performing these tasks. However, the …
networks being historically the predominant method of performing these tasks. However, the …
There is HOPE to Avoid HiPPOs for Long-memory State Space Models
State-space models (SSMs) that utilize linear, time-invariant (LTI) systems are known for
their effectiveness in learning long sequences. However, these models typically face several …
their effectiveness in learning long sequences. However, these models typically face several …
Flash stu: Fast spectral transform units
This paper describes an efficient, open source PyTorch implementation of the Spectral
Transform Unit. We investigate sequence prediction tasks over several modalities including …
Transform Unit. We investigate sequence prediction tasks over several modalities including …
A Deep State Space Model for Rainfall-Runoff Simulations
The classical way of studying the rainfall-runoff processes in the water cycle relies on
conceptual or physically-based hydrologic models. Deep learning (DL) has recently …
conceptual or physically-based hydrologic models. Deep learning (DL) has recently …
Provable Length Generalization in Sequence Prediction via Spectral Filtering
We consider the problem of length generalization in sequence prediction. We define a new
metric of performance in this setting--the Asymmetric-Regret--which measures regret against …
metric of performance in this setting--the Asymmetric-Regret--which measures regret against …
FutureFill: Fast Generation from Convolutional Sequence Models
We address the challenge of efficient auto-regressive generation in sequence prediction
models by introducing FutureFill-a method for fast generation that applies to any sequence …
models by introducing FutureFill-a method for fast generation that applies to any sequence …