A survey of Mamba

H Qu, L Ning, R An, W Fan, T Derr, H Liu, X Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
As one of the most representative DL techniques, Transformer architecture has empowered
numerous advanced models, especially the large language models (LLMs) that comprise …

Venturing into uncharted waters: The navigation compass from Transformer to Mamba

Y Zou, Y Chen, Z Li, L Zhang, H Zhao - arXiv preprint arXiv:2406.16722, 2024 - arxiv.org
Transformer, a deep neural network architecture, has long dominated the field of natural
language processing and beyond. Nevertheless, the recent introduction of Mamba …

State space model for new-generation network alternative to Transformers: A survey

X Wang, S Wang, Y Ding, Y Li, W Wu, Y Rong… - arXiv preprint arXiv …, 2024 - arxiv.org
In the post-deep learning era, the Transformer architecture has demonstrated its powerful
performance across pre-trained big models and various downstream tasks. However, the …

GraphLSS: Integrating Lexical, Structural, and Semantic Features for Long Document Extractive Summarization

M Bugueño, HA Hamdan, G de Melo - arXiv preprint arXiv:2410.21315, 2024 - arxiv.org
Heterogeneous graph neural networks have recently gained attention for long document
summarization, modeling the extraction as a node classification task. Although effective …

Improving Accessibility of SCOTUS Opinions: A Benchmark Study and a New Dataset for Generic Heading Prediction and Specific Heading Generation

M Yaich, N Hernandez - The 31st International Conference on …, 2025 - hal.science
The opinions of the US Supreme Court (SCOTUS) are known for their extensive length,
complex legal language, and lack of titled sections, which pose significant challenges for …

Handling Very Long Contexts in Neural Machine Translation: a Survey

Z Peng, R Bawden, F Yvon - 2024 - inria.hal.science
This report examines methods for integrating an extended discourse context in machine
translation, focusing on neural translation methods. Machine translation systems generally …

Neural abstractive summarization: improvements at the sequence-level

M Ravaut - 2024 - dr.ntu.edu.sg
Automatic text summarization has made a fantastic leap forward in the last five to ten years,
fueled by the rise of deep learning systems. Summarization at large consists in compressing …