A survey of Mamba
As one of the most representative DL techniques, Transformer architecture has empowered
numerous advanced models, especially the large language models (LLMs) that comprise …
Venturing into uncharted waters: The navigation compass from Transformer to Mamba
Y Zou, Y Chen, Z Li, L Zhang, H Zhao - arXiv preprint arXiv:2406.16722, 2024 - arxiv.org
Transformer, a deep neural network architecture, has long dominated the field of natural
language processing and beyond. Nevertheless, the recent introduction of Mamba …
State space model for new-generation network alternative to transformers: A survey
In the post-deep learning era, the Transformer architecture has demonstrated its powerful
performance across pre-trained big models and various downstream tasks. However, the …
GraphLSS: Integrating Lexical, Structural, and Semantic Features for Long Document Extractive Summarization
Heterogeneous graph neural networks have recently gained attention for long document
summarization, modeling the extraction as a node classification task. Although effective …
Improving Accessibility of SCOTUS Opinions: A Benchmark Study and a New Dataset for Generic Heading Prediction and Specific Heading Generation
M Yaich, N Hernandez - The 31st International Conference on …, 2025 - hal.science
The opinions of the US Supreme Court (SCOTUS) are known for their extensive length,
complex legal language, and lack of titled sections, which pose significant challenges for …
Handling Very Long Contexts in Neural Machine Translation: a Survey
This report examines methods for integrating an extended discourse context in machine
translation, focusing on neural translation methods. Machine translation systems generally …
Neural abstractive summarization: improvements at the sequence-level
M Ravaut - 2024 - dr.ntu.edu.sg
Automatic text summarization has made a fantastic leap forward in the last five to ten years,
fueled by the rise of deep learning systems. Summarization at large consists in compressing …