Dgmamba: Domain generalization via generalized state space model

S Long, Q Zhou, X Li, X Lu, C Ying, Y Luo… - Proceedings of the …, 2024 - dl.acm.org
Domain generalization (DG) aims at solving distribution shift problems in various scenes.
Existing approaches are based on Convolution Neural Networks (CNNs) or Vision …

Mamba in vision: A comprehensive survey of techniques and applications

MM Rahman, AA Tutul, A Nath, L Laishram… - arxiv preprint arxiv …, 2024 - arxiv.org
Mamba is emerging as a novel approach to overcome the challenges faced by
Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) in computer vision …

Pointrwkv: Efficient rwkv-like model for hierarchical point cloud learning

Q He, J Zhang, J Peng, H He, X Li, Y Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
Transformers have revolutionized the point cloud learning task, but the quadratic complexity
hinders its extension to long sequence and makes a burden on limited computational …

Restore-rwkv: Efficient and effective medical image restoration with rwkv

Z Yang, H Zhang, D Zhao, B Wei, Y Xu - arxiv preprint arxiv:2407.11087, 2024 - arxiv.org
Transformers have revolutionized medical image restoration, but the quadratic complexity
still poses limitations for their application to high-resolution medical images. The recent …

Temporal and Interactive Modeling for Efficient Human-Human Motion Generation

Y Wang, S Wang, J Zhang, K Fan, J Wu, Z Jiang… - arxiv preprint arxiv …, 2024 - arxiv.org
Human-human motion generation is essential for understanding humans as social beings.
Although several transformer-based methods have been proposed, they typically model …

A Survey of Rwkv

Z Li, T **a, Y Chang, Y Wu - arxiv preprint arxiv:2412.14847, 2024 - arxiv.org
The Receptance Weighted Key Value (RWKV) model offers a novel alternative to the
Transformer architecture, merging the benefits of recurrent and attention-based systems …

On Efficient Variants of Segment Anything Model: A Survey

X Sun, J Liu, HT Shen, X Zhu, P Hu - arxiv preprint arxiv:2410.04960, 2024 - arxiv.org
The Segment Anything Model (SAM) is a foundational model for image segmentation tasks,
known for its strong generalization across diverse applications. However, its impressive …

Linear Attention Modeling for Learned Image Compression

D Feng, Z Cheng, S Wang, R Wu, H Hu, G Lu… - arxiv preprint arxiv …, 2025 - arxiv.org
Recent years, learned image compression has made tremendous progress to achieve
impressive coding efficiency. Its coding gain mainly comes from non-linear neural network …

StyleRWKV: High-Quality and High-Efficiency Style Transfer with RWKV-like Architecture

M Dai, Q Zhou, L Ma - arxiv preprint arxiv:2412.19535, 2024 - arxiv.org
Style transfer aims to generate a new image preserving the content but with the artistic
representation of the style source. Most of the existing methods are based on Transformers …

GDSR: Global-Detail Integration through Dual-Branch Network with Wavelet Losses for Remote Sensing Image Super-Resolution

Q Zhu, K Li, G Zhang, X Wang, J Huang, X Li - arxiv preprint arxiv …, 2024 - arxiv.org
In recent years, deep neural networks, including Convolutional Neural Networks,
Transformers, and State Space Models, have achieved significant progress in Remote …