Agent attention: On the integration of softmax and linear attention

D Han, T Ye, Y Han, Z Xia, S Pan, P Wan… - … on Computer Vision, 2024 - Springer
The attention module is the key component in Transformers. While the global attention
mechanism offers high expressiveness, its excessive computational cost restricts its …

CATNet: Cascaded attention transformer network for marine species image classification

W Zhang, G Chen, P Zhuang, W Zhao… - Expert Systems with …, 2024 - Elsevier
Complex physicochemical environmental effects give underwater species images
highly intricate and diverse backgrounds, which pose significant challenges for identifying …

A survey on transformer compression

Y Tang, Y Wang, J Guo, Z Tu, K Han, H Hu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large models based on the Transformer architecture play increasingly vital roles in artificial
intelligence, particularly within the realms of natural language processing (NLP) and …

SAM-6D: Segment Anything Model meets zero-shot 6D object pose estimation

J Lin, L Liu, D Lu, K Jia - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Zero-shot 6D object pose estimation involves the detection of novel objects with their 6D
poses in cluttered scenes, presenting significant challenges for model generalizability …

Efficient diffusion transformer with step-wise dynamic attention mediators

Y Pu, Z Xia, J Guo, D Han, Q Li, D Li, Y Yuan… - … on Computer Vision, 2024 - Springer
This paper identifies significant redundancy in the query-key interactions within self-attention
mechanisms of diffusion transformer models, particularly during the early stages of …

ViT-CoMer: Vision transformer with convolutional multi-scale feature interaction for dense predictions

C Xia, X Wang, F Lv, X Hao… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Abstract Although Vision Transformer (ViT) has achieved significant success in computer
vision, it does not perform well in dense prediction tasks due to the lack of inner-patch …

Hypformer: Exploring efficient transformer fully in hyperbolic space

M Yang, H Verma, DC Zhang, J Liu, I King… - Proceedings of the 30th …, 2024 - dl.acm.org
Hyperbolic geometry has shown significant potential in modeling complex structured data,
particularly data with underlying tree-like and hierarchical structures. Despite the …

P-Mamba: Marrying Perona-Malik diffusion with Mamba for efficient pediatric echocardiographic left ventricular segmentation

Z Ye, T Chen, F Wang, H Zhang, L Zhang - arXiv preprint arXiv …, 2024 - arxiv.org
In pediatric cardiology, the accurate and immediate assessment of cardiac function through
echocardiography is crucial since it can determine whether urgent intervention is required in …

A novel state space model with local enhancement and state sharing for image fusion

Z Cao, X Wu, LJ Deng, Y Zhong - Proceedings of the 32nd ACM …, 2024 - dl.acm.org
In image fusion tasks, images from different sources possess distinct characteristics. This
has driven the development of numerous methods to explore better ways of fusing them …

DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution

Y Yue, Y Wang, B Kang, Y Han… - Advances in …, 2025 - proceedings.neurips.cc
Abstract Multimodal Large Language Models (MLLMs) have demonstrated remarkable
comprehension and reasoning capabilities with complex language and visual data. These …