Študovňa Google

BC Das, MH Amini, Y Wu - ACM Computing Surveys, 2025 - dl.acm.org

Large language models (LLMs) have demonstrated extraordinary capabilities and
contributed to multiple fields, such as generating and summarizing text, language …

Uložiť Citovať Citované 686-krát Súvisiace články Všetky verzie 11

[Free GPT-4]
[DeepSeek]

[PDF] nature.com

Graph neural networks for materials science and chemistry

P Reiser, M Neubert, A Eberhard, L Torresi… - Communications …, 2022 - nature.com

Abstract Machine learning plays an increasingly important role in many areas of chemistry
and materials science, being used to predict materials properties, accelerate simulations …

Uložiť Citovať Citované 459-krát Súvisiace články Všetky verzie 14

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Resurrecting recurrent neural networks for long sequences

A Orvieto, SL Smith, A Gu, A Fernando… - International …, 2023 - proceedings.mlr.press

Abstract Recurrent Neural Networks (RNNs) offer fast inference on long sequences but are
hard to optimize and slow to train. Deep state-space models (SSMs) have recently been …

Uložiť Citovať Citované 266-krát Súvisiace články Všetky verzie 9 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Hungry hungry hippos: Towards language modeling with state space models

DY Fu, T Dao, KK Saab, AW Thomas, A Rudra… - arxiv preprint arxiv …, 2022 - arxiv.org

State space models (SSMs) have demonstrated state-of-the-art sequence modeling
performance in some modalities, but underperform attention in language modeling …

Uložiť Citovať Citované 465-krát Súvisiace články Všetky verzie 4 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Simplified state space layers for sequence modeling

JTH Smith, A Warrington, SW Linderman - arxiv preprint arxiv:2208.04933, 2022 - arxiv.org

Models using structured state space sequence (S4) layers have achieved state-of-the-art
performance on long-range sequence modeling tasks. An S4 layer combines linear state …

Uložiť Citovať Citované 493-krát Súvisiace články Všetky verzie 5 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

No language left behind: Scaling human-centered machine translation

MR Costa-Jussà, J Cross, O Çelebi, M Elbayad… - arxiv preprint arxiv …, 2022 - arxiv.org

Driven by the goal of eradicating language barriers on a global scale, machine translation
has solidified itself as a key focus of artificial intelligence research today. However, such …

Uložiť Citovať Citované 774-krát Súvisiace články Všetky verzie 2 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Xmem: Long-term video object segmentation with an atkinson-shiffrin memory model

HK Cheng, AG Schwing - European Conference on Computer Vision, 2022 - Springer

We present XMem, a video object segmentation architecture for long videos with unified
feature memory stores inspired by the Atkinson-Shiffrin memory model. Prior work on video …

Uložiť Citovať Citované 416-krát Súvisiace články Všetky verzie 8

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Spatext: Spatio-textual representation for controllable image generation

O Avrahami, T Hayes, O Gafni… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recent text-to-image diffusion models are able to generate convincing results of
unprecedented quality. However, it is nearly impossible to control the shapes of different …

Uložiť Citovať Citované 200-krát Súvisiace články Všetky verzie 6 HTML verzia

[Free GPT-4]
[DeepSeek]

[PDF] google.com

Spectral–spatial morphological attention transformer for hyperspectral image classification

SK Roy, A Deria, C Shah, JM Haut… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

In recent years, convolutional neural networks (CNNs) have drawn significant attention for
the classification of hyperspectral images (HSIs). Due to their self-attention mechanism, the …

Uložiť Citovať Citované 173-krát Súvisiace články Všetky verzie 3

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Bevformer: learning bird's-eye-view representation from lidar-camera via spatiotemporal transformers

Z Li, W Wang, H Li, E **e, C Sima, T Lu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Multi-modality fusion strategy is currently the de-facto most competitive solution for 3D
perception tasks. In this work, we present a new framework termed BEVFormer, which learns …

Uložiť Citovať Citované 1323-krát Súvisiace články Všetky verzie 14

Vytvoriť upozornenie

Citovať

Rozšírené vyhľadávanie

Uložené do mojej knižnice

On the properties of neural machine translation: Encoder-decoder approaches

Security and privacy challenges of large language models: A survey

Graph neural networks for materials science and chemistry

Resurrecting recurrent neural networks for long sequences

Hungry hungry hippos: Towards language modeling with state space models

Simplified state space layers for sequence modeling

No language left behind: Scaling human-centered machine translation

Xmem: Long-term video object segmentation with an atkinson-shiffrin memory model

Spatext: Spatio-textual representation for controllable image generation

Spectral–spatial morphological attention transformer for hyperspectral image classification

Bevformer: learning bird's-eye-view representation from lidar-camera via spatiotemporal transformers