Pure transformers are powerful graph learners

J Kim, D Nguyen, S Min, S Cho… - Advances in Neural …, 2022 - proceedings.neurips.cc
We show that standard Transformers without graph-specific modifications can lead to
promising results in graph learning both in theory and practice. Given a graph, we simply …

Rethinking attention with performers

K Choromanski, V Likhosherstov, D Dohan… - arXiv preprint arXiv …, 2020 - arxiv.org
We introduce Performers, Transformer architectures which can estimate regular (softmax)
full-rank-attention Transformers with provable accuracy, but using only linear (as opposed to …

Random feature attention

H Peng, N Pappas, D Yogatama, R Schwartz… - arXiv preprint arXiv …, 2021 - arxiv.org
Transformers are state-of-the-art models for a variety of sequence modeling tasks. At their
core is an attention function which models pairwise interactions between the inputs at every …
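
This entry and the Performers entry above both replace exact softmax attention with a random feature map, so attention can be computed in time linear rather than quadratic in sequence length. Below is a minimal NumPy sketch of the shared idea, using the positive (exponential) feature map associated with the softmax kernel; it illustrates the general technique, not the exact FAVOR+ or RFA constructions, and the feature count m is an arbitrary choice here.

```python
import numpy as np

def softmax_kernel_features(X, W):
    """Positive random features phi(x) with E[phi(q) . phi(k)] = exp(q . k)."""
    m = W.shape[0]                                   # W: (m, d), rows ~ N(0, I_d)
    proj = X @ W.T                                   # (n, m)
    sq_norm = 0.5 * np.sum(X ** 2, axis=-1, keepdims=True)
    return np.exp(proj - sq_norm) / np.sqrt(m)       # (n, m), all entries positive

def linear_attention(Q, K, V, m=256, seed=0):
    """Approximate softmax(Q K^T / sqrt(d)) V in O(n * m * d) time."""
    d = Q.shape[-1]
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((m, d))
    Qf = softmax_kernel_features(Q / d ** 0.25, W)   # fold the 1/sqrt(d) scaling into Q, K
    Kf = softmax_kernel_features(K / d ** 0.25, W)
    KV = Kf.T @ V                                    # (m, d_v): keys/values summarized once
    normalizer = Qf @ Kf.sum(axis=0)                 # (n,): approximates the softmax denominator
    return (Qf @ KV) / normalizer[:, None]

# Quick check against exact softmax attention on a small problem
n, d = 64, 32
rng = np.random.default_rng(1)
Q, K, V = (rng.standard_normal((n, d)) / d ** 0.5 for _ in range(3))
scores = Q @ K.T / np.sqrt(d)
probs = np.exp(scores - scores.max(axis=-1, keepdims=True))
probs /= probs.sum(axis=-1, keepdims=True)
exact = probs @ V
approx = linear_attention(Q, K, V, m=4096)
print(np.abs(exact - approx).max())                  # shrinks as m grows
```

The key point is that the keys and values are compressed into the single (m, d_v) matrix KV once, after which every query costs O(m d) instead of O(n d).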

Monarch: Expressive structured matrices for efficient and accurate training

T Dao, B Chen, NS Sohoni, A Desai… - International …, 2022 - proceedings.mlr.press
Large neural networks excel in many domains, but they are expensive to train and fine-tune.
A popular approach to reduce their compute or memory requirements is to replace dense …
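
The dense-replacement idea above can be illustrated with the simplest member of this family: a matrix written as block-diagonal · permutation · block-diagonal, which multiplies a length-n vector with two batched m x m matmuls (O(n^1.5) work for n = m^2) instead of one dense n x n matmul. The sketch below is a simplified illustration in that spirit, not the paper's exact Monarch parametrization.

```python
import numpy as np

def monarch_like_matvec(L_blocks, R_blocks, x):
    """Multiply x by M = L . P . R, where L and R are block-diagonal
    (m blocks of size m x m) and P is the perfect-shuffle permutation."""
    m = L_blocks.shape[0]                      # number of blocks = block size
    y = x.reshape(m, m)                        # split x into m chunks of length m
    y = np.einsum('bij,bj->bi', R_blocks, y)   # apply R block-wise
    y = y.T                                    # the stride/shuffle permutation P
    y = np.einsum('bij,bj->bi', L_blocks, y)   # apply L block-wise
    return y.reshape(-1)

# Compare against the materialized dense matrix on a tiny example
m = 4
n = m * m
rng = np.random.default_rng(0)
L_blocks = rng.standard_normal((m, m, m))
R_blocks = rng.standard_normal((m, m, m))

L = np.zeros((n, n)); R = np.zeros((n, n))
for b in range(m):
    L[b*m:(b+1)*m, b*m:(b+1)*m] = L_blocks[b]
    R[b*m:(b+1)*m, b*m:(b+1)*m] = R_blocks[b]
P = np.eye(n)[np.arange(n).reshape(m, m).T.reshape(-1)]

x = rng.standard_normal(n)
assert np.allclose(monarch_like_matvec(L_blocks, R_blocks, x), L @ P @ R @ x)
```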

Federated learning: Strategies for improving communication efficiency

J Konečný, HB McMahan, FX Yu, P Richtárik… - arXiv preprint arXiv …, 2016 - arxiv.org
Federated Learning is a machine learning setting where the goal is to train a high-quality
centralized model while training data remains distributed over a large number of clients …

Random features for kernel approximation: A survey on algorithms, theory, and beyond

F Liu, X Huang, Y Chen… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
The class of random features is one of the most popular techniques to speed up kernel
methods in large-scale problems. Related works have been recognized by the NeurIPS Test …
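
The survey above concerns feature maps z(·) chosen so that z(x) . z(y) is an unbiased estimate of a kernel k(x, y); the classic example is random Fourier features for the Gaussian (RBF) kernel. A short NumPy sketch of that standard construction follows; the feature dimension D and bandwidth sigma are arbitrary choices made here.

```python
import numpy as np

def random_fourier_features(X, D=1000, sigma=1.0, seed=0):
    """Map rows of X to z(x) with E[z(x) . z(y)] = exp(-||x - y||^2 / (2 sigma^2))."""
    rng = np.random.default_rng(seed)
    d = X.shape[1]
    W = rng.standard_normal((D, d)) / sigma    # samples from the kernel's spectral measure
    b = rng.uniform(0.0, 2.0 * np.pi, size=D)  # random phases
    return np.sqrt(2.0 / D) * np.cos(X @ W.T + b)

# The Gram matrix of the features approximates the exact kernel matrix
rng = np.random.default_rng(1)
X = rng.standard_normal((5, 3))
Z = random_fourier_features(X, D=20000)
approx = Z @ Z.T
sq_dists = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
exact = np.exp(-sq_dists / 2.0)
print(np.abs(approx - exact).max())            # small for large D
```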

Distributed mean estimation with limited communication

AT Suresh, FX Yu, S Kumar… - … on machine learning, 2017 - proceedings.mlr.press
Motivated by the need for distributed learning and optimization algorithms with low
communication cost, we study communication efficient algorithms for distributed mean …
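
A concrete instance of the problem in the entry above: each of n clients holds a vector, and the server wants their mean while each client transmits only a few bits per coordinate. The sketch below implements simple unbiased stochastic binary quantization (one bit per coordinate plus two floats per client); it illustrates the basic scheme only, not the rotation-based or variable-length-coded variants studied in the paper.

```python
import numpy as np

def quantize_1bit(x, rng):
    """Stochastically round each coordinate to x.min() or x.max() so that
    the quantized vector is an unbiased estimate of x."""
    lo, hi = x.min(), x.max()
    if hi == lo:
        return np.full_like(x, lo)
    p = (x - lo) / (hi - lo)                  # probability of rounding up
    return np.where(rng.random(x.shape) < p, hi, lo)

rng = np.random.default_rng(0)
clients = [rng.standard_normal(10_000) for _ in range(50)]

true_mean = np.mean(clients, axis=0)
quantized_mean = np.mean([quantize_1bit(x, rng) for x in clients], axis=0)

# Each quantized vector is unbiased, so the averaged estimate improves with more clients
print(np.mean((quantized_mean - true_mean) ** 2))
```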

Modeling the influence of data structure on learning in neural networks: The hidden manifold model

S Goldt, M Mézard, F Krzakala, L Zdeborová - Physical Review X, 2020 - APS
Understanding the reasons for the success of deep neural networks trained using stochastic
gradient-based methods is a key open problem for the nascent theory of deep learning. The …

Multiplicative filter networks

R Fathony, AK Sahu, D Willmott… - … Conference on Learning …, 2020 - openreview.net
Although deep networks are typically used to approximate functions over high dimensional
inputs, recent work has increased interest in neural networks as function approximators for …
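
For the entry above, the architecture can be summarized in a few lines: each layer multiplies a sinusoidal filter of the raw input elementwise with a linear transform of the previous hidden state, so nonlinearities are never composed. A minimal NumPy sketch of the Fourier-filter variant follows; layer sizes and initialization scales are arbitrary choices here, not the paper's settings.

```python
import numpy as np

def init_mfn(d_in, hidden, n_layers, d_out, seed=0, omega_scale=30.0):
    """Random parameters for a FourierNet-style multiplicative filter network."""
    rng = np.random.default_rng(seed)
    return {
        "filters": [(omega_scale * rng.standard_normal((hidden, d_in)),
                     rng.uniform(0, 2 * np.pi, hidden)) for _ in range(n_layers)],
        "linears": [(rng.standard_normal((hidden, hidden)) / np.sqrt(hidden),
                     np.zeros(hidden)) for _ in range(n_layers - 1)],
        "out": (rng.standard_normal((d_out, hidden)) / np.sqrt(hidden), np.zeros(d_out)),
    }

def mfn_forward(params, x):
    """z1 = g1(x);  z_{i+1} = g_{i+1}(x) * (W_i z_i + b_i);  y = W_out z_k + b_out.
    Each filter g_i(x) = sin(Omega_i x + phi_i) acts on the raw input x."""
    W0, phi0 = params["filters"][0]
    z = np.sin(x @ W0.T + phi0)
    for (Wf, phif), (W, b) in zip(params["filters"][1:], params["linears"]):
        z = np.sin(x @ Wf.T + phif) * (z @ W.T + b)
    Wo, bo = params["out"]
    return z @ Wo.T + bo

# Forward pass on 2-D coordinates (e.g. image pixel locations)
coords = np.random.default_rng(1).uniform(-1, 1, size=(128, 2))
params = init_mfn(d_in=2, hidden=64, n_layers=4, d_out=3)
print(mfn_forward(params, coords).shape)    # (128, 3)
```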

Memory attention networks for skeleton-based action recognition

C Li, C **e, B Zhang, J Han, X Zhen… - IEEE Transactions on …, 2021‏ - ieeexplore.ieee.org
Skeleton-based action recognition has been extensively studied, but it remains an unsolved
problem because of the complex variations of skeleton joints in 3-D spatiotemporal space …