Fishr: Invariant gradient variances for out-of-distribution generalization

A Rame, C Dancette, M Cord - International Conference on …, 2022 - proceedings.mlr.press
Learning robust models that generalize well under changes in the data distribution is critical
for real-world applications. To this end, there has been a surge of interest in learning …
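
The title names a concrete mechanism: penalizing mismatch in the domain-wise variance of gradients. Below is a minimal sketch of such a penalty, assuming per-example gradients (e.g., of the classifier head) are already materialized; the function name and exact penalty form are illustrative, not the paper's reference code.

```python
# Minimal sketch of a Fishr-style penalty (an assumed form for illustration,
# not the authors' reference implementation): match the per-domain variance
# of per-example gradients across training domains.
import torch

def fishr_penalty(per_example_grads):
    """per_example_grads: list with one (n_d, p) tensor per training domain,
    rows holding per-example gradients of the loss."""
    variances = [g.var(dim=0) for g in per_example_grads]      # (p,) per domain
    mean_var = torch.stack(variances).mean(dim=0)
    # Penalize each domain's gradient variance for drifting from the average.
    return sum(((v - mean_var) ** 2).sum() for v in variances)
```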

Estimating example difficulty using variance of gradients

C Agarwal, D D'souza… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
In machine learning, a question of great interest is understanding what examples are
challenging for a model to classify. Identifying atypical examples ensures the safe …
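
The method behind this title, variance of gradients (VoG), scores each example by how much its gradients fluctuate over training. A hedged sketch, assuming input-space gradients for one example were collected at K checkpoints; the paper's normalization details are omitted.

```python
# Variance-of-gradients (VoG) difficulty score for a single example,
# given gradients collected at K training checkpoints.
import numpy as np

def vog_score(grads):
    """grads: (K, d) array, one flattened input gradient per checkpoint."""
    var = grads.var(axis=0)   # per-dimension variance across checkpoints
    return var.mean()         # scalar score: higher = harder / more atypical
```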

An experimental study of byzantine-robust aggregation schemes in federated learning

S Li, ECH Ngai, T Voigt - IEEE Transactions on Big Data, 2023 - ieeexplore.ieee.org
Byzantine-robust federated learning aims at mitigating Byzantine failures during the
federated training process, where malicious participants (known as Byzantine clients) may …
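
For context, one classical aggregator in the family such studies compare is the coordinate-wise median; the sketch below is a generic illustration, not the paper's evaluation code.

```python
# Coordinate-wise median aggregation over client updates. Unlike the plain
# mean, which a single malicious client can drag arbitrarily far, the
# per-coordinate median tolerates a minority of corrupted rows.
import numpy as np

def coordinatewise_median(client_updates):
    """client_updates: (n_clients, p) array of model updates."""
    return np.median(client_updates, axis=0)
```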

Embarrassingly simple dataset distillation

Y Feng, S Vedantam, J Kempe - 2023 - par.nsf.gov
Training large-scale models generally requires enormous amounts of training data.
Dataset distillation aims to extract a small set of synthetic training samples from a large …
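
As orientation, one common formulation in this area is gradient matching: learn synthetic examples whose training gradient mimics that of real data. The sketch below illustrates that generic family only; it is not this paper's specific recipe, which instead differentiates through the inner training run.

```python
# Illustrative gradient-matching objective for dataset distillation.
# Minimizing it w.r.t. x_syn (a learnable tensor, e.g. optimized with Adam)
# shapes the synthetic set so its gradients imitate real-data gradients.
import torch

def matching_loss(model, x_real, y_real, x_syn, y_syn):
    ce = torch.nn.functional.cross_entropy
    g_real = [g.detach() for g in torch.autograd.grad(
        ce(model(x_real), y_real), model.parameters())]
    g_syn = torch.autograd.grad(
        ce(model(x_syn), y_syn), model.parameters(), create_graph=True)
    # Squared distance between real and synthetic gradients, summed over layers.
    return sum(((a - b) ** 2).sum() for a, b in zip(g_real, g_syn))
```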

On the limitations of compute thresholds as a governance strategy

S Hooker - arXiv preprint arXiv:2407.05694, 2024 - arxiv.org
At face value, this essay is about understanding a fairly esoteric governance tool called
compute thresholds. However, in order to grapple with whether these thresholds will achieve …

On the generalization of models trained with SGD: Information-theoretic bounds and implications

Z Wang, Y Mao - arXiv preprint arXiv:2110.03128, 2021 - arxiv.org
This paper follows up on a recent work of Neu et al. (2021) and presents some new
information-theoretic upper bounds for the generalization error of machine learning models …
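
The snippet cuts off before the bounds themselves. For orientation, the canonical mutual-information bound that this line of work refines for SGD iterates is the one of Xu & Raginsky (2017):

```latex
% Canonical information-theoretic generalization bound; the loss is assumed
% \sigma-sub-Gaussian under the data distribution for every weight vector.
\[
  \bigl|\mathbb{E}\,[L_\mu(W) - L_S(W)]\bigr|
  \;\le\; \sqrt{\frac{2\sigma^2}{n}\, I(S; W)},
\]
% where $S$ is the $n$-sample training set, $W$ the learned weights, and
% $I(S;W)$ the mutual information between them.
```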

A tale of two long tails

D D'souza, Z Nussbaum, C Agarwal… - arXiv preprint arXiv …, 2021 - arxiv.org
As machine learning models are increasingly employed to assist human decision-makers, it
becomes critical to communicate the uncertainty associated with these model predictions …

Low-variance Forward Gradients using Direct Feedback Alignment and momentum

F Bacho, D Chu - Neural Networks, 2024 - Elsevier
Supervised learning in deep neural networks is commonly performed using error
backpropagation. However, the sequential propagation of errors during the backward pass …
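
The backdrop here is the forward-gradient estimator, which avoids a backward pass entirely; the sketch below shows that plain baseline (using torch.func, PyTorch >= 2.0), not the paper's DFA-plus-momentum variance reduction.

```python
# Plain forward-gradient step: a single forward-mode pass yields an unbiased
# but high-variance gradient estimate, with no backpropagation.
import torch
from torch.func import jvp

def forward_gradient(f, params):
    """f: scalar loss taking a tuple of parameter tensors."""
    v = tuple(torch.randn_like(p) for p in params)   # random tangent direction
    _, dirderiv = jvp(f, (params,), (v,))            # directional derivative of f along v
    # (grad(f) . v) v: unbiased since E[v v^T] = I for standard normal v.
    return tuple(dirderiv * vi for vi in v)
```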

Jointly-learnt exit and inference for dynamic neural networks

J Chataoui - 2024 - escholarship.mcgill.ca
Early-exit dynamic artificial neural networks (RNDSA) aim to reduce the cost of predictions
by skipping the network's deepest layers in order to …

SimiGrad: Fine-grained adaptive batching for large scale training using gradient similarity measurement

H Qin, S Rajbhandari, O Ruwase… - Advances in Neural …, 2021 - proceedings.neurips.cc
Large-scale training requires massive parallelism to finish within a reasonable
amount of time. Large batch training is the key enabler of massive parallelism, but …
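
The core signal named in the title can be illustrated by comparing the gradients of two halves of a batch; this is a hedged sketch of that measurement only, and SimiGrad's actual criterion and batch-size schedule may differ in detail.

```python
# Cosine similarity between gradients of the two halves of a batch. High
# similarity suggests redundancy (the batch could grow); low similarity
# suggests noisy gradients, favoring a smaller batch.
import torch

def half_batch_cosine(model, loss_fn, x, y):
    half = x.shape[0] // 2
    grads = []
    for xs, ys in ((x[:half], y[:half]), (x[half:], y[half:])):
        gs = torch.autograd.grad(loss_fn(model(xs), ys), model.parameters())
        grads.append(torch.cat([g.flatten() for g in gs]))
    return torch.nn.functional.cosine_similarity(grads[0], grads[1], dim=0)
```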