- Academic Search

Unbiased gradient estimation in unrolled computation graphs with persistent evolution strategies

P Vicol, L Metz, J Sohl-Dickstein - … Conference on Machine …, 2021 - proceedings.mlr.press

Unrolled computation graphs arise in many scenarios, including training RNNs, tuning
hyperparameters through unrolled optimization, and training learned optimizers. Current …

Cited by: 66. All 12 versions.

[PDF] neurips.cc

Meta-AdaM: An meta-learned adaptive optimizer with momentum for few-shot learning

S Sun, H Gao - Advances in Neural Information Processing …, 2024 - proceedings.neurips.cc

Abstract We introduce Meta-AdaM, a meta-learned adaptive optimizer with momentum,
designed for few-shot learning tasks that pose significant challenges to deep learning …

Cited by: 25. All 3 versions.

[PDF] mlr.press

Bidirectional learning for offline model-based biological sequence design

C Chen, Y Zhang, X Liu… - … Conference on Machine …, 2023 - proceedings.mlr.press

Offline model-based optimization aims to maximize a black-box objective function with a
static dataset of designs and their scores. In this paper, we focus on biological sequence …

Cited by: 19. All 7 versions.

[HTML] sciencedirect.com

[HTML] Efficient learning rate adaptation based on hierarchical optimization approach

GS Na - Neural Networks, 2022 - Elsevier

This paper proposes a new hierarchical approach to learning rate adaptation in gradient
methods, called learning rate optimization (LRO). LRO formulates the learning rate adaption …

Cited by: 21. All 4 versions.

[PDF] aaai.org

Hydra: Hypergradient data relevance analysis for interpreting deep neural networks

Y Chen, B Li, H Yu, P Wu, C Miao - … of the AAAI Conference on Artificial …, 2021 - ojs.aaai.org

The behaviors of deep neural networks (DNNs) are notoriously resistant to human
interpretations. In this paper, we propose Hypergradient Data Relevance Analysis, or …

Cited by: 44. All 9 versions.

[PDF] acm.org

Selecting and composing learning rate policies for deep neural networks

Y Wu, L Liu - ACM Transactions on Intelligent Systems and …, 2023 - dl.acm.org

The choice of learning rate (LR) functions and policies has evolved from a simple fixed LR to
the decaying LR and the cyclic LR, aiming to improve the accuracy and reduce the training …

Cited by: 33. All 5 versions.

[PDF] researchgate.net

[PDF] End-to-end deep learning framework for real-time inertial attitude estimation using 6DoF IMU

AA Golroudbari, MH Sabour - arXiv preprint arXiv:2302.06037, 2023 - researchgate.net

ABSTRACT Inertial Measurement Units (IMU) are commonly used in inertial attitude
estimation from engineering to medical sciences. There may be disturbances and high …

Cited by: 13.

[PDF] neurips.cc

Amortized proximal optimization

J Bae, P Vicol, JZ HaoChen… - Advances in Neural …, 2022 - proceedings.neurips.cc

We propose a framework for online meta-optimization of parameters that govern
optimization, called Amortized Proximal Optimization (APO). We first interpret various …

Cited by: 17. All 8 versions.

[PDF] arxiv.org

Generalizable end-to-end deep learning frameworks for real-time attitude estimation using 6DoF inertial measurement units

AA Golroudbari, MH Sabour - Measurement, 2023 - Elsevier

This paper presents a novel end-to-end deep learning framework for real-time inertial
attitude estimation using 6DoF IMU measurements. Inertial Measurement Units are widely …

Cited by: 12. All 7 versions.

[PDF] arxiv.org

Adam through a second-order lens

RM Clarke, B Su, JM Hernández-Lobato - arXiv preprint arXiv:2310.14963, 2023 - arxiv.org

Research into optimisation for deep learning is characterised by a tension between the
computational efficiency of first-order, gradient-based methods (such as SGD and Adam) …

Cited by: 1. All 4 versions.

Unbiased gradient estimation in unrolled computation graphs with persistent evolution strategies