Unbiased gradient estimation in unrolled computation graphs with persistent evolution strategies

P Vicol, L Metz, J Sohl-Dickstein - … Conference on Machine …, 2021 - proceedings.mlr.press
Unrolled computation graphs arise in many scenarios, including training RNNs, tuning
hyperparameters through unrolled optimization, and training learned optimizers. Current …

Meta-AdaM: An meta-learned adaptive optimizer with momentum for few-shot learning

S Sun, H Gao - Advances in Neural Information Processing …, 2024 - proceedings.neurips.cc
Abstract We introduce Meta-AdaM, a meta-learned adaptive optimizer with momentum,
designed for few-shot learning tasks that pose significant challenges to deep learning …

Bidirectional learning for offline model-based biological sequence design

C Chen, Y Zhang, X Liu… - … Conference on Machine …, 2023 - proceedings.mlr.press
Offline model-based optimization aims to maximize a black-box objective function with a
static dataset of designs and their scores. In this paper, we focus on biological sequence …

[HTML][HTML] Efficient learning rate adaptation based on hierarchical optimization approach

GS Na - Neural Networks, 2022 - Elsevier
This paper proposes a new hierarchical approach to learning rate adaptation in gradient
methods, called learning rate optimization (LRO). LRO formulates the learning rate adaption …

Hydra: Hypergradient data relevance analysis for interpreting deep neural networks

Y Chen, B Li, H Yu, P Wu, C Miao - … of the AAAI Conference on Artificial …, 2021 - ojs.aaai.org
The behaviors of deep neural networks (DNNs) are notoriously resistant to human
interpretations. In this paper, we propose Hypergradient Data Relevance Analysis, or …

Selecting and composing learning rate policies for deep neural networks

Y Wu, L Liu - ACM Transactions on Intelligent Systems and …, 2023 - dl.acm.org
The choice of learning rate (LR) functions and policies has evolved from a simple fixed LR to
the decaying LR and the cyclic LR, aiming to improve the accuracy and reduce the training …

[PDF][PDF] End-to-end deep learning framework for real-time inertial attitude estimation using 6dof imu

AA Golroudbari, MH Sabour - arxiv preprint arxiv:2302.06037, 2023 - researchgate.net
ABSTRACT Inertial Measurement Units (IMU) are commonly used in inertial attitude
estimation from engineering to medical sciences. There may be disturbances and high …

Amortized proximal optimization

J Bae, P Vicol, JZ HaoChen… - Advances in Neural …, 2022 - proceedings.neurips.cc
We propose a framework for online meta-optimization of parameters that govern
optimization, called Amortized Proximal Optimization (APO). We first interpret various …

Generalizable end-to-end deep learning frameworks for real-time attitude estimation using 6DoF inertial measurement units

AA Golroudbari, MH Sabour - Measurement, 2023 - Elsevier
This paper presents a novel end-to-end deep learning framework for real-time inertial
attitude estimation using 6DoF IMU measurements. Inertial Measurement Units are widely …

Adam through a second-order lens

RM Clarke, B Su, JM Hernández-Lobato - arxiv preprint arxiv:2310.14963, 2023 - arxiv.org
Research into optimisation for deep learning is characterised by a tension between the
computational efficiency of first-order, gradient-based methods (such as SGD and Adam) …