Unbiased gradient estimation in unrolled computation graphs with persistent evolution strategies
Unrolled computation graphs arise in many scenarios, including training RNNs, tuning
hyperparameters through unrolled optimization, and training learned optimizers. Current …
hyperparameters through unrolled optimization, and training learned optimizers. Current …
Meta-AdaM: An meta-learned adaptive optimizer with momentum for few-shot learning
Abstract We introduce Meta-AdaM, a meta-learned adaptive optimizer with momentum,
designed for few-shot learning tasks that pose significant challenges to deep learning …
designed for few-shot learning tasks that pose significant challenges to deep learning …
Bidirectional learning for offline model-based biological sequence design
Offline model-based optimization aims to maximize a black-box objective function with a
static dataset of designs and their scores. In this paper, we focus on biological sequence …
static dataset of designs and their scores. In this paper, we focus on biological sequence …
[HTML][HTML] Efficient learning rate adaptation based on hierarchical optimization approach
GS Na - Neural Networks, 2022 - Elsevier
This paper proposes a new hierarchical approach to learning rate adaptation in gradient
methods, called learning rate optimization (LRO). LRO formulates the learning rate adaption …
methods, called learning rate optimization (LRO). LRO formulates the learning rate adaption …
Hydra: Hypergradient data relevance analysis for interpreting deep neural networks
The behaviors of deep neural networks (DNNs) are notoriously resistant to human
interpretations. In this paper, we propose Hypergradient Data Relevance Analysis, or …
interpretations. In this paper, we propose Hypergradient Data Relevance Analysis, or …
Selecting and composing learning rate policies for deep neural networks
The choice of learning rate (LR) functions and policies has evolved from a simple fixed LR to
the decaying LR and the cyclic LR, aiming to improve the accuracy and reduce the training …
the decaying LR and the cyclic LR, aiming to improve the accuracy and reduce the training …
[PDF][PDF] End-to-end deep learning framework for real-time inertial attitude estimation using 6dof imu
ABSTRACT Inertial Measurement Units (IMU) are commonly used in inertial attitude
estimation from engineering to medical sciences. There may be disturbances and high …
estimation from engineering to medical sciences. There may be disturbances and high …
Amortized proximal optimization
We propose a framework for online meta-optimization of parameters that govern
optimization, called Amortized Proximal Optimization (APO). We first interpret various …
optimization, called Amortized Proximal Optimization (APO). We first interpret various …
Generalizable end-to-end deep learning frameworks for real-time attitude estimation using 6DoF inertial measurement units
This paper presents a novel end-to-end deep learning framework for real-time inertial
attitude estimation using 6DoF IMU measurements. Inertial Measurement Units are widely …
attitude estimation using 6DoF IMU measurements. Inertial Measurement Units are widely …
Adam through a second-order lens
Research into optimisation for deep learning is characterised by a tension between the
computational efficiency of first-order, gradient-based methods (such as SGD and Adam) …
computational efficiency of first-order, gradient-based methods (such as SGD and Adam) …