Toward a theoretical foundation of policy optimization for learning control policies

B Hu, K Zhang, N Li, M Mesbahi… - Annual Review of …, 2023 - annualreviews.org
Gradient-based methods have been widely used for system design and optimization in
diverse application domains. Recently, there has been a renewed interest in studying …

Nonconvex optimization meets low-rank matrix factorization: An overview

Y Chi, YM Lu, Y Chen - IEEE Transactions on Signal …, 2019 - ieeexplore.ieee.org
Substantial progress has been made recently on developing provably accurate and efficient
algorithms for low-rank matrix factorization via nonconvex optimization. While conventional …

Edge artificial intelligence for 6G: Vision, enabling technologies, and applications

KB Letaief, Y Shi, J Lu, J Lu - IEEE Journal on Selected Areas …, 2021 - ieeexplore.ieee.org
The thriving of artificial intelligence (AI) applications is driving the further evolution of
wireless networks. It has been envisioned that 6G will be transformative and will …

SF-FWA: A Self-Adaptive Fast Fireworks Algorithm for effective large-scale optimization

M Chen, Y Tan - Swarm and Evolutionary Computation, 2023 - Elsevier
Computationally efficient algorithms for large-scale black-box optimization have become
increasingly important in recent years due to the growing complexity of engineering and …

Sophia: A scalable stochastic second-order optimizer for language model pre-training

H Liu, Z Li, D Hall, P Liang, T Ma - arXiv preprint arXiv:2305.14342, 2023 - arxiv.org
Given the massive cost of language model pre-training, a non-trivial improvement of the
optimization algorithm would lead to a material reduction on the time and cost of training …

Understanding gradient descent on the edge of stability in deep learning

S Arora, Z Li, A Panigrahi - International Conference on …, 2022 - proceedings.mlr.press
Deep learning experiments by Cohen et al. (2021) using deterministic Gradient
Descent (GD) revealed an Edge of Stability (EoS) phase when learning rate (LR) and …

A novel approach to large-scale dynamically weighted directed network representation

X Luo, H Wu, Z Wang, J Wang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
A dynamically weighted directed network (DWDN) is frequently encountered in various big
data-related applications like a terminal interaction pattern analysis system (TIPAS) …

Understanding contrastive learning requires incorporating inductive biases

N Saunshi, J Ash, S Goel, D Misra… - International …, 2022 - proceedings.mlr.press
Contrastive learning is a popular form of self-supervised learning that encourages
augmentations (views) of the same input to have more similar representations compared to …

Meta-learning with implicit gradients

A Rajeswaran, C Finn, SM Kakade… - Advances in neural …, 2019 - proceedings.neurips.cc
A core capability of intelligent systems is the ability to quickly learn new tasks by drawing on
prior experience. Gradient (or optimization) based meta-learning has recently emerged as …

Fine-grained analysis of optimization and generalization for overparameterized two-layer neural networks

S Arora, S Du, W Hu, Z Li… - … Conference on Machine …, 2019 - proceedings.mlr.press
Recent works have cast some light on the mystery of why deep nets fit any data and
generalize despite being very overparametrized. This paper analyzes training and …