- Academic Search

B Hu, K Zhang, N Li, M Mesbahi… - Annual Review of …, 2023 - annualreviews.org

Gradient-based methods have been widely used for system design and optimization in
diverse application domains. Recently, there has been a renewed interest in studying …

Salva Cita Citato da 87 Articoli correlati Tutte e 6 le versioni

[Free GPT-4]

[PDF] ieee.org

A primer on zeroth-order optimization in signal processing and machine learning: Principals, recent advances, and applications

S Liu, PY Chen, B Kailkhura, G Zhang… - IEEE Signal …, 2020 - ieeexplore.ieee.org

Zeroth-order (ZO) optimization is a subset of gradient-free optimization that emerges in many
signal processing and machine learning (ML) applications. It is used for solving optimization …

Salva Cita Citato da 250 Articoli correlati Tutte e 7 le versioni

[Free GPT-4]

[PDF] neurips.cc

Fine-tuning language models with just forward passes

S Malladi, T Gao, E Nichani… - Advances in …, 2023 - proceedings.neurips.cc

Fine-tuning language models (LMs) has yielded success on diverse downstream tasks, but
as LMs grow in size, backpropagation requires a prohibitively large amount of memory …

Salva Cita Citato da 182 Articoli correlati Tutte e 6 le versioni Versione HTML

[Free GPT-4]

[PDF] arxiv.org

Derivative-free optimization methods

J Larson, M Menickelly, SM Wild - Acta Numerica, 2019 - cambridge.org

In many optimization problems arising from scientific, engineering and artificial intelligence
applications, objective and constraint functions are available only as the output of a black …

Salva Cita Citato da 511 Articoli correlati Tutte e 9 le versioni

[Free GPT-4]

[PDF] arxiv.org

Conditional gradient methods

G Braun, A Carderera, CW Combettes… - arxiv preprint arxiv …, 2022 - arxiv.org

The purpose of this survey is to serve both as a gentle introduction and a coherent overview
of state-of-the-art Frank--Wolfe algorithms, also called conditional gradient algorithms, for …

Salva Cita Citato da 57 Articoli correlati Tutte e 2 le versioni Versione HTML

[Free GPT-4]

[PDF] mlr.press

No-regret learning in time-varying zero-sum games

M Zhang, P Zhao, H Luo… - … Conference on Machine …, 2022 - proceedings.mlr.press

Learning from repeated play in a fixed two-player zero-sum game is a classic problem in
game theory and online learning. We consider a variant of this problem where the game …

Salva Cita Citato da 50 Articoli correlati Tutte e 7 le versioni Versione HTML

[Free GPT-4]

[PDF] mlr.press

Learning the globally optimal distributed LQ regulator

L Furieri, Y Zheng… - Learning for Dynamics …, 2020 - proceedings.mlr.press

We study model-free learning methods for the output-feedback Linear Quadratic (LQ) control
problem in finite-horizon subject to subspace constraints on the control policy. Subspace …

Salva Cita Citato da 95 Articoli correlati Tutte e 8 le versioni Versione HTML

[Free GPT-4]

[PDF] arxiv.org

Zero-th order algorithm for softmax attention optimization

Y Deng, Z Li, S Mahadevan… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org

Large language models (LLMs) have brought about significant transformations in human
society. Among the crucial computations in LLMs, the softmax unit holds great importance …

Salva Cita Citato da 10 Articoli correlati Tutte e 4 le versioni

[Free GPT-4]

[PDF] jmlr.org

Gradient-free optimization of highly smooth functions: improved analysis and a new algorithm

A Akhavan, E Chzhen, M Pontil, AB Tsybakov - Journal of Machine …, 2024 - jmlr.org

This work studies minimization problems with zero-order noisy oracle information under the
assumption that the objective function is highly smooth and possibly satisfies additional …

Salva Cita Citato da 12 Articoli correlati Tutte e 4 le versioni Versione HTML

[Free GPT-4]

[PDF] arxiv.org

Model-free nonlinear feedback optimization

Z He, S Bolognani, J He, F Dörfler… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Feedback optimization is a control paradigm that enables physical systems to autonomously
reach efficient operating points. Its central idea is to interconnect optimization iterations in …

Salva Cita Citato da 35 Articoli correlati Tutte e 4 le versioni

Crea avviso

Cita

Ricerca avanzata

Salvato in La mia biblioteca

Zeroth-order nonconvex stochastic optimization: Handling constraints, high dimensionality,...

Toward a theoretical foundation of policy optimization for learning control policies

A primer on zeroth-order optimization in signal processing and machine learning: Principals, recent advances, and applications

Fine-tuning language models with just forward passes

Derivative-free optimization methods

Conditional gradient methods

No-regret learning in time-varying zero-sum games

Learning the globally optimal distributed LQ regulator

Zero-th order algorithm for softmax attention optimization

Gradient-free optimization of highly smooth functions: improved analysis and a new algorithm

Model-free nonlinear feedback optimization