Google Scholar

About 28 results (0.02 s)

Let’s be honest: An optimal no-regret framework for zero-sum games

[PDF] ai-plans.com

[PDF] Nash learning from human feedback

R Munos, M Valko, D Calandriello, MG Azar… - ar…

… pace in the dynamic case

H Fang, NJA Harvey, VS Portella… - Journal of Machine …, 2022 - jmlr.org

Online mirror descent (OMD) and dual averaging (DA)--two fundamental algorithms for
online convex optimization--are known to have very similar (and sometimes identical) …

Cited by 37 - All 8 versions

A survey on noncooperative games and distributed Nash equilibrium seeking over multi-agent networks

P Yi, J Lei, X Li, S Liang, M Meng… - CAAI Artificial Intelligence …, 2022 - sciopen.com

The work gives a review on the distributed Nash equilibrium seeking of noncooperative
games in multi-agent networks, which emerges as one of the frontier research topics in the …

Cited by 8
