Google znalac

J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang… - arxiv preprint arxiv …, 2023 - arxiv.org

AI alignment aims to make AI systems behave in line with human intentions and values. As
AI systems grow more capable, so do risks from misalignment. To provide a comprehensive …

Spremi Citiraj Spominje se 245 puta Srodni članci Svih 4 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[HTML] sciencedirect.com

[HTML][HTML] How are reinforcement learning and deep learning algorithms used for big data based decision making in financial industries–A review and research agenda

V Singh, SS Chen, M Singhania, B Nanavati… - International Journal of …, 2022 - Elsevier

Data availability and accessibility have brought in unseen changes in the finance systems
and new theoretical and computational challenges. For example, in contrast to classical …

Spremi Citiraj Spominje se 160 puta Srodni članci Svih 2 inačica

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Building cooperative embodied agents modularly with large language models

H Zhang, W Du, J Shan, Q Zhou, Y Du… - arxiv preprint arxiv …, 2023 - arxiv.org

In this work, we address challenging multi-agent cooperation problems with decentralized
control, raw sensory observations, costly communication, and multi-objective tasks …

Spremi Citiraj Spominje se 189 puta Srodni članci Svih 5 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] science.org

Mastering the game of Stratego with model-free multiagent reinforcement learning

J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub… - Science, 2022 - science.org

We introduce DeepNash, an autonomous agent that plays the imperfect information game
Stratego at a human expert level. Stratego is one of the few iconic board games that artificial …

Spremi Citiraj Spominje se 260 puta Srodni članci Svih 8 inačica

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Video pretraining (vpt): Learning to act by watching unlabeled online videos

B Baker, I Akkaya, P Zhokov… - Advances in …, 2022 - proceedings.neurips.cc

Pretraining on noisy, internet-scale datasets has been heavily studied as a technique for
training models with broad, general capabilities for text, images, and other modalities …

Spremi Citiraj Spominje se 296 puta Srodni članci Svih 6 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Compute trends across three eras of machine learning

J Sevilla, L Heim, A Ho, T Besiroglu… - … Joint Conference on …, 2022 - ieeexplore.ieee.org

Compute, data, and algorithmic advances are the three fundamental factors that drive
progress in modern Machine Learning (ML). In this paper we study trends in the most readily …

Spremi Citiraj Spominje se 381 puta Srodni članci Svih 4 inačica

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

Multi-agent deep reinforcement learning: a survey

S Gronauer, K Diepold - Artificial Intelligence Review, 2022 - Springer

The advances in reinforcement learning have recorded sublime success in various domains.
Although the multi-agent domain has been overshadowed by its single-agent counterpart …

Spremi Citiraj Spominje se 724 puta Srodni članci Svih 9 inačica

[Free GPT-4]
[DeepSeek]

[PDF] nature.com

From attribution maps to human-understandable explanations through concept relevance propagation

R Achtibat, M Dreyer, I Eisenbraun, S Bosse… - Nature Machine …, 2023 - nature.com

The field of explainable artificial intelligence (XAI) aims to bring transparency to today's
powerful but opaque deep learning models. While local XAI methods explain individual …

Spremi Citiraj Spominje se 166 puta Srodni članci Svih 6 inačica

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Habitat 3.0: A co-habitat for humans, avatars and robots

X Puig, E Undersander, A Szot, MD Cote… - arxiv preprint arxiv …, 2023 - arxiv.org

We present Habitat 3.0: a simulation platform for studying collaborative human-robot tasks in
home environments. Habitat 3.0 offers contributions across three dimensions:(1) Accurate …

Spremi Citiraj Spominje se 91 puta Srodni članci Svih 4 inačica Prikaži kao HTML

[Free GPT-4]
[DeepSeek]

[PDF] ieee.org

Meta-learning in neural networks: A survey

T Hospedales, A Antoniou, P Micaelli… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

The field of meta-learning, or learning-to-learn, has seen a dramatic rise in interest in recent
years. Contrary to conventional approaches to AI where tasks are solved from scratch using …

Spremi Citiraj Spominje se 2560 puta Srodni članci Svih 11 inačica

Stvori obavijest

Citiraj

Napredno pretraživanje

Spremljeno u Moju knjižnicu

Human-level performance in 3D multiplayer games with population-based reinforcement learning

Ai alignment: A comprehensive survey

[HTML][HTML] How are reinforcement learning and deep learning algorithms used for big data based decision making in financial industries–A review and research agenda

Building cooperative embodied agents modularly with large language models

Mastering the game of Stratego with model-free multiagent reinforcement learning

Video pretraining (vpt): Learning to act by watching unlabeled online videos

Compute trends across three eras of machine learning

Multi-agent deep reinforcement learning: a survey

From attribution maps to human-understandable explanations through concept relevance propagation

Habitat 3.0: A co-habitat for humans, avatars and robots

Meta-learning in neural networks: A survey