- Academic Search

S Milani, N Topin, M Veloso, F Fang - ACM Computing Surveys, 2024‏ - dl.acm.org‏

Explainable reinforcement learning (XRL) is an emerging subfield of explainable machine
learning that has attracted considerable attention in recent years. The goal of XRL is to …‏

שמור צטט צוטט על ידי 77 מאמרים בנושא זה

[Free GPT-4]
[DeepSeek]

[PDF] royalsocietypublishing.org Full View‏

Inductive biases for deep learning of higher-level cognition‏

A Goyal, Y Bengio - Proceedings of the Royal Society A, 2022‏ - royalsocietypublishing.org‏

A fascinating hypothesis is that human and animal intelligence could be explained by a few
principles (rather than an encyclopaedic list of heuristics). If that hypothesis was correct, we …‏

שמור צטט צוטט על ידי 437 מאמרים בנושא זה כל 5 הגרסאות

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Principle-driven self-alignment of language models from scratch with minimal human supervision‏

Z Sun, Y Shen, Q Zhou, H Zhang… - Advances in …, 2023‏ - proceedings.neurips.cc‏

Recent AI-assistant agents, such as ChatGPT, predominantly rely on supervised fine-tuning
(SFT) with human annotations and reinforcement learning from human feedback (RLHF) to …‏

שמור צטט צוטט על ידי 328 מאמרים בנושא זה כל 8 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Describe, explain, plan and select: Interactive planning with large language models enables open-world multi-task agents‏

Z Wang, S Cai, G Chen, A Liu, X Ma, Y Liang - ar** multi-task embodied agents. We've …‏

שמור צטט צוטט על ידי 77 מאמרים בנושא זה כל 3 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

Do embodied agents dream of pixelated sheep: Embodied decision making using language guided world modelling‏

K Nottingham, P Ammanabrolu, A Suhr… - International …, 2023‏ - proceedings.mlr.press‏

Reinforcement learning (RL) agents typically learn tabula rasa, without prior knowledge of
the world. However, if initialized with knowledge of high-level subgoals and transitions …‏

שמור צטט צוטט על ידי 83 מאמרים בנושא זה כל 11 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Interpretable reward redistribution in reinforcement learning: A causal approach‏

Y Zhang, Y Du, B Huang, Z Wang… - Advances in …, 2023‏ - proceedings.neurips.cc‏

A major challenge in reinforcement learning is to determine which state-action pairs are
responsible for future rewards that are delayed. Reward redistribution serves as a solution to …‏

שמור צטט צוטט על ידי 19 מאמרים בנושא זה כל 11 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

SALMON: Self-alignment with instructable reward models‏

Z Sun, Y Shen, H Zhang, Q Zhou, Z Chen… - arxiv preprint arxiv …, 2023‏ - arxiv.org‏

Supervised Fine-Tuning (SFT) on response demonstrations combined with Reinforcement
Learning from Human Feedback (RLHF) constitutes a powerful paradigm for aligning LLM …‏

שמור צטט צוטט על ידי 20 מאמרים בנושא זה כל 5 הגרסאות פתיחה בתור HTML

[Free GPT-4]
[DeepSeek]

[PDF] mlr.press

A dataset perspective on offline reinforcement learning‏

K Schweighofer, M Dinu, A Radler… - Conference on …, 2022‏ - proceedings.mlr.press‏

Abstract The application of Reinforcement Learning (RL) in real world environments can be
expensive or risky due to sub-optimal policies during training. In Offline RL, this problem is …‏

שמור צטט צוטט על ידי 57 מאמרים בנושא זה כל 9 הגרסאות פתיחה בתור HTML

יצירת התראה

צטט

חיפוש מתקדם

נשמר בספרייה שלי

Align-rudder: Learning from few demonstrations by reward redistribution

Explainable reinforcement learning: A survey and comparative review‏

Inductive biases for deep learning of higher-level cognition‏

Principle-driven self-alignment of language models from scratch with minimal human supervision‏

Describe, explain, plan and select: Interactive planning with large language models enables open-world multi-task agents‏

Do embodied agents dream of pixelated sheep: Embodied decision making using language guided world modelling‏

Interpretable reward redistribution in reinforcement learning: A causal approach‏

SALMON: Self-alignment with instructable reward models‏

A dataset perspective on offline reinforcement learning‏