Recent advances in reinforcement learning in finance

B Hambly, R Xu, H Yang - Mathematical Finance, 2023 - Wiley Online Library

The rapid changes in the finance industry due to the increasing amount of data have
revolutionized the techniques on data processing and data analysis and brought new …

Cited by 219 - All 14 versions

[PDF] arxiv.org

A survey on model-based reinforcement learning

FM Luo, T Xu, H Lai, XH Chen, W Zhang… - Science China Information …, 2024 - Springer

Reinforcement learning (RL) interacts with the environment to solve sequential decision-
making problems via a trial-and-error approach. Errors are always undesirable in real-world …

Cited by 134 - All 6 versions

[PDF] hust.edu.vn

[BOOK][B] Algorithms for decision making

MJ Kochenderfer, TA Wheeler, KH Wray - 2022 - books.google.com

A broad introduction to algorithms for decision making under uncertainty, introducing the
underlying mathematical problem formulations and the algorithms for solving them …

Cited by 240 - All 8 versions

[PDF] neurips.cc

MOReL: Model-based offline reinforcement learning

R Kidambi, A Rajeswaran… - Advances in neural …, 2020 - proceedings.neurips.cc

In offline reinforcement learning (RL), the goal is to learn a highly rewarding policy based
solely on a dataset of historical interactions with the environment. This serves as an extreme …

Cited by 798 - All 7 versions

[PDF] arxiv.org

Reinforcement learning in healthcare: A survey

C Yu, J Liu, S Nemati, G Yin - ACM Computing Surveys (CSUR), 2021 - dl.acm.org

As a subfield of machine learning, reinforcement learning (RL) aims at optimizing decision
making by using interaction samples of an agent with its environment and the potentially …

Cited by 794 - All 5 versions

[PDF] nowpublishers.com

An introduction to deep reinforcement learning

V François-Lavet, P Henderson, R Islam… - … and Trends® in …, 2018 - nowpublishers.com

Deep reinforcement learning is the combination of reinforcement learning (RL) and deep
learning. This field of research has been able to solve a wide range of complex …

Cited by 1971 - All 16 versions

[PDF] arxiv.org

Learning to explore using active neural SLAM

DS Chaplot, D Gandhi, S Gupta, A Gupta… - arXiv preprint arXiv …, 2020 - arxiv.org

This work presents a modular and hierarchical approach to learn policies for exploring 3D
environments, called 'Active Neural SLAM'. Our approach leverages the strengths of both …

Cited by 612 - All 7 versions

[PDF] jmlr.org

On the theory of policy gradient methods: Optimality, approximation, and distribution shift

A Agarwal, SM Kakade, JD Lee, G Mahajan - Journal of Machine Learning …, 2021 - jmlr.org

Policy gradient methods are among the most effective methods in challenging reinforcement
learning problems with large state and/or action spaces. However, little is known about even …

Cited by 516 - All 13 versions

[PDF] neurips.cc

Policy finetuning: Bridging sample-efficient offline and online reinforcement learning

T Xie, N Jiang, H Wang, C Xiong… - Advances in neural …, 2021 - proceedings.neurips.cc

Recent theoretical work studies sample-efficient reinforcement learning (RL) extensively in
two settings: learning interactively in the environment (online RL), or learning from an offline …

Cited by 192 - All 9 versions

[PDF] tor-lattimore.com

[BOOK][B] Bandit algorithms

T Lattimore, C Szepesvári - 2020 - books.google.com

Decision-making in the face of uncertainty is a significant challenge in machine learning,
and the multi-armed bandit model is a commonly used framework to address it. This …

Cited by 3357 - All 9 versions
