追蹤
Hosein Hasanbeig
Hosein Hasanbeig
Microsoft Research
在 microsoft.com 的電子郵件地址已通過驗證 - 首頁
標題
引用次數
引用次數
年份
Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction Guarantees
M Hasanbeig, Y Kantaros, A Abate, D Kroening, GJ Pappas, I Lee
IEEE Conference on Decision and Control (CDC), 2019
1552019
Logically-Constrained Reinforcement Learning
M Hasanbeig, A Abate, D Kroening
arXiv preprint arXiv:1801.08099, 2018
1272018
Cautious Reinforcement Learning with Logical Constraints
M Hasanbeig, A Abate, D Kroening
AAMAS, 483-491, 2020
1072020
Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic
M Cai, M Hasanbeig, S Xiao, A Abate, Z Kan
IEEE Robotics and Automation and IROS, 2021
992021
Deep Reinforcement Learning with Temporal Logics
M Hasanbeig, D Kroening, A Abate
International Conference on Formal Modeling and Analysis of Timed Systems, 1-22, 2020
772020
Certified reinforcement learning with logic guidance
H Hasanbeig, D Kroening, A Abate
Artificial Intelligence 322, 103949, 2023
742023
Deepsynth: Program Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning
M Hasanbeig, NY Jeppu, A Abate, T Melham, D Kroening
AAAI Conference on Artificial Intelligence (AAAI-21), 2021
68*2021
Evaluating cognitive maps in large language models with cogeval: No emergent planning
I Momennejad, H Hasanbeig, FV Frujeri, H Sharma, RO Ness, N Jojic, ...
Advances in neural information processing systems 37, 2023
61*2023
Logically-Constrained Neural Fitted Q-iteration
M Hasanbeig, A Abate, D Kroening
AAMAS, 2012-2014, 2019
512019
Modular Deep Reinforcement Learning with Temporal Logic Specifications
LZ Yuan, M Hasanbeig, A Abate, D Kroening
arXiv preprint arXiv:1909.11591, 2019
482019
Towards Verifiable and Safe Model-free Reinforcement Learning
M Hasanbeig, D Kroening, A Abate
Workshop on Artificial Intelligence and Formal Verification, Logics …, 2020
31*2020
Shielding Atari Games with Bounded Prescience
M Giacobbe, M Hasanbeig, D Kroening, H Wijk
International Conference on Autonomous Agents and Multiagent Systems, 2021
302021
LCRL: Certified Policy Synthesis via Logically-Constrained Reinforcement Learning
M Hasanbeig, D Kroening, A Abate
International Conference on Quantitative Evaluation of Systems, 217-231, 2022
192022
Deepsynth: Program synthesis for automatic task segmentation in deep reinforcement learning
M Hasanbeig, NY Jeppu, A Abate, T Melham, D Kroening
arXiv preprint arXiv:1911.10244, 2019
192019
On Synchronous Binary Log-Linear Learning and Second Order Q-learning
M Hasanbeig, L Pavel
IFAC World Congress 50 (1), 8987-8992, 2017
142017
Allure: A systematic protocol for auditing and improving llm-based evaluation of text using iterative in-context-learning
H Hasanbeig, H Sharma, L Betthauser, FV Frujeri, I Momennejad
arXiv preprint arXiv:2309.13701 3, 2023
102023
From Game-theoretic Multi-agent Log Linear Learning to Reinforcement Learning
M Hasanbeig, L Pavel
arXiv preprint arXiv:1802.02277, 2018
92018
Distributed Coverage Control by Robot Networks in Unknown Environments using a Modified EM Algorithm
M Hasanbeig, L Pavel
International Journal of Computer and Information Engineering 11 (7), 815-823, 2017
82017
ALLURE: auditing and improving llm-based evaluation of text using iterative in-context-learning
H Hasanbeig, H Sharma, L Betthauser, FV Frujeri, I Momennejad
arXiv preprint arXiv:2309.13701, 2023
72023
Logically-correct reinforcement learning. CoRR abs/1801.08099
M Hasanbeig, A Abate, D Kroening
62017
系統目前無法執行作業,請稍後再試。
文章 1–20