Seguir
Hosein Hasanbeig
Hosein Hasanbeig
Microsoft Research
Email confirmado em microsoft.com - Página inicial
Título
Citado por
Citado por
Ano
Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction Guarantees
M Hasanbeig, Y Kantaros, A Abate, D Kroening, GJ Pappas, I Lee
IEEE Conference on Decision and Control (CDC), 2019
1372019
Logically-Constrained Reinforcement Learning
M Hasanbeig, A Abate, D Kroening
arXiv preprint arXiv:1801.08099, 2018
1132018
Cautious Reinforcement Learning with Logical Constraints
M Hasanbeig, A Abate, D Kroening
AAMAS, 483-491, 2020
962020
Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic
M Cai, M Hasanbeig, S Xiao, A Abate, Z Kan
IEEE Robotics and Automation and IROS, 2021
802021
Certified reinforcement learning with logic guidance
H Hasanbeig, D Kroening, A Abate
Artificial Intelligence 322, 103949, 2023
642023
Deep Reinforcement Learning with Temporal Logics
M Hasanbeig, D Kroening, A Abate
International Conference on Formal Modeling and Analysis of Timed Systems, 1-22, 2020
612020
Deepsynth: Program Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning
M Hasanbeig, NY Jeppu, A Abate, T Melham, D Kroening
AAAI Conference on Artificial Intelligence (AAAI-21), 2021
52*2021
Logically-Constrained Neural Fitted Q-iteration
M Hasanbeig, A Abate, D Kroening
AAMAS, 2012-2014, 2019
512019
Modular Deep Reinforcement Learning with Temporal Logic Specifications
LZ Yuan, M Hasanbeig, A Abate, D Kroening
arXiv preprint arXiv:1909.11591, 2019
482019
Towards Verifiable and Safe Model-free Reinforcement Learning
M Hasanbeig, D Kroening, A Abate
Workshop on Artificial Intelligence and Formal Verification, Logics …, 2020
28*2020
Evaluating cognitive maps in large language models with cogeval: No emergent planning
I Momennejad, H Hasanbeig, FV Frujeri, H Sharma, RO Ness, N Jojic, ...
Advances in neural information processing systems 37, 2023
25*2023
Shielding Atari Games with Bounded Prescience
M Giacobbe, M Hasanbeig, D Kroening, H Wijk
International Conference on Autonomous Agents and Multiagent Systems, 2021
242021
Deepsynth: Program synthesis for automatic task segmentation in deep reinforcement learning
M Hasanbeig, NY Jeppu, A Abate, T Melham, D Kroening
arXiv preprint arXiv:1911.10244, 2019
192019
LCRL: Certified Policy Synthesis via Logically-Constrained Reinforcement Learning
M Hasanbeig, D Kroening, A Abate
International Conference on Quantitative Evaluation of Systems, 217-231, 2022
152022
On Synchronous Binary Log-Linear Learning and Second Order Q-learning
M Hasanbeig, L Pavel
IFAC World Congress 50 (1), 8987-8992, 2017
122017
Allure: A systematic protocol for auditing and improving llm-based evaluation of text using iterative in-context-learning
H Hasanbeig, H Sharma, L Betthauser, FV Frujeri, I Momennejad
arXiv preprint arXiv:2309.13701, 2023
82023
Distributed Coverage Control by Robot Networks in Unknown Environments using a Modified EM Algorithm
M Hasanbeig, L Pavel
International Journal of Computer and Information Engineering 11 (7), 815-823, 2017
82017
From Game-theoretic Multi-agent Log Linear Learning to Reinforcement Learning
M Hasanbeig, L Pavel
arXiv preprint arXiv:1802.02277, 2018
72018
Jump operator planning: Goal-conditioned policy ensembles and zero-shot transfer
TJ Ringstrom, M Hasanbeig, A Abate
arXiv preprint arXiv:2007.02527, 2020
62020
Logically-correct reinforcement learning. CoRR abs/1801.08099
M Hasanbeig, A Abate, D Kroening
62017
O sistema não pode efectuar a operação agora. Tente mais tarde.
Artigos 1–20