Seguir
Thiago D. Simão
Título
Citado por
Citado por
Ano
WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning.
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
AAAI, 10639-10646, 2021
392021
AlwaysSafe: Reinforcement learning without safety constraint violations during training
TD Simão, N Jansen, MTJ Spaan
Proceedings of the 20th International Conference on Autonomous Agents and …, 2021
212021
Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
TD Simão, MTJ Spaan
Proceedings of the AAAI Conference on Artificial Intelligence 33, 4967-4974, 2019
212019
Safe Policy Improvement with an Estimated Baseline Policy
TD Simão, R Laroche, R Tachet des Combes
Proceedings of the 19th International Conference on Autonomous Agents and …, 2020
18*2020
Safety-constrained reinforcement learning with a distributional safety critic
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
Machine Learning, 1-29, 2022
62022
Structure Learning for Safe Policy Improvement
TD Simão, MTJ Spaan
Proceedings of the 28th International Joint Conference on Artificial …, 2019
62019
Training and transferring safe policies in reinforcement learning
Q Yang, TD Simão, N Jansen, SH Tindemans, MTJ Spaan
AAMAS 2022 Workshop on Adaptive Learning Agents, 2022
22022
A Modern Perspective on Safe Automated Driving for Different Traffic Dynamics Using Constrained Reinforcement Learning
D Kamran, TD Simão, Q Yang, CT Ponnambalam, J Fischer, MTJ Spaan, ...
2022 IEEE 25th International Conference on Intelligent Transportation …, 2022
12022
Planejamento probabilístico com becos sem saída
TD Simão
Universidade de São Paulo, 2017
12017
Planejamento Probabilístico com Becos Sem Saída
TD Simão, LN de Barros, FL Silva
XII Encontro Nacional de Inteligência Artificial e Computacional, 2015
12015
Desenvolvimento de Jogos 3D para a Educação a Distância
UA Leitão, TD Simão, JA Neves
VIII Congresso Brasileiro de Ensino Superior a Distância (ESUD). Ouro Preto …, 2011
12011
Safe Policy Improvement for POMDPs via Finite-State Controllers
TD Simão, M Suilen, N Jansen
arXiv preprint arXiv:2301.04939, 2023
2023
Safe Online and Offline Reinforcement Learning
TD Simão
2023
Targeted Adversarial Attacks on Deep Reinforcement Learning Policies via Model Checking
D Gross, TD Simao, N Jansen, GA Perez
arXiv preprint arXiv:2212.05337, 2022
2022
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Y Hogewind, TD Simão, T Kachman, N Jansen
arXiv preprint arXiv:2210.01801, 2022
2022
Robust Anytime Learning of Markov Decision Processes
M Suilen, TD Simão, N Jansen, D Parker
arXiv preprint arXiv:2205.15827, 2022
2022
Refined Risk Management in Safe Reinforcement Learning with a Distributional Safety Critic
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
International Workshop on Safe Reinforcement Learning, 2022
2022
Back to the Future: Solving Hidden Parameter MDPs with Hindsight
CT Ponnambalam, D Kamran, TD Simão, FA Oliehoek, MTJ Spaan
Adaptive Learning Agents Workshop at the 21st International Conference on …, 2022
2022
Safe and Sample-Efficient Reinforcement Learning Algorithms for Factored Environments.
TD Simão
IJCAI, 6460-6461, 2019
2019
When a Robot Reaches Out for Human Help
I Andrés, LN de Barros, DD Mauá, TD Simão
Ibero-American Conference on Artificial Intelligence, 277-289, 2018
2018
O sistema não pode efectuar a operação agora. Tente mais tarde.
Artigos 1–20