Seguir
Thiago D. Simão
Título
Citado por
Citado por
Ano
WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
AAAI, 10639-10646, 2021
1062021
AlwaysSafe: Reinforcement learning without safety constraint violations during training
TD Simão, N Jansen, MTJ Spaan
AAMAS, 1226-1235, 2021
472021
Safe Policy Improvement with an Estimated Baseline Policy
TD Simão, R Laroche, R Tachet des Combes
AAMAS, 1269-1277, 2020
33*2020
Safety-constrained reinforcement learning with a distributional safety critic
Q Yang, TD Simão, SH Tindemans, MTJ Spaan
Machine Learning 112 (3), 859-887, 2023
312023
Safe Policy Improvement with Baseline Bootstrapping in Factored Environments
TD Simão, MTJ Spaan
AAAI, 4967-4974, 2019
312019
Robust anytime learning of Markov decision processes
M Suilen, TD Simão, D Parker, N Jansen
NeurIPS, 28790-28802, 2022
232022
Structure Learning for Safe Policy Improvement
TD Simão, MTJ Spaan
IJCAI, 3453-3459, 2019
122019
Safe policy improvement for POMDPs via finite-state controllers
TD Simão, M Suilen, N Jansen
AAAI, 15109-15117, 2023
112023
Decision-making under uncertainty: beyond probabilities: Challenges and perspectives
T Badings, TD Simão, M Suilen, N Jansen
International Journal on Software Tools for Technology Transfer 25 (3), 375-391, 2023
102023
Safe Reinforcement Learning From Pixels Using a Stochastic Latent Representation
Y Hogewind, TD Simão, T Kachman, N Jansen
ICLR, 2023
92023
Reinforcement Learning by Guided Safe Exploration
Q Yang, TD Simão, N Jansen, SH Tindemans, MTJ Spaan
ECAI, 2858-2865, 2023
8*2023
A Modern Perspective on Safe Automated Driving for Different Traffic Dynamics Using Constrained Reinforcement Learning
D Kamran, TD Simão, Q Yang, CT Ponnambalam, J Fischer, MTJ Spaan, ...
2022 IEEE 25th International Conference on Intelligent Transportation …, 2022
72022
More for Less: Safe Policy Improvement With Stronger Performance Guarantees
P Wienhöft, M Suilen, TD Simão, C Dubslaff, C Baier, N Jansen
arXiv preprint arXiv:2305.07958, 2023
52023
Act-then-measure: reinforcement learning for partially observable environments with active measuring
M Krale, TD Simão, N Jansen
Proceedings of the International Conference on Automated Planning and …, 2023
42023
Scalable Safe Policy Improvement via Monte Carlo Tree Search
A Castellini, F Bianchi, E Zorzi, TD Simão, A Farinelli, MTJ Spaan
International Conference on Machine Learning, 3732-3756, 2023
32023
Recursive small-step multi-agent A* for dec-POMDPs
W Koops, N Jansen, S Junges, TD Simão
Sl: IJCAI, 2023
22023
Planejamento probabilístico com becos sem saída
TD Simão
Universidade de São Paulo, 2017
22017
Utilização de algoritmos genéticos para otimização de soluções para o timetabling escolar
TD SIMÃO
Tese apresentada ao Departamento de Ciência da Computação da Universidade …, 2013
22013
Risk-aware curriculum generation for heavy-tailed task distributions
C Koprulu, TD Simão, N Jansen, U Topcu
Uncertainty in Artificial Intelligence, 1132-1142, 2023
12023
Safe and Sample-Efficient Reinforcement Learning Algorithms for Factored Environments.
TD Simão
IJCAI, 6460-6461, 2019
12019
O sistema não pode efectuar a operação agora. Tente mais tarde.
Artigos 1–20