A new convergent variant of -learning with linear function approximation DS Carvalho, FS Melo, PA Santos
Advances in Neural Information Processing Systems 33, 2020
37 2020 The impact of data distribution on Q-learning with function approximation PP Santos, DS Carvalho, A Sardinha, FS Melo
Machine Learning, 1-23, 2024
3 * 2024 Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning PP Santos, DS Carvalho, M Vasco, A Sardinha, PA Santos, A Paiva, ...
arXiv preprint arXiv:2210.06274, 2022
2 2022 Hierarchically Structured Scheduling and Execution of Tasks in a Multi-agent Environment DS Carvalho, B Sengupta
EPIA Conference on Artificial Intelligence, 15-26, 2022
2 2022 Theoretical remarks on feudal hierarchies and reinforcement learning DS Carvalho, FS Melo, PA Santos
26th European Conference on Artificial Intelligence, 2023
1 2023 Multi-Bellman operator for convergence of -learning with linear function approximation DS Carvalho, PA Santos, FS Melo
arXiv preprint arXiv:2309.16819, 2023
1 2023 CHARET: Character-centered Approach to Emotion Tracking in Stories DS Carvalho, J Campos, M Guimarães, A Antunes, J Dias, PA Santos
arXiv preprint arXiv:2102.07537, 2021
1 2021 -learning with regularization converges with non-linear non-stationary featuresDS Carvalho, FS Melo, PA Santos
2022