Seguir
Diogo S. Carvalho
Diogo S. Carvalho
Instituto Superior Técnico, University of Lisbon, and INESC-ID
Email confirmado em tecnico.ulisboa.pt
Título
Citado por
Citado por
Ano
A new convergent variant of -learning with linear function approximation
DS Carvalho, FS Melo, PA Santos
Advances in Neural Information Processing Systems 33, 2020
372020
The impact of data distribution on Q-learning with function approximation
PP Santos, DS Carvalho, A Sardinha, FS Melo
Machine Learning, 1-23, 2024
3*2024
Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning
PP Santos, DS Carvalho, M Vasco, A Sardinha, PA Santos, A Paiva, ...
arXiv preprint arXiv:2210.06274, 2022
22022
Hierarchically Structured Scheduling and Execution of Tasks in a Multi-agent Environment
DS Carvalho, B Sengupta
EPIA Conference on Artificial Intelligence, 15-26, 2022
22022
Theoretical remarks on feudal hierarchies and reinforcement learning
DS Carvalho, FS Melo, PA Santos
26th European Conference on Artificial Intelligence, 2023
12023
Multi-Bellman operator for convergence of -learning with linear function approximation
DS Carvalho, PA Santos, FS Melo
arXiv preprint arXiv:2309.16819, 2023
12023
CHARET: Character-centered Approach to Emotion Tracking in Stories
DS Carvalho, J Campos, M Guimarães, A Antunes, J Dias, PA Santos
arXiv preprint arXiv:2102.07537, 2021
12021
-learning with regularization converges with non-linear non-stationary features
DS Carvalho, FS Melo, PA Santos
2022
O sistema não pode efectuar a operação agora. Tente mais tarde.
Artigos 1–8