Seguir
Bruno Castro da Silva
Título
Citado por
Citado por
Ano
Dealing with non-stationary environments using context detection
BC da Silva, EW Basso, ALC Bazzan, PM Engel
International Conference on Machine Learning (ICML 2006), 217-224, 2006
2242006
Learning parameterized skills
BC da Silva, G Konidaris, A Barto
International Conference on Machine Learning (ICML 2012), 2012
2182012
Preventing undesirable behavior of intelligent machines
PS Thomas, B Castro da Silva, AG Barto, S Giguere, Y Brun, E Brunskill
Science 366 (6468), 999-1004, 2019
1882019
Learning in groups of traffic signals
ALC Bazzan, D De Oliveira, BC da Silva
Engineering Applications of Artificial Intelligence 23 (4), 560-568, 2010
1262010
Gaussian Processes for Learning and Control: A Tutorial with Examples
M Liu, G Chowdhary, BC Da Silva, SY Liu, JP How
IEEE Control Systems Magazine 38 (5), 53-86, 2018
1002018
Reinforcement Learning based Control of Traffic Lights in Non-stationary Environments: A Case Study in a Microscopic Simulator.
D de Oliveira, ALC Bazzan, BC da Silva, EW Basso, L Nunes, R Rossetti, ...
4th European Workshop on Multi-Agent Systems (EUMAS 2006), 2006
902006
ITSUMO: an intelligent transportation system for urban mobility
BC Da Silva, R Junges, D de Oliveira, ALC Bazzan
[Demonstration Track] (AAMAS 2006) - Proceedings of the 5th International …, 2006
772006
A task-and-technique centered survey on visual analytics for deep learning model engineering
R Garcia, AC Telea, BC da Silva, J Tørresen, JLD Comba
Computers & Graphics 77, 30-49, 2018
582018
Learning parameterized motor skills on a humanoid robot
BC Da Silva, G Baldassarre, G Konidaris, A Barto
IEEE International Conference on Robotics and Automation (ICRA 2014), 5239-5244, 2014
572014
Universal off-policy evaluation
Y Chandak, S Niekum, B da Silva, E Learned-Miller, E Brunskill, ...
Advances in Neural Information Processing Systems (NeurIPS 2021) 34, 27475-27490, 2021
482021
Analysing the impact of travel information for minimising the regret of route choice
GO Ramos, ALC Bazzan, BC da Silva
Transportation Research Part C: Emerging Technologies 88, 257-271, 2018
462018
Fairness Guarantees under Demographic Shift
S Giguere, B Metevier, BC da Silva, Y Brun, PS Thomas, S Niekum
International Conference on Learning Representations (ICLR 2022), 2022
422022
Adaptive traffic control with reinforcement learning
B da Silva, D Oliveira, AL Bazzan, EW Basso
4th Workshop on Agents in Traffic and Transportation (ATT@AAMAS 2006), 80-86, 2006
392006
Optimistic linear support and successor features as a basis for optimal policy transfer
LN Alegre, A Bazzan, BC Da Silva
International Conference on Machine Learning (ICML 2022), 394-413, 2022
292022
Improving reinforcement learning with context detection
BC Da Silva, EW Basso, FS Perotto, AL C Bazzan, PM Engel
(AAMAS 2006) Intl. Joint Conference on Autonomous Agents and Multiagent …, 2006
292006
Active learning of parameterized skills
B Da Silva, G Konidaris, A Barto
International Conference on Machine Learning (ICML 2014), 1737-1745, 2014
282014
Autonomous Reinforcement Learning of Multiple Interrelated Tasks
VG Santucci, E Cartoni, BC da Silva, G Baldassarre
International Conference on Development and Learning (ICDL 2019), 2019
272019
Energetic natural gradient descent
P Thomas, BC Silva, C Dann, E Brunskill
International Conference on Machine Learning (ICML 2016), 2887-2895, 2016
232016
Comparing multi-armed bandit algorithms and Q-learning for multiagent action selection: a case study in route choice
TBF de Oliveira, ALC Bazzan, BC da Silva, R Grunitzki
International Joint Conference on Neural Networks (IJCNN 2018), 1-8, 2018
222018
Learning to minimise regret in route choice
GO Ramos, BC da Silva, ALC Bazzan
(AAMAS 2017) Intl. Joint Conference on Autonomous Agents and Multiagent …, 2017
222017
O sistema não pode efectuar a operação agora. Tente mais tarde.
Artigos 1–20