Follow
Saba Yahyaa
Saba Yahyaa
PhD student of Computer Science, Free University of Brussels (VUB)
Verified email at vub.ac.be - Homepage
Title
Cited by
Cited by
Year
Thompson Sampling for Multi-Objective Multi-Armed Bandits Problem.
SQ Yahyaa, B Manderick
ESANN, 2015
302015
Knowledge Gradient for Multi-objective Multi-armed Bandit Algorithms.
SQ Yahyaa, MM Drugan, B Manderick
ICAART (1), 74-83, 2014
242014
The scalarized multi-objective multi-armed bandit problem: An empirical study of its exploration vs. exploitation tradeoff
SQ Yahyaa, MM Drugan, B Manderick
2014 International Joint Conference on Neural Networks (IJCNN), 2290-2297, 2014
232014
Annealing-pareto multi-objective multi-armed bandit algorithm
SQ Yahyaa, MM Drugan, B Manderick
2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement …, 2014
222014
Thompson Sampling in the Adaptive Linear Scalarized Multi Objective Multi Armed Bandit.
SQ Yahyaa, MM Drugan, B Manderick
ICAART (2), 55-65, 2015
152015
The exploration vs exploitation trade-off in the multi-armed bandit problem: An empirical study
SQ Yahyaa, B Manderick
Proceedings of the 20th European Symposium on Artificial Neural Networks …, 2012
52012
Shortest path gaussian kernels for state action graphs: An empirical study
S Yahyaa, B Manderick
BNAIC 2012 The 24th Benelux Conference on Artificial Intelligence, 250, 2012
52012
Knowledge gradient for online reinforcement learning
S Yahyaa, B Manderick
Agents and Artificial Intelligence: 6th International Conference, ICAART …, 2015
42015
Multivariate normal distribution based multi-armed bandit pareto algorithm
SQ Yahyaa, MM Drugan, B Manderick
The 7th European Conference on Machine Learning and Principles and Practice …, 2014
42014
Correlated Gaussian multi-objective multi-armed bandit across arms algorithm
SQ Yahyaa, MM Drugan
2015 IEEE Symposium Series on Computational Intelligence, 593-600, 2015
32015
Knowledge Gradient Exploration in Online Least Squares Policy Iteration.
SQ Yahyaa, B Manderick
ICAART (2), 263-269, 2013
32013
Scalarized and pareto knowledge gradient for multi-objective multi-armed bandits
S Yahyaa, MM Drugan, B Manderick
Transactions on Computational Collective Intelligence XX, 99-116, 2015
22015
Linear Scalarized Knowledge Gradient in the Multi-Objective Multi-Armed Bandits Problem.
SQ Yahyaa, MM Drugan, B Manderick
ESANN, 2014
22014
Knowledge gradient exploration in online kernel-based LSPI
S Yahyaa, B Manderick
BNAIC 2013: Proceedings of the 25th Benelux Conference on Artificial …, 2013
22013
Annealing linear scalarized based multi-objective multi-armed bandit algorithm
SQ Yahyaa, MM Drugan, B Manderick
2015 IEEE Congress on Evolutionary Computation (CEC), 1738-1745, 2015
2015
Explorations in Reinforcement Learning: Online Action Selection and Value Function Approximation
SQ Yahyaa
2015
Online Knowledge Gradient Exploration in an Unknown Environment.
SQ Yahyaa, B Manderick
ICAART (1), 5-13, 2014
2014
Empirical Evaluation of Shortest Path Gaussian Kernels over State Action Graphs.
SQ Yahyaa, B Manderick
ICAART (2), 225-231, 2013
2013
The Exploration vs Exploitation Trade-Off in Bandit Problems: An Empirical Study
S Yahyaa, B Manderick
rn 1, 2, 2012
2012
The system can't perform the operation now. Try again later.
Articles 1–19