Follow
Kefan Dong
Kefan Dong
Verified email at stanford.edu - Homepage
Title
Cited by
Cited by
Year
Q-learning with ucb exploration is sample efficient for infinite-horizon mdp
K Dong, Y Wang, X Chen, L Wang
International Conference on Learning Representations, 2019
722019
Exploration via hindsight goal generation
Z Ren, K Dong, Y Zhou, Q Liu, J Peng
Advances in Neural Information Processing Systems 32, 2019
442019
Root-n-regret for learning in markov decision processes with function approximation and low bellman rank
K Dong, J Peng, Y Wang, Y Zhou
Conference on Learning Theory, 1554-1557, 2020
282020
Provable model-based nonlinear bandit and reinforcement learning: Shelve optimism, embrace virtual curvature
K Dong, J Yang, T Ma
Advances in Neural Information Processing Systems 34, 26168-26182, 2021
222021
On the expressivity of neural networks for deep reinforcement learning
K Dong, Y Luo, T Yu, C Finn, T Ma
International Conference on Machine Learning, 2627-2637, 2020
15*2020
Multinomial logit bandit with low switching cost
K Dong, Y Li, Q Zhang, Y Zhou
International Conference on Machine Learning, 2607-2615, 2020
132020
Design of experiments for stochastic contextual linear bandits
A Zanette, K Dong, JN Lee, E Brunskill
Advances in Neural Information Processing Systems 34, 22720-22731, 2021
62021
Asymptotic Instance-Optimal Algorithms for Interactive Decision Making
K Dong, T Ma
arXiv preprint arXiv:2206.02326, 2022
12022
Refined Analysis of FPL for Adversarial Markov Decision Processes
Y Wang, K Dong
arXiv preprint arXiv:2008.09251, 2020
12020
First Steps Toward Understanding the Extrapolation of Nonlinear Models to Unseen Domains
K Dong, T Ma
arXiv preprint arXiv:2211.11719, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–10