Rui Zhao
Título
Citado por
Citado por
Ano
Two-stream RNN/CNN for action recognition in 3D videos
R Zhao, H Ali, P Van der Smagt
2017 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2017
542017
Energy-based hindsight experience prioritization
R Zhao, V Tresp
2018 Conference on Robot Learning (CoRL) (Oral), 2018
342018
Maximum Entropy-Regularized Multi-Goal Reinforcement Learning
R Zhao, X Sun, V Tresp
2019 International Conference on Machine Learning (ICML), 2019
322019
Curiosity-Driven Experience Prioritization via Density Estimation
R Zhao, V Tresp
2018 NeurIPS (NIPS) Deep Reinforcement Learning Workshop, 2019
242019
Improving Goal-Oriented Visual Dialog Agents via Advanced Recurrent Nets with Tempered Policy Gradient
R Zhao, V Tresp
2018 IJCAI Linguistic and Cognitive Approaches To Dialog Agents Workshop, 2018
172018
Learning goal-oriented visual dialog via tempered policy gradient
R Zhao, V Tresp
2018 IEEE Spoken Language Technology (SLT), 868-875, 2018
102018
Efficient dialog policy learning via positive memory retention
R Zhao, V Tresp
2018 IEEE Spoken Language Technology (SLT), 823-830, 2018
72018
Mutual Information State Intrinsic Control
R Zhao, Y Gao, P Abbeel, V Tresp, W Xu
2021 International Conference on Learning Representations (ICLR) (Spotlight), 2021
2021
Maximum entropy regularised multi-goal reinforcement learning
V Tresp, R Zhao
US Patent App. 16/385,209, 2020
2020
Deep reinforcement learning in robotics and dialog systems
R Zhao
Ludwig Maximilian University of Munich, 2020
2020
Learning Individualized Treatment Rules with Estimated Translated Inverse Propensity Score
Z Wu, Y Yang, Y Ma, Y Liu, R Zhao, M Moor, V Tresp
2020 IEEE International Conference on Healthcare Informatics (ICHI) (Best Paper), 2020
2020
Mutual Information-based State-Control for Intrinsically Motivated Reinforcement Learning
R Zhao, Y Gao, P Abbeel, V Tresp, W Xu
arXiv preprint arXiv:2002.01963, 2020
2020
Self-Supervised State-Control through Intrinsic Mutual Information Rewards
R Zhao, V Tresp, W Xu
2019
O sistema não pode efectuar a operação agora. Tente novamente mais tarde.
Artigos 1–13