Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor T Haarnoja, A Zhou, P Abbeel, S Levine International conference on machine learning, 1861-1870, 2018 | 9233 | 2018 |
Soft actor-critic algorithms and applications T Haarnoja, A Zhou, K Hartikainen, G Tucker, S Ha, J Tan, V Kumar, ... arXiv preprint arXiv:1812.05905, 2018 | 2845 | 2018 |
Conservative q-learning for offline reinforcement learning A Kumar, A Zhou, G Tucker, S Levine Advances in Neural Information Processing Systems 33, 1179-1191, 2020 | 1788 | 2020 |
Efficient off-policy meta-reinforcement learning via probabilistic context variables K Rakelly, A Zhou, C Finn, S Levine, D Quillen International conference on machine learning, 5331-5340, 2019 | 722 | 2019 |
Learning to walk via deep reinforcement learning T Haarnoja, S Ha, A Zhou, J Tan, G Tucker, S Levine arXiv preprint arXiv:1812.11103, 2018 | 548 | 2018 |
Composable deep reinforcement learning for robotic manipulation T Haarnoja, V Pong, A Zhou, M Dalal, P Abbeel, S Levine 2018 IEEE international conference on robotics and automation (ICRA), 6244-6251, 2018 | 291 | 2018 |
Wayformer: Motion forecasting via simple & efficient attention networks N Nayakanti, R Al-Rfou, A Zhou, K Goel, KS Refaat, B Sapp 2023 IEEE International Conference on Robotics and Automation (ICRA), 2980-2987, 2023 | 208 | 2023 |
Soft actor-critic algorithms and applications. arXiv 2018 T Haarnoja, A Zhou, K Hartikainen, G Tucker, S Ha, J Tan, V Kumar, ... arXiv preprint arXiv:1812.05905, 1812 | 160 | 1812 |
Motionlm: Multi-agent motion forecasting as language modeling A Seff, B Cera, D Chen, M Ng, A Zhou, N Nayakanti, KS Refaat, R Al-Rfou, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 61 | 2023 |
Bayesian adaptation for covariate shift A Zhou, S Levine Advances in neural information processing systems 34, 914-927, 2021 | 40 | 2021 |
Mural: Meta-learning uncertainty-aware rewards for outcome-driven reinforcement learning K Li, A Gupta, A Reddy, VH Pong, A Zhou, J Yu, S Levine International conference on machine learning, 6346-6356, 2021 | 37 | 2021 |
Amortized conditional normalized maximum likelihood: Reliable out of distribution uncertainty estimation A Zhou, S Levine International Conference on Machine Learning, 12803-12812, 2021 | 24* | 2021 |
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement T Haarnoja, A Zhou, P Abbeel, S Levine Proceedings of the 35th International Conference on Machine Learning. July …, 1861 | 5 | 1861 |
2023 IEEE International Conference on Robotics and Automation (ICRA) N Nayakanti, R Al‐Rfou, A Zhou, K Goel, KS Refaat, B Sapp IEEE, 2022 | 4 | 2022 |