Ofir Nachum
Google Brain
Verified email at google.com
Title | Cited by | Year
Learning to remember rare events
Ł Kaiser, O Nachum, A Roy, S Bengio
International Conference on Learning Representations, 2017
Cited by 140*, 2017
Bridging the gap between value and policy based reinforcement learning
O Nachum, M Norouzi, K Xu, D Schuurmans
Advances in Neural Information Processing Systems, 2775-2785, 2017
Cited by 137, 2017
Data-Efficient Hierarchical Reinforcement Learning
O Nachum, S Gu, H Lee, S Levine
Advances in Neural Information Processing Systems, 2018
Cited by 107, 2018
MorphNet: Fast & simple resource-constrained structure learning of deep networks
A Gordon, E Eban, O Nachum, B Chen, H Wu, TJ Yang, E Choi
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
Cited by 88, 2018
Trust-PCL: An off-policy trust region method for continuous control
O Nachum, M Norouzi, K Xu, D Schuurmans
International Conference on Learning Representations, 2018
Cited by 50, 2018
A Lyapunov-based Approach to Safe Reinforcement Learning
Y Chow, O Nachum, E Duenez-Guzman, M Ghavamzadeh
Advances in Neural Information Processing Systems, 2018
Cited by 41, 2018
Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods
D Quillen, E Jang, O Nachum, C Finn, J Ibarz, S Levine
IEEE International Conference on Robotics and Automation, 2018
Cited by 40, 2018
Near-optimal representation learning for hierarchical reinforcement learning
O Nachum, S Gu, H Lee, S Levine
arXiv preprint arXiv:1810.01257, 2018
Cited by 30, 2018
Improving policy gradient by exploring under-appreciated rewards
O Nachum, M Norouzi, D Schuurmans
International Conference on Learning Representations, 2017
Cited by 16, 2017
Path consistency learning in Tsallis entropy regularized MDPs
Y Chow, O Nachum, M Ghavamzadeh
International Conference on Machine Learning, 979-988, 2018
Cited by 15*, 2018
Identifying and correcting label bias in machine learning
H Jiang, O Nachum
arXiv preprint arXiv:1901.04966, 2019
Cited by 10, 2019
DeepMDP: Learning continuous latent space models for representation learning
C Gelada, S Kumar, J Buckman, O Nachum, MG Bellemare
arXiv preprint arXiv:1906.02736, 2019
Cited by 9, 2019
Lyapunov-based safe policy optimization for continuous control
Y Chow, O Nachum, A Faust, M Ghavamzadeh, E Duenez-Guzman
arXiv preprint arXiv:1901.10031, 2019
Cited by 7, 2019
DualDICE: Behavior-agnostic estimation of discounted stationary distribution corrections
O Nachum, Y Chow, B Dai, L Li
Advances in Neural Information Processing Systems, 2315-2325, 2019
Cited by 7, 2019
The Laplacian in RL: Learning representations with efficient approximations
Y Wu, G Tucker, O Nachum
arXiv preprint arXiv:1810.04586, 2018
Cited by 5, 2018
Smoothed Action Value Functions for Learning Gaussian Policies
O Nachum, M Norouzi, G Tucker, D Schuurmans
International Conference on Machine Learning, 2018
Cited by 5, 2018
Multi-agent manipulation via locomotion using hierarchical sim2real
O Nachum, M Ahn, H Ponte, S Gu, V Kumar
arXiv preprint arXiv:1908.05224, 2019
Cited by 4, 2019
Robustness guarantees for density clustering
H Jiang, J Jang, O Nachum
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
Cited by 2, 2019
Learning neural network structure
O Nachum, A Gordon, E Eban, B Chen
US Patent App. 15/813,961, 2019
Cited by 1, 2019
Reinforcement Learning via Fenchel-Rockafellar Duality
O Nachum, B Dai
arXiv preprint arXiv:2001.01866, 2020
2020