- (Zong 2018 ICLR) Deep Autoencoding Gaussian Mixture Model For Unsupervised Anomaly Detection
- (Ofir 2018 Nips) Data-Efficient Hierarchical Reinforcement Learning
- (Vezhnevets 2017 ICML) Feudal networks for hierarchical reinforcement learning
- (Haarnoja 2019 arxiv) Soft Actor-Critic Algorithms and Applications
- (Haarnoja 2018 ICML) Soft Actor-Critic; Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
Single-Agent 24
- (Fujimoto 2018 ICML) Addressing Function Approximation Error in Actor-Critic Methods Jul 1, 2021
- (Haarnoja 2019 arxiv) Soft Actor-Critic Algorithms and Applications Jun 27, 2021
- (Haarnoja 2018 ICML) Soft Actor-Critic; Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Jun 26, 2021
- (Lillicrap 2015 ICLR) Continuous Control With Deep Reinforcement Learning Jun 20, 2021
- (Wu 2017 arxiv) Scalable trust-region method for deep reinforcementlearning using Kronecker-factored approximation Jun 18, 2021
- (ICLR 2017 Wang) Sample Efficient Actor-Critic with Experience Replay Jun 14, 2021
- 16. (Schulman 2017 arxiv) Proximal Policy Optimization Algorithms May 23, 2021
- 15. (Schulman 2017 ICML) Trust Region Policy Optimization May 16, 2021
- 14. (Mnih 2016 ICML) Asynchronous Methods for Deep Reinforcement Learning May 8, 2021
- 13. Advanced Actor-Critic(A2C) May 7, 2021
- 13. Actor-Critic May 7, 2021
- (Lee 2019 arxiv) Tsallis reinforcement learning; A unified framework for maximum entropy reinforcement learning May 2, 2021
- 12. (Sutton Nips 1999) Policy Gradient Methods for Reinforcement Learning with Function Approximation May 1, 2021
- 11. Policy Gradient Apr 27, 2021
- 10. Policy based Apr 27, 2021
- 9. Prioritized Experience Replay Apr 25, 2021
- 8. Dueling DQN Apr 19, 2021
- 7. DDQN Apr 18, 2021
- 6. DQN Apr 18, 2021
- 5. On/off-policy Apr 10, 2021
- 4. Monte Carlo and Temporal Difference Apr 10, 2021
- 3. Optimal Policy Apr 10, 2021
- 2. Bellman Equation Apr 3, 2021
- 1. Markov Decision Process Mar 27, 2021