- (Zong 2018 ICLR) Deep Autoencoding Gaussian Mixture Model For Unsupervised Anomaly Detection
- (Nachum 2018 Nips) Data-Efficient Hierarchical Reinforcement Learning
- (Vezhnevets 2017 ICML) Feudal networks for hierarchical reinforcement learning
- (Haarnoja 2019 arxiv) Soft Actor-Critic Algorithms and Applications
- (Haarnoja 2018 ICML) Soft Actor-Critic; Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
RL 34
- (Espeholt 2018 ICML) Impala; Scalable distributed deep-rl with importance weighted actor-learner architectures Sep 20, 2021
- (Duan 2017 ICLR) RL2; Fast Reinforcement Learning Via Slow Reinforcement Learning Aug 1, 2021
- (Nachum 2018 Nips) Data-Efficient Hierarchical Reinforcement Learning Jul 25, 2021
- (Vezhnevets 2017 ICML) Feudal networks for hierarchical reinforcement learning Jul 8, 2021
- (Fujimoto 2018 ICML) Addressing Function Approximation Error in Actor-Critic Methods Jul 1, 2021
- (Haarnoja 2019 arxiv) Soft Actor-Critic Algorithms and Applications Jun 27, 2021
- (Haarnoja 2018 ICML) Soft Actor-Critic; Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Jun 26, 2021
- (Lillicrap 2015 ICLR) Continuous Control With Deep Reinforcement Learning Jun 20, 2021
- (Wu 2017 arxiv) Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation Jun 18, 2021
- (Wang 2017 ICLR) Sample Efficient Actor-Critic with Experience Replay Jun 14, 2021
- (Vezhnevets 2016 Nips) Strategic Attentive Writer for Learning Macro-Actions Jun 6, 2021
- 16. (Schulman 2017 arxiv) Proximal Policy Optimization Algorithms May 23, 2021
- 15. (Schulman 2015 ICML) Trust Region Policy Optimization May 16, 2021
- 14. (Mnih 2016 ICML) Asynchronous Methods for Deep Reinforcement Learning May 8, 2021
- 13. Advantage Actor-Critic (A2C) May 7, 2021
- 13. Actor-Critic May 7, 2021
- (Lee 2019 arxiv) Tsallis reinforcement learning; A unified framework for maximum entropy reinforcement learning May 2, 2021
- 12. (Sutton 1999 Nips) Policy Gradient Methods for Reinforcement Learning with Function Approximation May 1, 2021
- 11. Policy Gradient Apr 27, 2021
- 10. Policy based Apr 27, 2021
- 9. Prioritized Experience Replay Apr 25, 2021
- 5. (Iqbal 2019 ICML) Actor-Attention-Critic for Multi-Agent Reinforcement Learning Apr 22, 2021
- 8. Dueling DQN Apr 19, 2021
- 7. DDQN Apr 18, 2021
- 6. DQN Apr 18, 2021
- 4. (Du 2019 NIPS) LIIR; Learning Individual Intrinsic Reward in Multi-Agent Reinforcement learning Apr 18, 2021
- 3. (Rashid 2018 ICML) Qmix; Monotonic value function factorisation for deep multi-agent reinforcement learning Apr 11, 2021
- 5. On/off-policy Apr 10, 2021
- 4. Monte Carlo and Temporal Difference Apr 10, 2021
- 3. Optimal Policy Apr 10, 2021
- 2. (Foerster 2017 AAAI) Counterfactual Multi-Agent Policy Gradients Apr 4, 2021
- 2. Bellman Equation Apr 3, 2021
- 1. (Tan 1993 ICML) Multi-Agent Reinforcement Learning; Independent vs Cooperative Agents Mar 28, 2021
- 1. Markov Decision Process Mar 27, 2021