Cancel

RL 34

(Lasse 2018 ICML) Impala; Scalable distributed deep-rl with importance weighted actor-learner architectures Sep 20, 2021
(Duan 2017 ICLR) RL2; Fast Reinforcement Learning Via Slow Reinforcement Learning Aug 1, 2021
(Ofir 2018 Nips) Data-Efficient Hierarchical Reinforcement Learning Jul 25, 2021
(Vezhnevets 2017 ICML) Feudal networks for hierarchical reinforcement learning Jul 8, 2021
(Fujimoto 2018 ICML) Addressing Function Approximation Error in Actor-Critic Methods Jul 1, 2021
(Haarnoja 2019 arxiv) Soft Actor-Critic Algorithms and Applications Jun 27, 2021
(Haarnoja 2018 ICML) Soft Actor-Critic; Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor Jun 26, 2021
(Lillicrap 2015 ICLR) Continuous Control With Deep Reinforcement Learning Jun 20, 2021
(Wu 2017 arxiv) Scalable trust-region method for deep reinforcementlearning using Kronecker-factored approximation Jun 18, 2021
(ICLR 2017 Wang) Sample Efficient Actor-Critic with Experience Replay Jun 14, 2021
(Nips 2016 Vezhnevets) Strategic Attentive Writer for Learning Macro-Actions Jun 6, 2021
16. (Schulman 2017 arxiv) Proximal Policy Optimization Algorithms May 23, 2021
15. (Schulman 2017 ICML) Trust Region Policy Optimization May 16, 2021
14. (Mnih 2016 ICML) Asynchronous Methods for Deep Reinforcement Learning May 8, 2021
13. Advanced Actor-Critic(A2C) May 7, 2021
13. Actor-Critic May 7, 2021
(Lee 2019 arxiv) Tsallis reinforcement learning; A unified framework for maximum entropy reinforcement learning May 2, 2021
12. (Sutton Nips 1999) Policy Gradient Methods for Reinforcement Learning with Function Approximation May 1, 2021
11. Policy Gradient Apr 27, 2021
10. Policy based Apr 27, 2021
9. Prioritized Experience Replay Apr 25, 2021
5. (Iqbal 2019 ICML) Actor-Attention-Critic for Multi-Agent Reinforcement Learning Apr 22, 2021
8. Dueling DQN Apr 19, 2021
7. DDQN Apr 18, 2021
6. DQN Apr 18, 2021
4. (Du 2019 NIPS) LIIR; Learning Individual Intrinsic Reward in Multi-Agent Reinforcement learning Apr 18, 2021
3. (Rashid 2018 ICML) Qmix; Monotonic value function factorisation for deep multi-agent reinforcement learning Apr 11, 2021
5. On/off-policy Apr 10, 2021
4. Monte Carlo and Temporal Difference Apr 10, 2021
3. Optimal Policy Apr 10, 2021
2. (Foerster 2017 AAAI) Counterfactual Multi-Agent Policy Gradients Apr 4, 2021
2. Bellman Equation Apr 3, 2021
1. (Tan 1993 ICML) Multi-Agent Reinforcement Learning; Independent vs Cooperative Agents Mar 28, 2021
1. Markov Decision Process Mar 27, 2021

Recent Update

Trending Tags

RL Single-Agent Deep learning Multi-Agent Hierarchial RL Anomaly Detection Distributed RL memo Meta-RL

Trending Tags

RL Single Agent Deep learning Multi Agent Hierarchial RL Anomaly Detection Distributed RL memo Meta RL