Collection of Deep Reinforcement Learning Algorithms in PyTorch.
Below a list of Jupyter Notebooks with implementations
-
Q-Learning βββ Source/Paper
-
Double DQN ββββ Paper
-
Dueling DQN ββββ Paper
-
Categorical DQN - C51 ββββPaper
-[Soft-DQN] TODO
-
Sarsa [Source/Paper]
-
Vanilla Policy Gradient +LSTM [Source/Paper]
-
A2C Paper
-
A2C with gae* [TODO]
-
A2C multi environment
-
PPO Paper
-
PPO with gae*
-
PPO with gae and curiosity driven exploration (single, digit inputs) Paper
-
PPO multi environment
-
A2C with gae* [TODO]
gae* = Generalized Advanted Estimation Source
-
DDPG [Source/Paper]
-
Twin Delayed DDPG (TD3) ββββPaper
-
Soft Actor Critic (SAC-newest 2019 version) ββββPaper
Discrete and continuous action space implementation of β κ€
Implementierungen von Munchausen RL
-
Evolution Strategies with mulit processing and novelty search
-
- Genetic Algorithm implementation with LSTM, Multiprocessing over several CPUs and Novelty Search for Exploration
Gridsearch
Random Forest [TODO]
Genetic Algorithm [TODO]
====================================
