Policy Gradient

Policy Gradient (REINFORCE)

Implementation of REINFORCE to solve the OpenAI Gym LunarLander environment.
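As a rough illustration of the quantity REINFORCE weights its log-probabilities by, the sketch below computes discounted returns for a single episode. It is a minimal example and is not taken from this repository's code: the function name `discounted_returns` and the default `gamma=0.99` are assumptions chosen for illustration.

```python
def discounted_returns(rewards, gamma=0.99):
    """Compute G_t = r_t + gamma * G_{t+1} for each step of one episode.

    `rewards` is the list of per-step rewards; `gamma` is an assumed
    discount factor, not a value taken from this repository.
    """
    returns = []
    running = 0.0
    # Walk the episode backwards so each return reuses the one after it.
    for reward in reversed(rewards):
        running = reward + gamma * running
        returns.append(running)
    returns.reverse()
    return returns
```

In a typical REINFORCE update, each return would multiply the log-probability of the action taken at that step, often after subtracting a baseline to reduce variance.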

Soft Actor Critic

Implementation of SAC to solve the OpenAI Gym LunarLander environment.

Value Actor Critic

Implementation of Value Actor Critic to solve the OpenAI Gym LunarLander environment.
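Assuming "Value Actor Critic" here means an actor-critic method whose critic estimates the state value V(s), the actor's learning signal is commonly a one-step temporal-difference advantage. The sketch below shows that computation only. The function name `td_advantage`, its parameters, and the default `gamma=0.99` are hypothetical and not taken from this repository.

```python
def td_advantage(reward, value, next_value, done, gamma=0.99):
    """One-step TD advantage: r + gamma * V(s') - V(s).

    `value` and `next_value` are the critic's estimates for the current
    and next state. When the episode has ended (`done` is True) there is
    no next state to bootstrap from, so the bootstrap term is zero.
    """
    bootstrap = 0.0 if done else next_value
    return reward + gamma * bootstrap - value
```

A positive advantage suggests the action did better than the critic expected, so the actor would raise its probability. A negative one would lower it.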