Exercises for reinforcement learning.
- numpy
- gym
- bandits & epsilon-greedy
- gridworld & Bellman equation
- cliffworld & policy/value iteration
- ...
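
As a flavor of the bandit exercise listed above, here is a minimal epsilon-greedy sketch. It is not code from this repository: the function name `run_epsilon_greedy`, the Gaussian reward model, and the default parameters are all assumptions made for illustration, and it uses only the standard library so it runs without numpy or gym.

```python
import random


def run_epsilon_greedy(true_means, epsilon=0.1, steps=1000, seed=0):
    """Play a Gaussian multi-armed bandit with an epsilon-greedy agent.

    true_means: hypothetical mean reward of each arm (unit variance assumed).
    Returns (estimates, counts): the running mean reward and pull count per arm.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    estimates = [0.0] * n_arms  # running mean reward per arm
    counts = [0] * n_arms       # number of pulls per arm

    for _ in range(steps):
        if rng.random() < epsilon:
            # Explore: pick an arm uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the arm with the highest current estimate.
            arm = max(range(n_arms), key=lambda a: estimates[a])

        reward = rng.gauss(true_means[arm], 1.0)
        counts[arm] += 1
        # Incremental update of the sample mean.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

    return estimates, counts


if __name__ == "__main__":
    estimates, counts = run_epsilon_greedy([0.2, 0.5, 0.9])
    print("estimates:", estimates)
    print("counts:", counts)
```

With a larger `epsilon` the agent explores more and its estimates for the weaker arms improve, at the cost of pulling the best arm less often.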
You can find examples in the run folder. Some results are shown in the pictures folder. For example, ./pictures/exp6/exp6_2.png shows the results of different methods in a cliffworld: