CartPole-v0
Purpose: Solve the CartPole-v0
from OpenAI gym
using Q-learning with experience
resampling.
CartPole-v0
is solved using Q-learning with
experience resampling. The experience is saved in a reservoir
list.
From the terminal run:
python train.py
And then:
python evaluate.py
Training and hyperparameter tuning can be done in a Jupyter Notebook as showed below.
import train
train.main(=False,
render=0.95,
gamma=0.1,
epsilon=5,
n_episodes=10000,
training_size=10000,
experience_size=64,
batch_size=50,
epochs )
Feel free to comment here below. A Github account is required.