-
Notifications
You must be signed in to change notification settings - Fork 1
CartPole v0
Name: CartPole-v0
Category: Classic Control
Environment Page
Algorithms Page
A pole is attached by an un-actuated joint to a cart, which moves along a frictionless track. The pendulum starts upright, and the goal is to prevent it from falling over by increasing and reducing the cart's velocity.
This environment corresponds to the version of the cart-pole problem described by Barto, Sutton, and Anderson
Type: Box(4)
Num | Observation | Min | Max |
---|---|---|---|
0 | Cart Position | -2.4 | 2.4 |
1 | Cart Velocity | -Inf | Inf |
2 | Pole Angle | ~ -41.8° | ~ 41.8° |
3 | Pole Velocity At Tip | -Inf | Inf |
Type: Discrete(2)
Num | Action |
---|---|
0 | Push cart to the left |
1 | Push cart to the right |
Note: The amount the velocity is reduced or increased is not fixed as it depends on the angle the pole is pointing. This is because the center of gravity of the pole increases the amount of energy needed to move the cart underneath it
Reward is 1 for every step taken, including the termination step
All observations are assigned a uniform random value between ±0.05
- Pole Angle is more than ±20.9°
- Cart Position is more than ±2.4 (center of the cart reaches the edge of the display)
- Episode length is greater than 200
Considered solved when the average reward is greater than or equal to 195.0 over 100 consecutive trials.