Slides from the workshop presentations are now available.
Results are now available. Congratulations to our winners.
Testing round closed. Thank you to all of our competitors!
Updated Testing application (R15) is now available HERE.
Proving application is now available HERE.
The rules, schedule, and prizes have been announced.
GAME ON! The software is now available.
Sign up for our mailing list to receive important announcements about the competition.
The acrobot, a two-link, underactuated robot roughly analogous to a gymnast swinging on a high bar. The first joint (corresponding to the gymnast's hands on the bar) cannot exert torque, but the second joint (corresponding to the gymnast bending at the waist) can. The system has four continuous state variables: two joint positions and two joint velocities.

The acrobot version that we present here is a generalized version that changes some aspects of the original acrobot dynamics but (we hope) presents interesting challenges to Reinforcement Learning algorithms and practitioners.
The acrobot is an episodic task.
Observation Space: The system has four continuous observation variables: two joint positions and two joint velocities.
Action Space: In order to make the problem more interesting, we have changed the behaviour of the actions from the typical acrobot problem.
Reward:
see [http://www.cs.ualberta.ca/~sutton/book/ebook/node110.html]