Slides from the workshop presentations are now available.
Results are now available. Congratulations to our winners.
Testing round closed. Thank you to all of our competitors!
Updated Testing application (R15) is now available HERE.
Proving application is now available HERE.
The rules, schedule, and prizes have been announced.
GAME ON! The software is now available.
Sign up for our mailing list to receive important announcements about the competition.
For questions, suggestions, or bug reports, please join the discussion forums.
Agents must balance a two-jointed virtual gymnast. Challenges include:
Get more details on the acrobot domain here.
Agents play tetris with several twists. New pieces arrive according to probability distributions which are at the mercy of an adversary, who chooses the piece which he thinks would be the worst fit for the current board. Challenges include:
Get more details on the Tetris domain here.
Based on the helicopter simulator from Andrew Ng's group, agents must control a helicopter which is attempting to stably hover. Challenges include:
Get more details on the helicopter domain here.
Agents play a variant of Super Mario, a complete side-scrolling video game with destructible blocks, enemies,
fireballs, coins, chasms, platforms, etc. The state space is complicated, but factored in an object-oriented way,
which captures many aspects of the real world. Challenges include:
Get more details on the mario domain here.
Agents must control a long, flexible octopus arm and make it grab food, without leaving its tank or bumping into itself. Challenges include:
Get more details on the octopus arm domain here.
Competitors must code a general purpose RL agent. Agents are tested on a variety of different MDPs which do not exhibit systematic structure between themselves. This forces the agent to learn quickly and reason flexibly about general MDPs. Challenges include:
Get more details on the polyathlon here