Learning By Doing - NeurIPS 2021 Competition

Last edited: 2021-07-15

Controlling a Dynamical System using Control Theory, Reinforcement Learning, or Causality

Control theory, reinforcement learning, and causality are all ways of mathematically describing how the world changes when we interact with it. Each field offers a different perspective with its own strengths and weaknesses.

In this NeurIPS competition, we aim to bring together researchers from all three fields to encourage cross-disciplinary discussions. These can happen during the competition and also afterwards when solutions are being presented. The competition is constructed to readily fit into the mathematical frameworks of all three fields and participants of any background are encouraged to participate.

We designed two tracks that consider a dynamical system for which participants need to find controls/policies to optimally interact with a target process: an open loop/bandit track (CHEM) and a closed loop/online RL track (ROBO).

We hope that the challenge further bridges the gap between control theory, reinforcement learning, and causality. Seeing how the same problem can be tackled in different ways may be a first step towards understanding the reasoning in other communities and learning from each other.

Latest Announcements

[requires JavaScript]


For each of the two tracks, the following prizes are awarded:

In case of a tie the prizes are split among the winners. See the terms and conditions for eligibility criteria.

Sponsored by the Department of Mathematical Sciences, University of Copenhagen and the Copenhagen Causality Lab.

Getting Started

The competition is run on CodaLab:

Use our tutorial to learn the technical details of how to participate in the competition and get started. For further background information take a look at the white paper.

Please use the Codalab Fora for questions about the competition:

Important Dates

Track CHEM – Trial Phase Start (trial data, starter kits, tutorial)
July 6th, 10:00 UTC
Track CHEM – Validation Phase Start (main competition phase)
July 10th, 16:00 UTC
Track ROBO – Trial Phase Start
July 15th, 17:00 UTC
Track ROBO – Validation Phase Start
July 29th, 16:00 UTC
Registration Deadline (both tracks)
August 20th, 16:00 UTC
Track ROBO – Selection Phase Start
September 12th, 16:00 UTC
Track CHEM – Selection Phase Start
September 20th, 16:00 UTC
Competition Deadline (both tracks)
September 26th, 16:00 UTC
Technical Description Submission Deadline (both tracks)
October 3rd, 16:00 UTC
NeurIPS Conference and Announcement of Winners
December, 13–14, 2021

demo1 demo2


Contact us via email at LearningByDoing AT math DOT ku DOT dk.

Dominik Baumann
RWTH Aachen University

Timothy Lee
Carnegie Mellon University

Niklas Pfister
University of Copenhagen

Isabelle Guyon
Université Paris-Saclay, ChaLearn

Søren Wengel Mogensen
Lund University

Sebastian Trimpe
RWTH Aachen University

Oliver Kroemer
Carnegie Mellon University

Jonas Peters
University of Copenhagen

Sebastian Weichwald
University of Copenhagen