We’ve learned a lot from doing research in Avalon over the past year! Internally, we’ve extended it to deal with multiple agents, and add simple audio / linguistic inputs for the agents. We’ve conducted a number of experiments on the tasks in the benchmark, and found that while well-tuned PPO approaches can achieve reasonable performance on the more basic tasks, most RL algorithms really struggle with the more complex and compositional tasks. These results pushed us towards focusing on agents that were better able to incorporate more explicit reasoning. While we believe that Avalon remains a useful tool for conducting fundamental reinforcement learning research, we are currently more focused on creating agents in text-based environments (ex: your code editor, browsers, and computer desktop environment). This means that we are unlikely to be developing many significant new features for Avalon in the near future. See more in our blog post here.
Agents in Avalon must accomplish a wide range of tasks, all with the same sparse reward structure. Different tasks correspond to randomly generated worlds that require that skill.

See the paper and presentation (below) for more details about the benchmark.
The Avalon benchmark is built on top of a simulator that we created specifically to suit the needs of RL researchers. Features:
Avalon can be used as a largely drop-in replacement for Atari or other standard RL benchmarks, simply see this tutorial to get started.
Avalon is a great place to start, as it includes highly tuned, easy-to-understand implementations of a variety of popular and high-performance RL algorithms. See this tutorial to replicate our training.
Avalon is an extremely fast, extremely easy-to-use platform for research, built on top of the fully open source Godot game engine. See this tutorial to create a new environment from scratch.
Avalon is incredibly easy to try—just download 30MB for your platform, unzip, and run.
To get started with Avalon, simply run the below in your own notebook!
pythonpythonpythonSee the full documentation and source code on our Github repository, or try one of these tutorials:
Our paper on Avalon was published at NeurIPS 2022. We will be presenting it in person.
Check out our paper for all the details on the environment and tasks, along with human and RL baselines (PPO, IMPALA, and Dreamer v2).
To cite Avalon, use the following:
textIf you’re interested in using Avalon, please feel free to reach out and say hello! We’re excited to help the research community build on top of Avalon.