This AIIDE workshop is centered on the MARLÖ competition on Multi-Agent Reinforcement Learning in MalmÖ. The aim of the competition is to foster research in agents that can learn to play a range of multi-agent games. This workshop is a key opportunity to raise awareness of the competition and associated research challenges within the AIIDE community, to brainstorm and discuss research directions in multi-task, multi-agent learning in modern video games, and to create a fertile ground for novel collaborations.
This is a 1-day workshop which uniquely features the MARLO competition as is being run.
Key program elements are 2 invited talks, as well as short contributed talks and spotlight talks where competition participants give insights into the approaches their agents use. A short tutorial will allow interested attendees to get a hands on start to experimenting with MARLÖ competition agents. The conclusion of the day is a highlight, with announcement of the tournament winners and discussion with the winning teams.
The papers accepted would contribute to the AIIDE'18 Workshop Proceedings, including contributions from competition participants.
The program is as follows:
When | What | By |
---|---|---|
9:00 - 10:30 | MARLO Tutorial: Introductions and Hands-on | Diego Perez-Liebana |
10:30 - 11:00 | Coffee Break | |
11:00 - 11:50 | Keynote 1 - Game AI: The Appearance of Intelligence [Video] | Jesse Cluff (Coalition Games) |
11:50 - 12:10 | Like a DNA String: Sequence Mining for Player Profiling in Tom Clancy's The Division [Video] | Hendrik Baier |
12:10 - 12:30 | Modular Architecture for StarCraft II with Deep Reinforcement Learning [Video] | Dennis Lee |
12:30 - 14:00 | Lunch Break | |
14:00 - 14:50 | Keynote 2 - DeepStack: Expert-Level AI in Heads-Up No-Limit Poker | Martin Schmid (Google DeepMind) |
14:50 - 15:10 | Pommerman: A Multi-Agent Playground [Paper] [Video] | Cinjon Resnick et al. |
15:10 - 15:30 | Extending World Models for Multi-Agent Reinforcement Learning in MALMO [Paper] [Video] | Valliappa Chockalingam et al. |
15:30 - 16:00 | Coffee Break | |
16:00 - 16:45 | Discussion Panel | TBA |
16:45 - 17:00 | Closing |
The AIIDE 2018 workshop on 'Learning to Play: The Multi-Agent Reinforcement Learning in MalmO (MARLO) Competition' aims to encourage research towards more general AI approaches through multi-player games. Games have a long and fruitful history of both serving as test beds to push AI research forward and being the first to benefit from novel research developments. It is our belief that this is the right time to focus on multi-player games in the more complex and diverse 3D environments provided by modern video games such as Minecraft.
The problem of learning in multi-agent settings is one of the fundamental problems in artificial intelligence research and poses unique research challenges. For example, the presence of independently learning agents can result in non-stationarity, and the presence of adversarial agents can hamper exploration and consequently the learning progress. In addition to being particularly challenging, progress in multi-agent learning has far-ranging application potential, in particular in modern multi-player video games, where novel AI agents have great potential to enable novel game experiences.
A key feature of multi-agent play in video game settings is a rich diversity of tasks. Modern video games consist of varied maps or levels, all spanned by the theme of the game but varying in interesting and surprising ways. Such diversity poses the challenge of learning to generalize across multiple related tasks. Exciting research on multi-task learning has addressed some of these challenges, but key questions remain. How well can state of the art approaches learn to generalize to variants of a previously learned game?
Goal of the workshop is to bring together researchers and practitioners with diverse backgrounds in artificial intelligence and gaming, to provide a ground for the fruitful exchange of ideas to help start tackling these exciting key challenges associated with multi-task, multi-agent game settings.
Topics:
We invite submissions on all aspects of learning in multi-task, multi-agent settings, especially where they relate to video games. These include, but are not limited to:
In addition to novel research contributions, we invite the submission of extended abstracts that summarize recent published work that are related to the topic of the workshop. This is an opportunity for authors to present their work to the AIIDE community, and generate discussion and ideas for future work and collaboration.
We also specifically encourage submissions from teams planning to participate in the MARLO competition, for example with extended abstracts that detail their planned competition agents.
MARLO COMPETITTION
The Learning to Play workshop is associated with the MARLO (Multi-Agent Reinforcement Learning in MalmO) Competition, and will host a live tournament round. The competition asks participants to create agents that learn to play with and against other agents across a series of related mini-games that are implemented on top of the game Minecraft using the Malmo framework. It will be kicked off by August 2018, and will focus on learning to play in multi-agent, multi-task game settings.
Details of the MARLO competition will be posted at: https://www.crowdai.org/challenges/marlo-2018
IMPORTANT DATES
PAPER SUBMISSION
We invite submission of extended abstracts (up to 2 pages, including references) of work in progress or summarising recent relevant publications, as well as full papers (up 4 pages) for novel research contributions or position papers. Submissions should be formatted using the AAAI template
Authors must register at the workshop paper submission site before they submit their papers. Abstracts and papers must be submitted through the submission website; we cannot accept submissions by email.
Please submit papers and extended abstracts on Easychair: https://easychair.org/conferences/?conf=aiide18 (select "AIIDE-18 Workshop: Learning to Play: The Multi-Agent Reinforcement Learning in Malmö" when creating a new submission).
CODE OF CONDUCT
The open exchange of ideas and the freedom of thought and expression are central to the aims and goals of the Learning to Play workshop at AIIDE 2018. The workshop organizers commit to providing a harassment-free, accessible, inclusive, and pleasant workshop experience with equity in rights for all. We want every participant to feel welcome, included, and safe at the workshop. We aim to provide a safe, respectful, and harassment-free workshop environment for everyone involved regardless of age, sex, gender, gender identity and expression, sexual orientation, (dis)ability, physical appearance, race, ethnicity, nationality, marital status, military status, veteran status, religious beliefs, dietary requirements, medical conditions, pregnancy-related concerns or childcare requirements. We also respect any other status protected by laws of the country in which the workshop or program is being held.
We do not tolerate harassment of workshop participants. We expect all interactions between AIIDE members to be respectful and constructive, including interactions during the review process, at the workshop itself, and on social media. Workshop participants who violate the terms of this policy may not be welcome to submit to or attend future AIIDE meetings. Concerns should be brought to the attention of the workshop organizers in person (if needed) and definitely in writing, and will be investigated and reviewed by AAAI and the AIIDE Steering Committee. If there is an immediate need for intervention, outside law enforcement authorities may need to be contacted.
(This code of conduct was adapted from the Code of Conduct from the ACM Special Interest Group on Computer Human Interaction. See: https://chi2017.acm.org/diversity-inclusion-statement.html )
Check the competition site at Crowd AI
Clone the framework from our repository
The Multi-Agent Reinforcement Learning in MalmÖ (MARLÖ) competition is a new challenge that proposes research on Multi-Agent Reinforcement Learning using multiple games. Participants would create learning agents that will be able to play multiple 3D games as defined in the MalmÖ platform built on top of Minecraft. The aim of the competition is to encourage AI research on more general approaches via multi-player games. For this, the challenge will consist of not one but several games, each one of them with several tasks of varying difficulty and settings. Some of these tasks will be public and participants will be able to train on them. Others, however, will be private, only used to determine the final rankings of the competition.
A framework will be provided with easy instructions to install, create the first agent and submit it to the competition server. Documentation, tutorials and sample controllers for the development of entries will also be accessible to the participants of this challenge. The competition will be hosted on CrowdAI.org, which will determine the preliminary rankings of the competition. Recurring tournaments at regular intervals will determine which agents perform better in the games proposed. This competition will be sponsored by Microsoft Research for framework development and competition awards.
One of the main features of this competition is that agents play in multiple games. Therefore, several tasks are proposed for this contest. For the purpose of this document and the competition itself, we define:
The next figure sketches how games and tasks are organized in the challenge. As can be seen, tasks will be of public nature and accessible by the participants, while others are secret and will be used to evaluate the submitted entries at the end of the competition. Tasks are distributed across sets:
For more information about the competition, visit our Crowd AI competition site.
For queries about this workshop, please contact Diego Perez-Liebana
For queries about the competition, please refer to the contact section of our Crowd AI competition site.