Seminar on Multi-agent Reinforcement Learning

Saarland University — Winter Semester 2020

The course will cover the state of the art research papers in multi-agent reinforcement learning, including the following three topics: (i) game playing and social interaction, (ii) human-machine collaboration, and (iii) robustness, accountability, and safety.

Organizers

Goran Radanovic: Instructor
Adish Singla: Instructor
Rati Devidze: Teaching assistant
Stylianos Triantafyllou: Teaching assistant

Timeline and updates

Until 26 Oct 2020: Register for the seminar course allocation at https://seminars.cs.uni-saarland.de.
2 Nov 2020: We have a new mailing list that includes all the four organizers/tutors. To reach out to us, you should send an email to multiagentrl-w20-tutors@mpi-sws.org (instead of contacting individuals).
Until 30 Nov 2020: After you have been allocated a slot in the seminar, you then need to register for the seminar course examination at UdS. You should check with UdS when the examination registration starts and ends.
10 Nov 2020: Paper assignment for reading and writing reports will be sent to students. A total of six papers will be assigned to each student for which they will be writing reports.
30 Nov 2020: Reports for the first two papers are due.
20 Dec 2020: Reports for the next two papers are due.
10 Jan 2021: Reports for the last two papers are due.
15 Jan 2021: Paper assignment for presentations is sent to students. One paper is assigned to each student that they will be presenting.
15 Feb 2021: Presentation slides are due.
Between 20 Feb to 20 Mar 2021: Final presentations will take place where each student will present their assigned paper. The exact dates will be finalized in discussion with enrolled students.

Course structure

The course consists of two main components: (i) Reading research papers and (ii) Presentations. There will be no weekly classes. To resolve doubts in the assigned papers and provide feedback on the presentation slides, the tutors will arrange specific meeting times after each deadline — further information will be communicated to students via emails as we move along in the semester. If needed, you can reach out to us by sending an email to multiagentrl-w20-tutors@mpi-sws.org.

Reading research papers

Each student will be assigned a total of six research papers (two papers per topic). The complete list of papers is provided below, and the assignments will be done by choosing papers at random from this list. You will receive this assignment from us via email.
For each of the assigned paper, you will have to write a two-page report.
Each report should be submitted as a PDF file via sending an email to multiagentrl-w20-tutors@mpi-sws.org. You should name your PDF files as lastname_#.pdf (i.e., lastname_1.pdf, lastname_2.pdf, lastname_3.pdf, lastname_4.pdf, lastname_5.pdf, and lastname_6.pdf).
Reports should be written in latex using NeurIPS style files
- https://neurips.cc/Conferences/2020/PaperInformation/StyleFiles.
- You should use the option \usepackage[preprint]{neurips_2020} (Non-anonymous preprints).
- The report should include the paper title and your name at the top.
Structure the report as an extended review, e.g.,
- Summarize the paper.
- Write down main strengths of the paper.
- Write down main weaknesses of the paper.
- Write down ways in which this paper could be improved.
- Write down ideas in which this paper could be extended.
These six reports will correspond to 50% of the final score.

Presentations

Each student will be assigned a paper for presentation. This paper will be selected from the list of six papers assigned for writing reports.
You will have to prepare a presentation of 25 mins. You will have the possibility to get feedback on your slides before the final submission.
At the end of the semester, you will give a final presentation. We will block about 6 hours of time for the presentations. The exact dates will be finalized in discussion with enrolled students. Attendance to the final presentations will be mandatory.
The slides and presentation will correspond to 50% of the final score.

List of research papers

Imprint / Data Protection

Organizers

Timeline and updates

Course structure

Reading research papers

Presentations

List of research papers

Game playing and social interaction

Human-machine collaboration

Robustness, accountability, and safety