The course project is to help you get hands-on experience on developing reinforcement learning algorithms to solve interesting and practical problems. The project appreciates either research-oriented problems or “deliverables.” It is preferred that the outcome of your project could be directly publishable, e.g., your (unique) solution to some (interesting/important/new) problems, or tangible, e.g., some kind of prototype system that can be demonstrated. Bonus points will be given to the groups meet either one of above criteria. Group work (2-3 students) is required.

General Steps

  • Pick a topic and form a team
  • Survey related work and prepare a proposal
  • Work on the project
  • Write a report and present your project

Your project will be graded based on the following required components:

  • Project proposal (15%)
    • Everyone posts on our WeChat group to describe your project idea and find your teammates (by October 11th, 23:59);
    • Every team writes a maximum three-page proposal to deliver your motivation and management plan for your project;
    • The report is due by the end of 7th week (October 31st, 23:59).
  • Project presentation (50%)
    • 12 minutes presentation (including Q&A) about what you have done for this course project. Format could be tailored according to the nature of the project, e.g., slides presentation and/or system demo;
    • Performance will be graded by both instructor and other students;
    • Presentation will be given in the last week of this semester (Janurary 2nd, 2026).
  • Project report (35%)
    • A detailed written report of your project in the required templates;
    • Quality requirement is the same as research papers, i.e., in formal written language and rigorous paper format;
    • Report is due on the last day of this semester (Janurary 18th, 2026).

The official rubric for the final report and rubric for the project presentation is provided here.

Note that you are required to use the provided templates for your project proposal and final report. See the Materials page for the template and example file.

What To Work On?

When picking a topic, try to ask yourself the following questions:

  • What is exactly the (research) problem that you want to solve? Will it matter if nobody realizes this problem?
  • What kind of changes could your project make to the others?
  • Is there any existing alternative? If so, why do you still want to do it? How is your idea different from theirs? Would people appreciate about the difference?
  • What would be the major challenge(s) in this problem? Any specific background or resource you have to solve the identified problem?
  • What is the minimum goal to be achieved during this semester? (Try to drop everything non-essential and only keep the part that is truly novel.)
  • How do you plan to demonstrate that method to be developed is indeed solving the pain? Empirical experimentation and/or demo are required, unless you are doing a purely theoretic work.

Before making your decision, it is very important to be aware of whether the problem you would like to tackle has already been solved. If so, you may want to figure out where exactly your novelty is and whether novelty leads to any benefit to others. Your goal is to go beyond, rather than simply duplicate, the existing work.

Leveraging existing resources is especially encouraged as it allows you to minimize the amount of work that you have to do and focus on developing truly your ideas.

Tell Us About It!

At the end of the semester, each project team is expected to present their project in class. The purpose of this presentation is

  • Let you know about others’ projects.
  • Give you some opportunity to practice presentation skills, which are very important for a successful career in both academia and industry.
  • Obtain feedback from others about your project.

In general, the structure of your presentation should be prepared like a conference presentation. So it should touch all the following aspects (text in parenthesis states the instructor’s expectation):

  • What is the background/motivation of your work? What research question will you address? (Learn how to attract public attention.)
  • Why is this problem important? (Learn how to persuade others.)
  • Is there any existing work? How novel is yours? (Learn how to sell your ideas.)
  • How did you solve this problem? (Learn how to deliver your solution.)
  • How good was your method? (Learn how to quantitatively/qualitatively evaluate your work.)
  • Any ideas for further improvement? (Learn how to look ahead.)

To make it even more concrete, here how we are going to evaluate your presentation.

Aspects
Score
Slides content was clearly visible and self-explainable.
[0-10]
Presenters were confident about their work and clearly explained it to me.
[0-10]
Background and research question were clearly highlighted, and the logic and argument were reasonable.
[0-10]
There was sufficient discussion of state-of-the-art and why do we need this new method.
[0-10]
Description of the proposed method is clear, comprehensive, coherent and consistent with the claim in the introduction.
[0-15]
Thorough experimentations that proved all necessary components in the proposed method and detailed analysis of the experimental results.
[0-15]
The presenters well managed their time during presentation.
[0-10]
The presenters did a good job in answering the questions.
[0-10]
I like this work!
[0-10]

Tell Me Your Story

You should write your report as if you were writing a regular research paper. You should address the same questions as those you have addressed in the proposal and presentation, only with more details. Pay special attention to the challenges that you have solved and your detailed solutions. Basic sections to be included in the report should be the same as those in a conference paper, e.g., abstract, introduction, related work, method, experiment and conclusion. If you are developing a demo system or toolkit, your report can follow the format of a demo paper.

The project report must be at most six pages with that format (no minimal requirement, as long as you feel it is sufficient to prove the merit of your work).

And this is how I am going to read it,

Aspects
Score
Strictly follow the provided template.
[0-10]
Background and research question were clearly stated in the introduction and the logic and argument were reasonable.
[0-10]
Contribution of the work was properly articulated in the introduction.
[0-10]
Sufficient discussion of state-of-the-art in related work section.
[0-10]
Description of the proposed method was clear, comprehensive, coherent and consistent with the claim in the introduction.
[0-20]
Precise description of experiment design and experimental data set.
[0-10]
Thorough experimentations that proved all necessary components in the proposed method and detailed analysis of the experimental results.
[0-20]
Summarization of the work, reasonable discussion of limitation of the proposed solution and future work.
[0-10]