Tsinghua University
Reinforcement Learning
Fall 2025

Main Navigation

  • Home
  • Schedule
  • Lectures
  • Assignments
  • Project
  • Materials

Assignments

You can find the instruction to your assignments here. Also check out each assignment page for any additional info.

  • MP #1 - Bandit Algorithms  
  • MP #2 - Markov Decision Process  
  • MP #3 - Temporal difference and policy gradient methods  
  • MP #4 - Deep RL methods

Department of Computer Science and Technology
Beijing, China

  • tsinghua.edu.cn/