Schedule
-
EventDateDescriptionCourse Material
-
Lecture09/19/2025
FridayIntroductionSuggested Readings:
-
Lecture09/26/2025
FridayBasic Concepts in Reinforcement Learning[slides] -
Lecture10/10/2025
FridayMulti-Armed Bandits[slides] [KDD20 Tutorial] -
Due10/16/2025 23:59
ThursdayProject Idea Due-
Required content: Sales pitch about your course project idea, especially what makes you excited.
-
Purpose: Find your teammates who share the same passion and complementary skills
-
Where to submit: Post on our 网络学堂 forum.
-
-
Assignment10/18/2025
SaturdayMP #1 - Bandit Algorithms released! -
Quiz10/24/2025
FridayQuiz #1 - RL Basics & Bandits[solution]This quiz is designed to cover essential concepts in basic concepts in reinforcement learning and multi-armed bandit.
-
Lecture10/24/2025
FridayMarkov Decision Process[slides] -
Due10/25/2025 23:59
SaturdayMP #1 due -
Due10/27/2025 23:59
MondayProject Proposal Due-
Required template: You are required to use the latest NeurIPS templates for your project proposal.
-
Maximum length: 3 pages, excluding references and appendix.
-
Where to submit: A submission page will be created on Xuetang. One group only needs to submit one proposal to collab; and please name your submission as “name[+name]*-proposal.pdf”, for example, “王宏宁-李旭浚-proposal.pdf”.
-
-
Lecture10/31/2025
FridayDynamic Programming[slides] -
Lecture11/07/2025
FridayMonte Carlo Methods[slides] -
Assignment11/07/2025
FridayMP #2 - Markov Decision Process released! -
Lecture11/14/2025
FridayTemporal-Difference Learning[slides] -
Due11/15/2025 23:59
SaturdayMP #2 due -
Quiz11/21/2025
FridayQuiz #2 - DP & MC[solution]This quiz is designed to cover essential concepts in dynamic programming and Monto Carlo methods.
-
Lecture11/28/2025
FridayPolicy Gradient Methods[slides] -
Assignment12/06/2025
SaturdayMP #3 - Temporal difference and policy gradient methods released! -
Due12/13/2025 23:59
SaturdayMP #3 due -
Quiz12/19/2025
FridayQuiz #3 - TD & PG[solution]This quiz is designed to cover essential concepts in temporal difference method and policy gradient method.
-
Lecture12/19/2025
FridayApproximation Methods -
Assignment12/20/2025
SaturdayMP #4 - Deep RL methods released! -
Lecture12/26/2025
FridayDeep Reinforcement Learning -
Due12/27/2025 23:59
SaturdayMP #4 due -
Quiz01/02/2026
FridayQuiz #4 - Deep RL[solution]This quiz is designed to cover essential concepts in deep reinforcement learning methods.
-
Due01/02/2026 15:35
FridayProject Presentation-
Presentation location: 四教4104.
-
Presentation length: maximum 15 minutes presentation, including Q&A, given in person.
-
Presentation format: any format you prefer, power point slides or live demonstration.
-
-
Due01/14/2026 23:59
WednesdayProject Report Due-
Required template: You should use the same template that you have used for your project proposal.
-
Maximum length: 6 pages, excluding references and appendix.
-
Where to submit: A submission page will be created on Xuetang. One group only needs to one report to 网络学堂; and please name your submission as “name[+name]*-report.pdf”, for example, “王宏宁-李旭浚-report.pdf”.
-
