News
Re-Exam Inspection & Deep RL Seminar
Written on 09.04.24 by Felix Jahn
Hi everyone, the inspection of the re-exam will take place on Wednesday, April 17, from 9:30 to 11:00 in building E 1.1, seminar room 1.21. We would also like to take this opportunity to draw your attention to our seminar in the upcoming summer term, "Special Topics in Deep Reinforcement Learning". Building on the RL knowledge from this lecture, the seminar will deal with the theory and practice of the most important concepts of deep reinforcement learning. Registration is possible via the SIC Seminar Assignment. Best wishes |
Re-Exam Results
Written on 28.03.24 by Felix Jahn
Hi everyone, We have published the results and grades of the re-exam; you can find them on your Personal Status page. The exam inspection will take place in the week of April 15-19 (this is the first week of lectures in the upcoming semester). Best wishes and happy Easter, |
Final Re-Exam Information
Written on 22.03.24 by Felix Jahn
Hi everyone, You are allowed to bring one DIN A4 page with handwritten notes on both sides; no further aids (in particular, no calculator) are allowed. Seating will begin at 10:00. We will put up seating plans next to the lecture hall entrances. You can also already look up your personal seat on your Personal Status page. When entering, please leave your jackets and bags at the sides, turn off all electronic devices (smartphones, laptops, smartwatches, ...) and leave them in your bag. Take to your seat only
You are not allowed to bring/use your own paper. We will provide you with enough paper during the exam. Please make sure to arrive at the lecture hall on time at 10:00. Kind regards and much success! The Reinforcement Learning Team |
Re-Exam Information
Written on 15.03.24 by Felix Jahn
Hi everyone! The re-exam will take place on March 25, 2024 from 10:00-12:00; the procedure will be very similar to the first exam. Most important reminder: Please make sure that you register for the exam in HISPOS-LSF no later than one week before (so by next Monday)! For those of you whose study program does not use HISPOS-LSF for exam administration: please send an email to Felix Jahn stating that you want to attend the re-exam. Again, you are allowed to bring one DIN A4 page with handwritten notes on both sides; no further aids (in particular, no calculator) are allowed. We will also offer additional office hours for the re-exam preparation. In the next week, there will be office hours on
We are happy to see many of you there! We will again provide more detailed information about boarding, seating, etc. in the course of the next week. Best wishes, |
Exam Inspection Location
Written on 15.02.24 by Felix Jahn
Hi everyone, the exam inspection tomorrow from 9:00 to 11:00 will take place in E 1.3, Lecture Hall 001. Best wishes |
Final Exam Results
Written on 09.02.24 by Felix Jahn
Hi everyone, We have published the results and grades of the final exam; you can find them on your Personal Status page. The exam inspection will take place on Friday, February 16, from 9:00 to 11:00. We will announce the exact room of the inspection during the next week. The re-exam will take place on Monday, March 25th, from 10:00 to 12:00. Best wishes, |
Final Exam Information
Written on 02.02.24 by Felix Jahn
Hi everyone, You are allowed to bring one DIN A4 page with handwritten notes on both sides; no further aids (in particular, no calculator) are allowed. Seating will begin at 14:00. We will put up seating plans next to the lecture hall entrances. You can also already look up your personal seat and your lecture hall on your Personal Status page. When entering, please leave your jackets and bags at the sides, turn off all electronic devices (smartphones, laptops, smartwatches, ...) and leave them in your bag. Take to your seat only
You are not allowed to bring/use your own paper. We will provide you with enough paper during the exam. Please make sure to arrive at the lecture hall on time at 14:00. Kind regards and much success! The Reinforcement Learning Team |
Room Exchange Office Hour
Written on 29.01.24 by Felix Jahn
Hi everyone,
Today's office hour will not take place in the lecture hall but in the Seminar Room 016.
See you there! |
A Lot of News Regarding the Exam
Written on 24.01.24 (last change on 24.01.24) by Felix Jahn
Hi everyone! We just published the results of the last exercise sheet; they are now visible on your Personal Status page. There, you can also see whether you have collected the necessary number of points to be admitted to the exam. The exam will take place on February 5, 2024 from 14:00-16:00 in Building E 1.3, Lecture Halls 002 and 003. Very important reminder: Please make sure that you register for the exam in HISPOS-LSF no later than one week before (so by next Monday)! For those of you whose study program does not use HISPOS-LSF for exam administration: please send an email to Felix Jahn stating that you want to attend the exam. For the exam preparation, we will offer additional office hours. In the next week, there will be office hours on
We are happy to see many of you there as well as in tomorrow’s last tutorial. And a last announcement: Also the re-exam is now scheduled and will be on March 25, 10:00-12:00. Best wishes,
|
Tomorrow's Lecture - Q&A Session
Written on 21.01.24 by Verena Wolf
Dear Students, Unfortunately, Timo Gros, who was scheduled to give a practical introduction to Deep Q-learning, is currently ill and unable to present tomorrow. We wish Timo a speedy recovery and look forward to his presentation, which has been rescheduled for the 29th. Timo has prepared extensive material and demos that promise to be highly informative, so we are eager to have him with us on the new date. In lieu of tomorrow's lecture, we will instead hold a short Q&A session. This will be an excellent opportunity for you to ask any questions and clarify doubts regarding the material we have covered so far. I would especially encourage you to bring questions related to the contents of our last video on policy gradient, ensuring that everyone is on the same page. I apologize for any inconvenience this change may cause and appreciate your understanding and flexibility. The Q&A session will be held at the usual time and place (14:15 in HS002). Best regards, |
Online Lecture Video & Exercise Sheet E
Written on 16.01.24 by Felix Jahn
Hi everyone! The video of yesterday's online lecture is now accessible in the Materials section under "Lecture Recordings". It covers the remaining content about policy gradient methods and a short recap/overview of what we have seen so far in the course. On January 22 and January 29, we will have the two final lectures in person as usual. Also, the final Exercise Sheet E is now uploaded. As the sheet was published a day later than usual, we are extending the deadline for exercises E.1 and E.2 to Tuesday, January 23, 14:00. The deadline for exercises E.3 and E.4 remains Monday, January 22, 12:00. Best wishes, |
Important Update on Today's Lecture
Written on 15.01.24 by Verena Wolf
Dear Students, I am writing to inform you that I have tested positive for COVID-19 this morning. Hence, I will not give our scheduled lecture today but will prepare a video lecture covering today's material. This will be uploaded in the next few days for you to view at your convenience. We will have a dedicated Q&A session on the 22nd to address any questions regarding the lecture content. Please feel free to also use our forum if you have any questions. Best regards, Verena Wolf. |
Results Exercise Sheet D
Written on 12.01.24 by Felix Jahn
Hi everyone! The results of Sheet D are now visible on your Personal Status page. If you have any questions regarding the grading, please visit the Office Hour or write us an email. Best wishes and have a nice weekend, |
Exercise Sheet D & Video on Eligibility Traces
Written on 19.12.23 by Felix Jahn
Hi everyone! With a solid delay of 24 hours, we have now published Exercise Sheet D and the corresponding materials. As usual, you can find them in the Materials section. As announced, the sheet contains a larger programming exercise. This exercise sheet is due on Monday, January 8, 12:00. Furthermore, the lecture video about eligibility traces is now also listed in the Materials section. Best wishes, |
Exercise Sheet C - Results
Written on 13.12.23 by Felix Jahn
Hi everyone, The results of Exercise Sheet C have just been published. Have a nice evening and best wishes! |
Exercise Sheet C Deadline
Written on 10.12.23 by Felix Jahn
Hi everyone! As already discussed in the forum, we made a very unfortunate mistake when creating the submission for the current exercise sheet. The deadline of the submission was mistakenly set to Monday, December 18, deviating from the usual "1-week-rhythm" and the deadline stated on the sheet. To resolve the resulting confusion: The deadline for Exercise Sheet C is, as usual, Monday, December 11, 12:00. As a courtesy, we are extending the deadline for the programming exercise (Exercise C4) to Tuesday, December 12, 16:00. We hope that this helps you a little despite the short notice. We sincerely apologize for the confusion and the now very close deadline! Have a nice Sunday evening!
|
Exercise Sheet C
Written on 04.12.23 by Felix Jahn
Hi everyone, The Exercise Sheet C and the corresponding Notebook C can now be found in the Materials section. Have a nice and snowy evening! |
Exercise Sheet B & Tutorial Merging
Written on 29.11.23 by Felix Jahn
Hi everyone, We have published the results and feedback for Exercise Sheet B. The sheet will be discussed in tomorrow's tutorial, so it's the perfect place to ask any questions regarding the exercises or the grading. As already announced at the beginning of last week's tutorials, the tutorials at 10am and 2pm will be merged for the rest of the semester, so that all tutorials will from now on take place in room E1.3, SR014. Have a nice evening and see you tomorrow! |
Exercise Sheet B
Written on 20.11.23 by Felix Jahn
Hello everyone, The Exercise Sheet B can now be found in the Materials section. Have a nice evening! |
Assignment Feedback
Written on 16.11.23 by Felix Jahn
Hi everyone! Unfortunately, we forgot to publish the feedback on your submissions yesterday. It should now be visible on your Personal Status page.
|
Results Assignment 1 & OH Room Change
Written on 15.11.23 by Felix Jahn
Hi everyone! We have published the results of the first assignment sheet. If you have any questions about the feedback or difficulties with certain topics covered on the sheet, please attend tomorrow's tutorials, come to office hours next week or ask in the forum. Best wishes and see you tomorrow! |
Exercise Sheet A & Office Hour
Written on 06.11.23 by Felix Jahn
Hi everyone, We just uploaded the first exercise sheet and the corresponding notebook; you can find both files in the Materials section. Your solutions must be uploaded to the mCMS via your Personal Status page. Recall that the sheet is due next Monday, 12:00. We would also like to remind you of the Office Hour XXL in E1.3, SR 107 on Thursday from 10-16, where you can get help with questions and problems concerning the exercise sheet. Have a nice week!
|
Forum & Exercise Sheet Teams
Written on 03.11.23 by Felix Jahn
Hi everyone, As some of you have already noticed, we have set up a Discourse forum for discussions about all topics surrounding the lecture and reinforcement learning. It is reachable via the navigation bar at the top of the CMS page. Have a nice weekend! |
Tutorial Assignments
Written on 01.11.23 by Felix Jahn
Hello everyone! We have just published the tutorial assignments. You can see your tutorial and the corresponding slot on your Personal Status page. As announced, the tutorials will start tomorrow with a probability and statistics training. See you there and have a great remaining holiday! |
Training Sheet A & Tutorial Assignment
Written on 30.10.23 by Felix Jahn
Hello everyone! We have just published the first training sheet, which covers important basics of probability theory and statistics. You can find it under Information --> Materials. We will discuss the sheet and its content in the tutorials this Thursday, November 2nd. Although the training sheets are completely optional, we recommend that you make sure you understand the concepts, as they will be important as the lecture progresses. Also, we would like to remind you again that you can submit your tutorial preferences until 2pm on Wednesday. The tutorial assignments will then be published on Wednesday afternoon. Have a nice evening! |
Welcome & Registration
Written on 12.10.23 (last change on 12.10.23) by Felix Jahn
Welcome everyone to the Reinforcement Learning lecture! The registration for the course is open until Wednesday, November 1st, 2pm. Best wishes, |
Reinforcement Learning
Reinforcement learning is an area of machine learning where the goal is to develop (near-)optimal policies for solving sequential decision-making problems. The policy is typically represented by an agent who learns to achieve a goal by interacting with the environment. RL is often seen as the third area of machine learning (in addition to supervised and unsupervised areas) in which training samples are generated as a result of the agent's actions and interaction with the environment. In recent years, there have been remarkable successes in reinforcement learning research in both theoretical and applied fields. These successes are mostly the result of a new development in the field: representing policies by artificial neural networks allows us to solve much more complex decision problems.
Course Content
This course provides a broad introduction to reinforcement learning and its applications. You will learn about Markov Decision Processes as the underlying formal framework for decision-making problems, as well as popular reinforcement learning algorithms such as Monte-Carlo methods, temporal difference methods, and different "deep" reinforcement learning approaches. We will use the open-source Python library Gymnasium to train RL agents in different pre-built environments.
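As a rough illustration of the temporal difference methods mentioned above, the following minimal sketch runs tabular Q-learning on a small chain-shaped decision problem. The `ChainEnv` class, the reward design and all hyperparameters are assumptions made for this example and are not taken from the course materials. The environment only loosely mimics the `reset`/`step` pattern of Gymnasium (whose real `step` additionally returns `truncated` and `info`), so that the sketch runs with the Python standard library alone.

```python
import random


class ChainEnv:
    """Hypothetical 5-state chain MDP (an assumption for this sketch, not a Gymnasium env)."""

    def __init__(self, n_states=5):
        self.n_states = n_states
        self.state = 0

    def reset(self):
        # Every episode starts in the leftmost state.
        self.state = 0
        return self.state

    def step(self, action):
        # Action 0 moves left, action 1 moves right; the walls clip the movement.
        move = 1 if action == 1 else -1
        self.state = min(max(self.state + move, 0), self.n_states - 1)
        terminated = self.state == self.n_states - 1
        reward = 1.0 if terminated else 0.0  # reward only when the goal is reached
        return self.state, reward, terminated


def q_learning(env, episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behavior policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(env.n_states)]
    for _ in range(episodes):
        state = env.reset()
        done = False
        while not done:
            if rng.random() < epsilon:
                action = rng.randrange(2)  # explore
            else:
                # Exploit, breaking ties between equally good actions at random.
                best = max(q[state])
                action = rng.choice([a for a in range(2) if q[state][a] == best])
            next_state, reward, done = env.step(action)
            # TD target: immediate reward plus discounted value of the best next action.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


q_table = q_learning(ChainEnv())
# Greedy action for each non-terminal state (0 = left, 1 = right).
greedy_policy = [max(range(2), key=lambda a: q_table[s][a]) for s in range(4)]
print(greedy_policy)
```

With these settings, the greedy policy read off `q_table` should choose action 1 (move right) in every non-terminal state, since that is the shortest path to the only rewarding state.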
As prerequisites for this lecture, we recommend the successful completion of Programming 1 and Programming 2, and basic knowledge of probability theory as taught, for example, in Mathematics for Computer Scientists 3.
Course Modalities
The course is a 6 ECTS Advanced Lecture, consisting of weekly lectures, bi-weekly assignment sheets and a final exam.
Lecture: every Monday at 14:15 in Bld E13, HS II, Start: Oct 30th
Assignment Sheets: bi-weekly, containing theoretical & programming exercises
You must achieve at least 50% of the total assignment points in order to be admitted to the exam.
Tutorial: bi-weekly on Thursday (different slots between 10-16), Start: Nov 2nd
Office Hours: bi-weekly on Thursday from 10-16, Start: Nov 9th
Exam: February 5th, 2024, 14:00-16:00
Re-Exam: March 25th, 2024, 10:00-12:00
The exam will take 90 minutes. For the exam, you are allowed to bring a two-sided handwritten DIN A4 sheet with you ("Cheat Sheet").
Literature
Reinforcement Learning: An Introduction, Richard S. Sutton and Andrew G. Barto