Decision Making and Reinforcement Learning
About this Course
This course is an introduction to sequential decision making and reinforcement learning. We start with a discussion of utility theory to learn how preferences can be represented and modeled for decision making. We first model simple decision problems as multi-armed bandit problems in and discuss several approaches to evaluate feedback. We will then model decision problems as finite Markov decision processes (MDPs), and discuss their solutions via dynamic programming algorithms. We touch on the notion of partial observability in real problems, modeled by POMDPs and then solved by online planning methods. Finally, we introduce the reinforcement learning problem and discuss two paradigms: Monte Carlo methods and temporal difference learning. We conclude the course by noting how the two paradigms lie on a spectrum of n-step temporal difference methods. An emphasis on algorithms and examples will be a key part of this course.Created by: Columbia University

Related Online Courses
By completing the Splunk Knowledge Manager 101, 102 & 103, you will be able to create knowledge objects including lookups, data models, and different types of fields. In addition to this, you will... more
The global mobile app market is set to soar over 14% per year by 2030 (Grand View Research), and the field of AI-driven mobile development is booming. Aspiring AI developers, software engineers,... more
This specialization is intended for people interested in health systems and how they function. Participants will learn about the global health systems landscape and the challenges and opportunities... more
This course teaches you the fundamentals of computational phenotyping, a biomedical informatics method for identifying patient populations. In this course you will learn how different clinical data... more
AWS: Networking and Content Delivery Course is the third course of Exam Prep DVA-C02: AWS Certified Developer Associate Specialization. This course covers fundamental concepts of Amazon Virtual... more