COMPSCI 188 - 2018-09-20 - Markov Decision Processes (MDPs) Part 2/2

COMPSCI 188 - 2018-09-20 - Markov Decision Processes (MDPs) Part 2/2

37.192 Lượt nghe
COMPSCI 188 - 2018-09-20 - Markov Decision Processes (MDPs) Part 2/2
COMPSCI 188, LEC 001 - Fall 2018 COMPSCI 188, LEC 001 - Pieter Abbeel, Daniel Klein Copyright @2018 UC Regents; all rights reserved "Slides (from 2018): https://inst.eecs.berkeley.edu/~cs188/fa18 Latest website: https://inst.eecs.berkeley.edu/~cs188 More resources: http://ai.berkeley.edu 00:00 Setup [no content] 03:22 Contest Results [outdated] 09:22 Review: MDPs 20:26 The Bellman Equations 25:41 Value Iteration 29:21 Convergence of Value Iteration 33:15 Policy Evaluation 39:02 Policy Evaluation: Example 41:14 Policy Evaluation: Computation 44:39 Policy Extraction 50:41 Break [no content] 53:28 Problems with Value Iteration 56:37 Policy Iteration 1:02:03 Policy Iteration: Q&A, Summary 1:06:26 Reinforcement Learning: Slots Demo 1:13:04 Reinforcement Learning Preview 1:16:40 End [no content]"