COMPSCI 188 - 2018-09-25 - Reinforcement Learning Part 1/2

40,476 plays
COMPSCI 188, LEC 001 - Fall 2018
COMPSCI 188, LEC 001 - Pieter Abbeel, Daniel Klein
Copyright ©2018 UC Regents; all rights reserved

Slides (from 2018): https://inst.eecs.berkeley.edu/~cs188/fa18
Latest website: https://inst.eecs.berkeley.edu/~cs188
More resources: http://ai.berkeley.edu

00:00 Setup [no content]
02:03 Announcements [outdated]
05:13 RL Introduction
07:15 RL Applications
15:26 RL Definition
18:40 Model-Based Learning
28:15 Model-Based vs. Model-Free Estimation
34:18 Passive RL
35:51 Direct Evaluation
41:40 Sample-Based Policy Evaluation?
45:47 Temporal Difference Learning
50:26 TD Learning: Example
53:33 Break [no content]
55:55 Problems with TD Learning
1:00:32 Active RL
1:02:01 Q-Value Iteration
1:05:50 Q-Learning
1:16:43 Q-Learning: Crawler Bot Demo
1:18:38 Q-Learning Properties
1:19:53 End [no content]