L3 Policy Gradients and Advantage Estimation (Foundations of Deep RL Series)

36.704 Lượt nghe

00:00

Update Required To play the media you will need to either update your browser to a recent version or update your Flash plugin.

Tải MP3

MÔ TẢ MP3TIẾP THEO

L3 Policy Gradients and Advantage Estimation (Foundations of Deep RL Series)

Lecture 3 of a 6-lecture series on the Foundations of Deep RL 
Topic: Policy Gradients and Advantage Estimation
Instructor: Pieter Abbeel

Slides: https://www.dropbox.com/s/7y82w1q70ftt2fv/l3-policy-gradient-and-advantage-estimation.pdf?dl=0					

L3 Policy Gradients and Advantage Estimation (Foundations of Deep RL Series)

Nhạc Theo Chủ Đề

Liên kết website