Reinforcement Learning from scratch

178,119 plays
How does Reinforcement Learning work? A short cartoon that intuitively explains this amazing machine learning approach, and how it was used in AlphaGo and ChatGPT. Part 1 of 3.

0:00 - intro
0:13 - pong
0:28 - the policy
0:51 - policy as neural network
1:32 - supervised learning
2:51 - reinforcement learning using policy gradient
4:24 - minimizing error using gradient descent
4:45 - probabilistic policy
5:01 - pong from pixels
6:58 - visualizing learned weights
8:18 - pointer to Karpathy "pong from pixels" blogpost