CMU Advanced NLP Fall 2024 (8): Reinforcement Learning and Human Feedback

CMU Advanced NLP Fall 2024 (8): Reinforcement Learning and Human Feedback

1.597 Lượt nghe
CMU Advanced NLP Fall 2024 (8): Reinforcement Learning and Human Feedback
This lecture (by Graham Neubig) for CMU CS 11-711, Advanced NLP (Fall 2024) covers: * Methods to Gather Feedback * Error and Risk * Reinforcement Learning * Stabilizing Reinforcement Learning Class Site: https://phontron.com/class/anlp-fall2024/