10 – Self / cross, hard / soft attention and the Transformer
Course website: http://bit.ly/DLSP21-web
Playlist: http://bit.ly/DLSP21-YouTube
Speaker: Alfredo Canziani

Chapters
00:00 – Welcome to class
00:15 – Listening to YouTube from the terminal
00:36 – Summarising papers with @Notion
01:45 – Reading papers collaboratively
03:15 – Attention! Self / cross, hard / soft
06:44 – Use cases: set encoding!
12:10 – Self-attention
28:45 – Key-value store
29:32 – Queries, keys, and values → self-attention
39:49 – Queries, keys, and values → cross-attention
45:27 – Implementation details
48:11 – The Transformer: an encoder-predictor-decoder architecture
54:59 – The Transformer encoder
56:47 – The Transformer “decoder” (which is an encoder-predictor-decoder module)
1:01:49 – Jupyter Notebook and PyTorch implementation of a Transformer encoder
1:10:51 – Goodbye :)
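As a companion to the “Queries, keys, and values → self-attention” and PyTorch implementation chapters, below is a minimal sketch of soft (scaled dot-product) self-attention. The function name, shapes, and projection matrices are illustrative assumptions and are not taken from the course notebook.

```python
import torch
import torch.nn.functional as F

def self_attention(x, W_q, W_k, W_v):
    """Soft self-attention over a set of t input vectors.

    x:   (t, d)  input set
    W_q, W_k, W_v: (d, d') learned projections (hypothetical names)
    """
    q = x @ W_q                              # queries (t, d')
    k = x @ W_k                              # keys    (t, d')
    v = x @ W_v                              # values  (t, d')
    scores = q @ k.T / k.shape[-1] ** 0.5    # (t, t) scaled similarity of queries to keys
    a = F.softmax(scores, dim=-1)            # soft attention: rows are convex combinations
    return a @ v                             # (t, d') attended outputs

# Usage with random data
t, d = 5, 16
x = torch.randn(t, d)
W_q, W_k, W_v = (torch.randn(d, d) for _ in range(3))
h = self_attention(x, W_q, W_k, W_v)
print(h.shape)  # torch.Size([5, 16])
```

Cross-attention follows the same pattern, except the queries come from one input set while the keys and values come from another; hard attention would replace the softmax with a one-hot selection (e.g. an argmax).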