CMU Advanced NLP Spring 2025 (21): Multimodal Modeling I

This lecture by Sean Welleck for CMU CS 11-711, Advanced NLP, covers:
- Vision architecture basics (ViT)
- Learning image representations (CLIP)
- Combining with a language model
