Swin Transformer paper animated and explained

Swin Transformer paper animated and explained

78.766 Lượt nghe
Swin Transformer paper animated and explained
Swin Transformer paper explained, visualized, and animated by Ms. Coffee Bean. Find out what the Swin Transformer proposes to do better than the ViT vision transformer. 📺 ViT explained: https://youtu.be/DVoHvmww2lQ 📺 Transformer explained: https://youtu.be/FWFA4DGuzSc 📺► Positional embeddings (playlist): https://youtube.com/playlist?list=PLpZBeKTZRGPOQtbCIES_0hAvwukcs-y-x ▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀ Thanks to our Patrons who support us in Tier 2, 3, 4: 🙏 donor, Dres. Trost GbR, Yannik Schneider ➡️ AI Coffee Break Merch! 🛍️ https://aicoffeebreak.creator-spring.com/ 🔥 Optionally, pay us a coffee to help with our Coffee Bean production! ☕ Patreon: https://www.patreon.com/AICoffeeBreak Ko-fi: https://ko-fi.com/aicoffeebreak ▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀▀ Paper discussed: 📜 Liu, Ze, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, and Baining Guo. "Swin transformer: Hierarchical vision transformer using shifted windows." arXiv preprint arXiv:2103.14030 (2021). https://arxiv.org/abs/2103.14030 💻 Swin Transformer code on GitHub: https://github.com/microsoft/Swin-Transformer Outline: 00:00 Problems with ViT / Swin Motivation 04:16 Swin Transformer explained 06:00 Shifted Window based Self-attention 08:58 positional embeddings in the Swin Transformer 09:29 Task performance of the Swin Transformer Music 🎵 : Bay Street Millionaires by Squadda B --------------------- 🔗 Links: AICoffeeBreakQuiz: https://www.youtube.com/c/AICoffeeBreak/community Twitter: https://twitter.com/AICoffeeBreak Reddit: https://www.reddit.com/r/AICoffeeBreak/ YouTube: https://www.youtube.com/AICoffeeBreak #AICoffeeBreak #MsCoffeeBean #MachineLearning #AI #research​ Video and thumbnail contain emojis designed by OpenMoji – the open-source emoji and icon project. License: CC BY-SA 4.0 16x16 pixels comprehensible artificial intelligence