This video is part of LLM University
https://docs.cohere.com/docs/transformer-models
Transformers are a new development in machine learning that have been making a lot of noise lately. They are incredibly good at keeping track of context, and this is why the text that they write makes sense. In this blog post, we will go over their architecture and how they work.
Bio:
Luis Serrano is the lead of developer relations at Co:here. Previously he has been a research scientist and an educator in machine learning and quantum computing. Luis did his PhD in mathematics at the University of Michigan, before embarking to Silicon Valley to work at several companies like Google and Apple. Luis is the author of the Amazon best-seller "Grokking Machine Learning", where he explains machine learning in a clear and concise way, and he is the creator of the educational YouTube channel "Serrano.Academy", with over 100K subscribers and 5M views.
===
Resources:
Blog post: https://txt.cohere.com/what-is-semantic-search/
Learn more: https://www.youtube.com/c/LuisSerrano
Neural Networks:
https://www.youtube.com/watch?v=BR9h47Jtqyw
Attention Models:
https://www.youtube.com/watch?v=j10yrR6PPfg