What are Transformer Models and How do they Work?

What are Transformer Models and How do they Work?

23.707 Lượt nghe
What are Transformer Models and How do they Work?
This video is part of LLM University https://docs.cohere.com/docs/transformer-models Transformers are a new development in machine learning that have been making a lot of noise lately. They are incredibly good at keeping track of context, and this is why the text that they write makes sense. In this blog post, we will go over their architecture and how they work. Bio: Luis Serrano is the lead of developer relations at Co:here. Previously he has been a research scientist and an educator in machine learning and quantum computing. Luis did his PhD in mathematics at the University of Michigan, before embarking to Silicon Valley to work at several companies like Google and Apple. Luis is the author of the Amazon best-seller "Grokking Machine Learning", where he explains machine learning in a clear and concise way, and he is the creator of the educational YouTube channel "Serrano.Academy", with over 100K subscribers and 5M views. === Resources: Blog post: https://txt.cohere.com/what-is-semantic-search/ Learn more: https://www.youtube.com/c/LuisSerrano Neural Networks: https://www.youtube.com/watch?v=BR9h47Jtqyw Attention Models: https://www.youtube.com/watch?v=j10yrR6PPfg