What are Transformer Models and How do they Work?

This video is part of LLM University
https://docs.cohere.com/docs/transformer-models

Transformers are a recent development in machine learning that has been making a lot of noise lately. They are incredibly good at keeping track of context, which is why the text they write makes sense. In this blog post, we will go over their architecture and how they work.

Bio: 
Luis Serrano is the lead of developer relations at Co:here. Previously, he was a research scientist and an educator in machine learning and quantum computing. Luis did his PhD in mathematics at the University of Michigan before heading to Silicon Valley to work at several companies, including Google and Apple. Luis is the author of the Amazon best-seller "Grokking Machine Learning", where he explains machine learning in a clear and concise way, and he is the creator of the educational YouTube channel "Serrano.Academy", with over 100K subscribers and 5M views.

===
Resources:
Blog post: https://txt.cohere.com/what-is-semantic-search/
Learn more: https://www.youtube.com/c/LuisSerrano
Neural Networks: https://www.youtube.com/watch?v=BR9h47Jtqyw
Attention Models: https://www.youtube.com/watch?v=j10yrR6PPfg
