Understanding the Llama 3 Tokenizer | Llama for Developers

9.997 Lượt nghe

00:00

Update Required To play the media you will need to either update your browser to a recent version or update your Flash plugin.

Tải MP3

MÔ TẢ MP3TIẾP THEO

Understanding the Llama 3 Tokenizer | Llama for Developers

Download Meta Llama 3 ➡️ https://go.fb.me/kbpn54

Aston Zhang, research scientist working on Llama at Meta discusses the new tokenizer in Meta Llama 3. He discusses the improvements made to the tokenizer in Meta's latest Llama 3 models. The new tokenizer uses Tiktoken instead of SentencePiece and has a larger vocabulary size of 128k, resulting in better performance on coding, reasoning, and more. The increased vocabulary size allows for more specific and nuanced encoding of inputs, while the higher compression ratio reduces the number of tokens required to represent an input. Additionally, the use of Group Query Attention helps balance out the increased memory and compute needs, resulting in a model that can process larger batches without increasing latency.

# Timestamps
00:00 Introduction
00:25 What's new in the Llama 3 tokenizer?
01:58 Vocabulary size and compression ratio
13:01 Performance, efficiency and improving costs
17:46 Recap and resources


# Additional Resources
• Dive into Deep Learning ebook: https://go.fb.me/ao405f 
• Getting Started Guide: https://go.fb.me/xucc2m 


#llama3 #llm #opensource 
- - - 
Subscribe: https://www.youtube.com/aiatmeta?sub_confirmation=1
Learn more about our work: https://ai.meta.com 

# Follow us on social media

Follow us on Twitter: https://twitter.com/aiatmeta/
Follow us on LinkedIn: https://www.linkedin.com/showcase/aiatmeta
Follow us on Threads: https://threads.net/aiatmeta
Follow us on Facebook: https://www.facebook.com/AIatMeta/

Meta AI focuses on bringing the world together by advancing AI, powering meaningful and safe experiences, and conducting open research.					

Understanding the Llama 3 Tokenizer | Llama for Developers

Nhạc Theo Chủ Đề

Liên kết website