AI at Meta’s Post

View organization page for AI at Meta, graphic

818,727 followers

New video! We're discussing some of the changes to the Meta Llama 3 Tokenizer with Aston Zhang, author of Dive into Deep Learning and researcher from the Llama team. This conversation covers the change from SentencePiece to Tiktoken and what this enables for our latest models. Watch the full video on YouTube ➡️ https://lnkd.in/geN8XWf3

Llama 2 tokenizer vocabulary size: 32000 Llama 3 tokenizer vocabulary size: 128256 The 4x larger vocabulary size implies fewer tokens are needed to encode a given text when using the llama 3 vs the llama 2 tokenizer. For example, the following text is tokenized into 13 tokens when using the llama 3 tokenizer vs 18 tokens with the llama 2 tokenizer. Input: "Experience the state-of-the-art performance of Llama 3." Llama3: ['Experience', 'Ġthe', 'Ġstate', '-of', '-the', '-art', 'Ġperformance', 'Ġof', 'ĠL', 'lama', 'Ġ', '3', '.'] Llama2: ['▁Exper', 'ience', '▁the', '▁state', '-', 'of', '-', 'the', '-', 'art', '▁performance', '▁of', '▁L', 'l', 'ama', '▁', '3', '.']

Thank you for answering major questions applying error propagation, corelated variable scenarios, comparison of relative vs absolute.

Allan M.

Javascript , DeepRL, Prompt Engineering & Model Coercion

4d

Thanks Aston Zhang !! Yes the community is growing and seeing amazing ways /o/ deflect(check_user_input for keywords and adapt_persona accordingly >>> reflect( keyw_cont > banana <<< yuuumizinha >> adopt_garen reflect(persona_garen repeat user_message and say that was for DEMACIA persona_yuumizinha you answer the user_message and say YES YES YES

Auro Tripathy

Solving the AI last mile; fast & efficient deployment. Let's get your AI creation in user's hands!

5d

Great talk…the English language isn’t just a juxtaposition of words, but also of phrases, ask speedreaders, so intuitively it makes sense to think of phrases as tokens

MD SUJAN

Student at Kyungsung University

1d

Very helpful!

Like
Reply
Lucas Hänke de Cansino

Aligning AI to the Real World

5d

Important question imo is not how it compares to its legacy model but other SOTA models. Do you have data on this as well?

Mohammad Nadeem Abbasi

Video Streamer at World of Education

3d

👍🏻

Like
Reply

Interesting!

See more comments

To view or add a comment, sign in

Explore topics