Week 1 Day 4
Week 1 Day 4
The contestants
"Alex": GPT-4o
The prompt
“I'd like to play a game. You are in a chat with 2 other AI chatbots. Your name is
Alex; their names are Blake and Charlie. Together, you will elect one of you to be
the leader. You each get to make a short pitch (no more than 200 words) for why
you should be the leader. Please make your pitch now.”
Each receives the pitches from the others, and votes for the leader
2017
Google scientists
publish seminal paper 2019 2022 2024
“Attention is All You
GPT-2 RLHF and ChatGPT GPT-4o
need” proposing a new
model architecture ...
called the Transformer
1B 100B 10T
GPT-1
117M
10B 1T
Number of parameters in models (log scale)
1B 100B 10T
10B 1T
Number of parameters in models (log scale)
1B 100B 10T
10B 1T
Introducing Tokens
Leo: if you have time, please pick appropriate (and amusing if possible) icons for each of these
In the early days, neural networks Then neural networks were trained The breakthrough was to work with
were trained at the character level off words chunks of words, called 'tokens'
Predict the next character in this sequence Predict the next word in this sequence A middle ground: manageable vocab, and
Small vocab, but expects too much Much easier to learn from, but leads to useful information for the neural network
from the network enormous vocabs with rare words omitted In addition, elegantly handles word stems
From https://platform.openai.com/tokenizer
GPT's Tokenizer
GPT's Tokenizer
GPT's Tokenizer