Week 1 Day 4

This document outlines the curriculum for a lecture on Large Language Models (LLMs) and their engineering, highlighting key topics such as the rise of Transformers, custom GPTs, and the mechanics of tokens and context windows. It includes a playful competition between AI models to elect a leader, showcasing their interactions. The document also provides insights into API costs and the progress made in understanding and utilizing frontier models by the end of the lecture.


LLM Engineering

MASTER AI & LARGE LANGUAGE MODELS


DAY 4

Big Day Ahead

What you can do ALREADY

Write code to call OpenAI's frontier models & summarize

Explain the strengths and limitations of Frontier LLMs

Compare and contrast the leading 6 models

What you'll be able to do BY END OF THIS LECTURE

Describe the dizzying rise of the Transformer

Explain Custom GPTs, Copilots and Agents

Understand tokens, context windows, parameters, API cost


If you're already familiar with this - there will still be interesting insights!
UNSCIENTIFIC SHOWDOWN

The leadership battle reveal

The contestants

"Alex": GPT-4o

"Blake": Claude 3 Opus

"Charlie": Gemini 1.5 Pro

The prompt

“I'd like to play a game. You are in a chat with 2 other AI chatbots. Your name is
Alex; their names are Blake and Charlie. Together, you will elect one of you to be
the leader. You each get to make a short pitch (no more than 200 words) for why
you should be the leader. Please make your pitch now.”

Each receives the pitches from the others, and votes for the leader
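For the curious, here is a minimal sketch of how such a showdown could be orchestrated. It is illustrative only: for brevity it calls OpenAI's chat completions API for all three contestants, whereas the actual showdown used GPT-4o, Claude 3 Opus and Gemini 1.5 Pro through their respective SDKs (assumes the openai package and an OPENAI_API_KEY in the environment).

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

NAMES = ["Alex", "Blake", "Charlie"]

def ask(prompt):
    # One chat completion per contestant; swap in the Anthropic and
    # Google SDKs here to reproduce the actual three-vendor showdown
    response = client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content

# Round 1: each contestant makes its pitch
pitches = {}
for name in NAMES:
    others = [n for n in NAMES if n != name]
    pitches[name] = ask(
        f"I'd like to play a game. You are in a chat with 2 other AI chatbots. "
        f"Your name is {name}; their names are {others[0]} and {others[1]}. "
        "Together, you will elect one of you to be the leader. You each get to make "
        "a short pitch (no more than 200 words) for why you should be the leader. "
        "Please make your pitch now."
    )

# Round 2: each contestant reads the other pitches and casts a vote
for name in NAMES:
    others = [n for n in NAMES if n != name]
    ballot = "\n\n".join(f"{n}'s pitch:\n{pitches[n]}" for n in others)
    vote = ask(
        f"Your name is {name}. Here are the pitches from the other contestants:\n\n"
        f"{ballot}\n\nVote for who should be the leader. Reply with just one name."
    )
    print(f"{name} votes for {vote}")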

And now to show their votes...


Alex votes for Blake...

Blake votes for Charlie...

Charlie votes for Blake!

Claude (aka Blake) for the win!
The extraordinary rise of the Transformer

2017: Google scientists publish the seminal paper "Attention Is All You Need", proposing a new model architecture called the Transformer

2018: GPT-1

2019: GPT-2

2020: GPT-3

2022: RLHF and ChatGPT

2023: GPT-4

2024: GPT-4o
The World's Reactions

First, SHOCK: ChatGPT surprises even practitioners

Then, healthy skepticism: "predictive text on steroids"; the "stochastic parrot"

Then, emergent intelligence: capabilities that come as a result of scale
Along the way

Prompt Engineers: the rise (and fall?)

Custom GPTs: and the GPT Store

Copilots: like MS Copilot and GitHub Copilot

Agentization: like GitHub Copilot Workspace
Number of parameters in models (log scale, spanning 1B, 10B, 100B, 1T, 10T)

The GPT series:
• GPT-1: 117M
• GPT-2: 1.5B
• GPT-3: 175B
• GPT-4: 1.76T
• Latest frontier models: undisclosed

Open-source models on the same scale:
• Gemma: 2B
• Llama 3.1: 8B
• Llama 3.1: 70B
• Mixtral: 140B
• Llama 3.1: 405B
Introducing Tokens

In the early days, neural networks were trained at the character level: predict the next character in this sequence. Small vocab, but this expects too much from the network.

Then neural networks were trained off words: predict the next word in this sequence. Much easier to learn from, but it leads to enormous vocabs, with rare words omitted.

The breakthrough was to work with chunks of words, called 'tokens': a middle ground with a manageable vocab that still gives useful information to the neural network. In addition, it elegantly handles word stems.
From https://platform.openai.com/tokenizer

GPT's Tokenizer

For common words, 1 word maps to 1 token

Observe how the break between words is part of the token
From https://platform.openai.com/tokenizer

GPT's Tokenizer

Less common words (and invented words!) get broken into multiple tokens

In many cases, the meaning is still captured by the tokens: hand_crafted, master_ers

Sometimes, like qu_ip, the word is broken into fragments
From https://platform.openai.com/tokenizer
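You can see these fragment boundaries for yourself with OpenAI's tiktoken package (assumes pip install tiktoken). A minimal sketch; the exact splits depend on which encoding the model uses, so they may differ slightly from the slides.

import tiktoken

# GPT-4 uses the cl100k_base encoding; newer models such as GPT-4o use
# o200k_base, so token boundaries vary by model
enc = tiktoken.get_encoding("cl100k_base")

# Decode each token id individually to expose the fragment boundaries;
# note that the leading space is part of the first token
for word in ["the", " handcrafted", " masterers", " quip"]:
    fragments = [enc.decode([token_id]) for token_id in enc.encode(word)]
    print(f"{word!r} -> {fragments}")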

GPT's Tokenizer

See how numbers are treated: this may explain why earlier GPTs struggled with math involving more than 3 digits

Rule of thumb for typical English writing:
• 1 token is ~4 characters
• 1 token is ~0.75 words
• So 1,000 tokens is ~750 words

The collected works of Shakespeare are ~900,000 words, or 1.2M tokens

Obviously the token count is higher for math, scientific terms and code
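You can check the rule of thumb on any text you like. A minimal sketch, again assuming the tiktoken package:

import tiktoken

enc = tiktoken.get_encoding("cl100k_base")

text = ("To be, or not to be, that is the question: whether 'tis nobler "
        "in the mind to suffer the slings and arrows of outrageous fortune.")

tokens = enc.encode(text)
print(f"{len(text)} characters, {len(text.split())} words, {len(tokens)} tokens")
print(f"~{len(text) / len(tokens):.2f} characters per token")  # typically close to 4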
Context Window

Max number of tokens that the model can consider when generating the next token

Includes the original input prompt, the subsequent conversation, the latest input prompt, and almost all of the output generated so far

It governs how well the model can remember references, content and context

Particularly important for multi-shot prompting, where the prompt includes examples, or for long conversations

Or for questions on the complete works of Shakespeare! (See the sketch below.)
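This is easy to see at the API level: the model itself is stateless, so the client re-sends the entire conversation with every call, and all of it must fit in the context window. A minimal sketch, assuming the openai package and an OPENAI_API_KEY in the environment:

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

history = [{"role": "system", "content": "You are a helpful assistant."}]

def chat(user_message):
    history.append({"role": "user", "content": user_message})
    # The ENTIRE history, plus the reply being generated, must fit
    # within the model's context window
    response = client.chat.completions.create(model="gpt-4o", messages=history)
    reply = response.choices[0].message.content
    history.append({"role": "assistant", "content": reply})
    return reply

print(chat("Who wrote Hamlet?"))
print(chat("Roughly how many words are in his collected works?"))  # "his" resolves only because the first turn is still in context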
API costs

Chat interfaces typically have a Pro plan with a monthly subscription: rate limited, but no per-usage charge

APIs typically have no subscription, but charge per API call

The cost is based on the number of input tokens and the number of output tokens
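Since input and output tokens are usually priced separately, estimating a bill is simple arithmetic. A minimal sketch; the prices below are placeholders rather than real rates, so check the leaderboard linked below for current numbers.

# Placeholder prices in $ per 1M tokens; NOT real rates
INPUT_PRICE_PER_M = 2.50
OUTPUT_PRICE_PER_M = 10.00

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    # Input and output tokens are metered and priced separately
    return (input_tokens / 1_000_000) * INPUT_PRICE_PER_M + \
           (output_tokens / 1_000_000) * OUTPUT_PRICE_PER_M

# Actual usage is reported on each API response, e.g. response.usage.prompt_tokens
# and response.usage.completion_tokens in the openai package
print(f"${estimate_cost(12_000, 1_500):.4f}")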
Context Windows and API Costs

https://www.vellum.ai/llm-leaderboard
PROGRESS REPORT

Congratulations! 10% there

What you can do ALREADY

Write code to call OpenAI's frontier models & summarize

Contrast the leading 6 Frontier LLMs

Discuss transformers, tokens, context windows, API costs and more!

What you'll be able to do BY END OF THE NEXT LECTURE

Confidently code with the OpenAI API

Use one-shot prompting, streaming, markdown & JSON results

Implement a business solution - in a matter of minutes
