
LARGE LANGUAGE MODELS

A REVIEW FOR LE AI HACKATHON

Shay Zweig, 19/4/2023


WHAT ARE WE GOING TO COVER
• What are large language models
• LLM strengths and limitations
• Prompt engineering
• Context and short-term memory
• Retrieval augmented generation
• Augmenting LLMs: tools, plugins and agents
• Helpful libraries (LangChain)
WHAT ARE LANGUAGE MODELS?
Given a sequence of words – predict the next word.

The room in the hotel was ___
• Amazing: 0.4
• Disappointing: 0.3
• Clean: 0.19
• Haunted: 0.01

Has been around for a while.
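The word-prediction task above can be illustrated with a toy sketch: the model assigns a probability to each candidate next word, and greedy decoding picks the most likely one. (A real LLM scores an entire vocabulary, not four hand-picked words.)

```python
# Candidate next words and the probabilities from the slide's example.
candidates = {"Amazing": 0.4, "Disappointing": 0.3, "Clean": 0.19, "Haunted": 0.01}

def predict_next_word(probs):
    # Greedy decoding: return the highest-probability candidate.
    return max(probs, key=probs.get)

print("The room in the hotel was", predict_next_word(candidates))  # -> Amazing
```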
WHAT ARE LARGE LANGUAGE MODELS (LLM)?
Large language models are huge artificial neural networks trained on the word prediction task.
• Not magic – function optimization
• Only trained to predict the next word*

How big? (GPT-3)
• 175B parameters
• Trained on all the internet, ~500B tokens

Why does it work?
• Quality data
• Turns out word completion is a great task
• Also – we don't know

* also RLHF
WHAT IS IT GOOD FOR?
Completion – write a rap song about Luxury Escapes
Q&A – When was Luxury Escapes founded?
Summarization – summarize the following document...
Classification – given a hotel description, classify it into one or more of the following classes: [family friendly, city break...]
Knowledge extraction – given a hotel description, extract the name, location, number of rooms...
Sentiment analysis – what is the sentiment of the following text: "The hotel was..."
Paraphrasing – rewrite the following text in 10 different styles
Coding – write a Python function that takes a document and analyzes...

And much more!


LIMITATIONS – BE CAREFUL...
• Hallucinations and alignment
• Knowledge cutoff
• Consistency and predictability – how do I know I get the right result?
• Evaluation – how to evaluate the results?
• Number of tokens
• Cost of inference
• ...
OPENAI API
Completion: text-davinci-003

import os
import openai

openai.api_key = os.getenv("OPENAI_API_KEY")
response = openai.Completion.create(model="text-davinci-003", prompt="Say this is a test")

Chat completion: gpt-4 / gpt-3.5-turbo

openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Who won the world series in 2020?"},
        {"role": "assistant", "content": "The Los Angeles Dodgers won the World Series in 2020."},
        {"role": "user", "content": "Where was it played?"},
    ],
)

Embeddings: text-embedding-ada-002

def get_embedding(text, model="text-embedding-ada-002"):
    text = text.replace("\n", " ")
    return openai.Embedding.create(input=[text], model=model)["data"][0]["embedding"]
PROMPT ENGINEERING
The prompt: our main way to control the model's behavior.

From: task-specific model training and fine-tuning
To: emergent behaviour – zero-shot and few-shot learning
PROMPT ENGINEERING
The prompt: the "programming language" of the model.

Instructions: (Answer the user query given the specified hotel description. If there is no information in the description to answer the query, answer "I don't know".)

Context: (hotel description: Fly high in one of the world's ultimate...)

Examples (few shots):

Query: Did the hotel win any prizes?
Answer: Yes, it won the Tripadvisor Travellers' Choice for 2020

Query: Does it have a gluten free meal?
Answer: I don't know

User input: Query: Can I charge my electric car in the hotel?

Output indicator: Answer:
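The prompt parts above (instructions, context, few-shot examples, user input, output indicator) can be assembled into one string with plain Python. This is a sketch using the slide's own example texts, not a library API:

```python
# Assemble a prompt from its named parts: instructions, context,
# few-shot examples, the user query, and the output indicator.
def build_prompt(instructions, context, examples, query):
    shots = "\n\n".join(f"Query: {q}\nAnswer: {a}" for q, a in examples)
    return (
        f"{instructions}\n\n"
        f"Hotel description: {context}\n\n"
        f"{shots}\n\n"
        f"Query: {query}\n"
        f"Answer:"
    )

prompt = build_prompt(
    instructions=('Answer the user query given the specified hotel description. '
                  'If there is no information in the description to answer the query, '
                  'answer "I don\'t know".'),
    context="Fly high in one of the world's ultimate...",
    examples=[
        ("Did the hotel win any prizes?",
         "Yes, it won the Tripadvisor Travellers' Choice for 2020"),
        ("Does it have a gluten free meal?", "I don't know"),
    ],
    query="Can I charge my electric car in the hotel?",
)
```

The trailing `Answer:` output indicator nudges the model to continue with the answer itself rather than restating the question.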


CHAT PROMPTS – ROLES
Only relevant in the new chat models (GPT-3.5/GPT-4)

system: "You are LeGPT. You are an expert in travel; you can answer questions in reference to provided context. You answer questions in a fun and engaging way."
→ System prompts are used for general instructions to the model – they are more useful in GPT-4.

user: I want to take my family on a vacation in December, where should I go?
→ User prompts are used for the user interaction within the conversation.

assistant: December is a great time to take a family vacation! If you're looking for a fun and festive experience, I suggest visiting one of the many Christmas markets in Europe. Germany, Austria, and Switzerland are known for their beautiful markets...
→ Assistant prompts are used for the model's responses within the conversation.
PROMPT ENGINEERING - TIPS AND TRICKS
• Tell the model its role: "As an expert in..."
• Be as explicit and elaborate as possible:
• ...If you don't have the answer, say: I don't know
• ...No more than 60 words, but can be less than 60 words.

• Chain of thought reasoning (CoT) - Let's think step by step


• A good reference:
https://lilianweng.github.io/posts/2023-03-15-prompt-engineering/
• Automatic prompt generator (careful – expensive....):
https://github.com/keirp/automatic_prompt_engineer
THE TEMPERATURE PARAMETER
The temperature parameter sets the randomness level of the model.

Temperature = 0 ==> an (almost) deterministic output

Temperature = 1 ==> increased randomness – different outputs every time, higher "creativity"
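Under the hood, temperature divides the model's logits before the softmax: low temperature sharpens the distribution toward the top choice, high temperature flattens it. A minimal sketch with made-up logits:

```python
import math

# Temperature-scaled softmax: divide logits by the temperature, then
# normalize. T -> 0 sharpens the distribution; T > 1 flattens it.
def softmax_with_temperature(logits, temperature):
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.5]          # hypothetical scores for 3 tokens
cold = softmax_with_temperature(logits, 0.1)  # near-deterministic
hot = softmax_with_temperature(logits, 2.0)   # closer to uniform
```

With temperature 0.1 the top token gets almost all of the probability mass; with temperature 2.0 the mass is spread more evenly, which is why high-temperature sampling gives different outputs every time.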
CONTEXT – SHORT-TERM MEMORY
• LLMs are stateless; all the context needs to be passed in the prompt...

"total_tokens": 263
"total_tokens": 488

Problem: token* explosion
• Every model has a token limit (4K for ChatGPT / 8K for GPT-4)
• Billing is usually by the token, as well as runtime

Possible solutions:
• Context window (include only the last X iterations)
• Summarization (summarize the chat to this point)

* Tokens are the atoms of the language model – each token can be one or more words, or even part of a word.
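The "context window" solution above can be sketched in a few lines: keep the system message plus only the last N chat turns, so the prompt stays under the token limit. The message format mirrors the OpenAI chat API; the trimming logic itself is an illustrative assumption, not a library feature.

```python
# Keep the system message plus only the last `keep_last` turns.
def windowed_messages(messages, keep_last=4):
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    return system + rest[-keep_last:]

# Build a long fake chat history: 1 system message + 10 user/assistant turns.
history = [{"role": "system", "content": "You are a helpful assistant."}]
for i in range(10):
    history.append({"role": "user", "content": f"question {i}"})
    history.append({"role": "assistant", "content": f"answer {i}"})

trimmed = windowed_messages(history, keep_last=4)  # system + last 4 messages
```

The summarization alternative would instead replace the dropped turns with one extra message containing an LLM-generated summary of the conversation so far.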
CONTEXT – FEW SHOTS
Providing the model with examples of the desired behavior will greatly improve performance.

Problem:
• Few shots increase the number of tokens significantly...

Possible solutions:
• Example selection
• Fine-tuning (only available for GPT-3)
CONTEXT
Remember – Always count your tokens...
https://github.com/openai/openai-cookbook/blob/main/examples/How_to_count_tokens_with_tiktoken.ipynb
RETRIEVAL AUGMENTED GENERATION
• The best way to handle: knowledge cutoff, hallucinations, referencing and using internal
knowledge
• Use the strengths of the generative model but ground it in external knowledge.
EMBEDDINGS
An embedding model maps each document to a vector of numbers. Each of the deal titles below gets its own vector; one example is shown:

• "Ultimate All-Inclusive Pullman Maldives Villas with Unlimited Drinks & Roundtrip Domestic Malé Flights"
• "Top-Rated Five-Star Maldives Paradise with Two Infinity Pools & Eight Restaurants" → [2, -79, 1, 4, -30, 26, 8, 94, -1, ...]
• "Vibrant Five-Star Pullman Stay in the Heart of Melbourne CBD's Shopping & Dining District with Daily Breakfast"
RETRIEVAL AUGMENTED GENERATION
• Embeddings + vector DB + retrieval + contextual generation

Pipeline (diagram): documents → embedding → embedding vectors → store in vector DB*; query → embedding → kNN similarity search → similar docs → context → LLM generation → response

* pinecone/Chroma/Faiss
Tutorial link
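The retrieval half of the pipeline can be sketched without any vector DB at all. Here a stand-in bag-of-words "embedding" and a brute-force kNN search illustrate the idea; a real system would use an embedding model (e.g. text-embedding-ada-002) and a vector store, and the tiny `vocab` is purely a hypothetical for the demo:

```python
import math

# Stand-in "embedding": count occurrences of a tiny fixed vocabulary.
def embed(text):
    vocab = ["maldives", "pullman", "melbourne", "pools", "breakfast"]
    words = text.lower().split()
    return [words.count(w) for w in vocab]

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

# Brute-force kNN: score every document against the query embedding.
def knn(query, docs, k=1):
    q = embed(query)
    scored = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return scored[:k]

docs = [
    "Ultimate All-Inclusive Pullman Maldives Villas",
    "Vibrant Pullman Stay in Melbourne with Daily Breakfast",
]
best = knn("five star maldives paradise with infinity pools", docs, k=1)
```

The retrieved documents would then be pasted into the prompt as context, which is the "contextual generation" step of the slide's pipeline.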
RETRIEVAL AUGMENTED GENERATION
• Use cases:
• Search
• Recommendation
• In-context QA
• …

The pipeline is the same as on the previous slide: documents are embedded and stored in a vector DB*; a query is embedded, similar documents are retrieved with kNN similarity search, and the results are passed as context to the LLM for generation of the response.

* pinecone/Chroma/Faiss
Tutorial link
RETRIEVAL AUGMENTED GENERATION
• Pros
• Reduces hallucinations dramatically!
• LLM augmentation with new and external knowledge (like organizational knowledge)
• Can reduce cost (embeddings are cheap and computed once)
• Leverages LLM strengths for generation
• Allows referencing

• Cons
• Complexity:
• preprocessing the data
• vector DB ops
• Cost: vector DB costs
AUGMENTING LLMS – TOOLS
• External APIs meant to augment LLMs, such as:
• Search
• Calculator
• DB query
• Bash / Python interpreter
• Other AI models (HuggingGPT)
• Humans!
• …

• Each tool should have a good description of its function and expected input.


AUGMENTING LLMS - PLUGINS
• Exposing external API to OpenAI’s ChatGPT
• You can think of it as an app store for the new chat UI
LLM AGENTS
• LLMs that can plan, use tools and self-improve:
• Plan a strategy to reach a goal
• Perform tasks (use of external tools or internal subtasks)
• Take observations from tasks
• Self-reflect and improve

Be careful! Use of agents can get expensive…

AutoGPT, Demo, LangChain Agents
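The plan → act → observe → reflect loop can be sketched in plain Python with a hypothetical calculator tool. In a real agent the LLM itself chooses the tool and its input at each step; here the "plan" step is hard-coded purely for illustration:

```python
# Toy tool -- never eval untrusted input in real code.
def calculator(expression):
    return str(eval(expression))

TOOLS = {"calculator": calculator}

def run_agent(goal, max_steps=3):
    observations = []
    for _ in range(max_steps):
        # Plan: a real agent would ask the LLM which tool to call and with what.
        tool, tool_input = "calculator", goal
        observation = TOOLS[tool](tool_input)  # Act: call the chosen tool
        observations.append(observation)       # Observe: record the result
        if observation:                        # Reflect: stop once we have an answer
            return observation
    return None

answer = run_agent("17 * 3")
```

Each tool call in a real agent means at least one extra LLM round-trip, which is exactly why the slide warns that agents can get expensive.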


LANGCHAIN – A LIBRARY FOR LLM APP DEV
A lot of wrappers and functionality – makes life very easy!
• Multiple LLMs
• Prompt templates and selectors
• Chains (composing different LLM components together)
• Memory (multiple types)
• Agents
• Indexes (Vector db wrappers)
• Loaders (easily load text data) Tutorial link

Should be the go-to lib for the AI hackathon
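LangChain's central idea – chaining a prompt template into an LLM call – can be sketched in plain Python rather than with the library itself. `fake_llm` is a stand-in for a real model call, and `make_chain` is an illustrative assumption, not LangChain's actual API:

```python
# Stand-in for a real LLM call (e.g. an OpenAI request).
def fake_llm(prompt):
    return f"[model output for: {prompt}]"

# A "chain" is just composition: fill the template, then call the model.
def make_chain(template, llm):
    return lambda **kwargs: llm(template.format(**kwargs))

summarize = make_chain("Summarize the following hotel description: {doc}", fake_llm)
result = summarize(doc="Fly high in one of the world's ultimate...")
```

LangChain adds the same pattern plus memory, agents, and vector-store indexes on top, which is what makes it a good default for the hackathon.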


NOT ONLY OPENAI
• AI21Labs (Task specific APIs)
• CoHere (multilingual embeddings!)
• HuggingFace – many open-source models
• Anthropic (closed beta)
REFERENCES

• Great post about production LLMs: link
• Prompt engineering tricks tl;dr: link
• Retrieval augmented generation tutorial: link
• LangChain crash course: link
• Data preprocessing for link
