OpenAI Generative Pre-Trained Transformer 3 (GPT-3) For Developers

Artificial intelligence is making huge progress in mimicking the cognitive capabilities of humans.

One such breakthrough in this area is the Generative Pre-trained Transformer 3 (GPT-3) language
model, which can perform a wide range of natural language tasks. GPT-3 can easily solve problems
including text summarization, classification, question answering, writing articles, generating
functioning code, and many more.

This course will help learners understand the GPT-3 language model and its possible applications,
and will enable them to use GPT-3 for their specific language tasks. Learners will also be
introduced to the details of the transformer model, which is the core of GPT-3.

After completing the course, learners will be able to:

 Understand GPT-3 and its possible applications

 Become familiar with the GPT-3 playground and its configurations

 Use GPT-3 to solve specific natural language tasks using the playground and the Python
programming language

 Understand the building blocks of a language model

 Understand the working of the transformer model, which is the core of GPT-3

In artificial intelligence/machine learning, a model is an algorithm trained on a huge corpus of data
and human input to perform tasks that would otherwise be performed by humans given the same
information. Models augment human intelligence by performing tasks quickly, at scale, and
efficiently.

Generative Pre-trained Transformer 3 (GPT-3) is a language model trained to produce human-like
text. At its core, GPT-3 is a neural-network-based deep learning model called a "transformer".
The model produces an output text sequence for a given input text sequence and is primarily
designed to perform language tasks like text summarization, text classification, language
translation, question answering, and so on.

GPT-3 was developed by OpenAI, a San Francisco-based artificial intelligence research laboratory,
and was introduced in May 2020. It is the third-generation language model in the GPT-n series,
succeeding GPT-2.

Alongside GPT-2, GPT-3 joins the list of other pre-trained language models like Google’s BERT,
Facebook’s RoBERTa and Microsoft’s Turing-NLG, among others. These pre-trained language models
are trained on massive generic data sets and can solve a variety of language tasks.

GPT-3 has become popular and has attracted attention due to the following factors:

Size:

1. GPT-3 is the largest language model created to date. The largest version of GPT-3 has 175
billion parameters and is pre-trained on diverse and enormous text data sets consisting of
billions of words.
2. The training data includes text from Common Crawl, a publicly available dataset created by
crawling the internet, along with other texts selected by OpenAI, including the text of
Wikipedia. This vast size makes GPT-3 perform significantly better than other language
models.

Simplicity & Performance:

1. Most pre-trained language models are trained on a diverse and large corpus of unlabeled
text data, and the pre-trained model is then fine-tuned to perform specific language tasks
like summarization, classification, etc.

2. In contrast, GPT-3 goes one step further and does not require fine-tuning of the pre-trained
model. Users can interact with GPT-3 by giving any text prompt, such as a phrase or a
sentence, and GPT-3 returns a text completion in natural language. Users can also program
GPT-3 by showing it a few examples to perform more complex language tasks.

3. GPT-3 can perform a range of tasks like writing articles, answering questions, classifying text,
summarizing text, creating SQL queries from a natural language description, generating
functioning code, code translation, and so on.

Currently, GPT-3 is not open source, and OpenAI makes the model available through a commercial
API which you can find here: https://beta.openai.com/. Users can interact with the API through
HTTP requests from different programming languages. OpenAI officially supports Python bindings.
Users can also use other languages like C#/.NET, Crystal, Dart, Go, Java, JavaScript/Node,
Ruby and Unity through community libraries built and maintained by the broader developer
community.
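Because the API is reached over plain HTTP, a request can be sketched in Python without any SDK. The sketch below only builds the request (it does not send it); the endpoint path and field names follow the public API documentation of the time, and YOUR_API_KEY is a placeholder for a real secret key:

```python
import json

def build_completion_request(prompt, engine="davinci",
                             max_tokens=64, temperature=0.7):
    """Build (url, headers, body) for a POST to the completions endpoint.

    Nothing is sent here -- this only shows the shape of the request.
    """
    url = "https://api.openai.com/v1/engines/%s/completions" % engine
    headers = {
        "Content-Type": "application/json",
        "Authorization": "Bearer YOUR_API_KEY",  # placeholder, not a real key
    }
    body = json.dumps({
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": temperature,
    })
    return url, headers, body

url, headers, body = build_completion_request("i saw a cat drinking milk")
```

With the official Python bindings, roughly the same request reduces to a single library call instead of hand-built HTTP.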

To work with GPT-3 you will need the following:

 OpenAI Account

 OpenAI GPT-3 license.

 Python 3.6 or above (optional; needed when you want to create standalone GPT-3
applications with Python)

The Playground is a web-based interface that allows users to experiment with and iterate over a
range of use cases or problems using the GPT-3 language model. Before exploring the Playground,
one needs to understand the basic constructs of the OpenAI API.

There are three concepts that are core to the API: prompt, completion, and tokens.

 The “prompt” is the text input to the API.

 The “completion” is the text that the API generates based on the prompt.

 “Tokens” can be thought of as pieces of words.

For example:

Prompt: “i saw a cat drinking milk”


Completion: i saw a cat drinking milk in the street. He was a white cat with black spots, a very large
cat. He was not afraid of people, and he sat there, licking his whiskers, and cleaning his paws. "That's
a strange cat," said the old man. "He acts as if he owned the street."

To use the API, the user gives a text prompt (the text-based input or “instructions” you provide to
the API) and it returns a text completion, attempting to match the context or pattern you gave it.
A well-written prompt provides enough information for the API to know what you are asking for and
how it should respond.

Note: One limitation to keep in mind is that, combined, the text prompt and generated completion
must be below 2048 tokens (roughly 1500 words). Read more about the key concepts and prompt
design in the “Documentation” section.
You can explore additional examples across categories like Translation, Conversation, Generation,
Classification, Answering, etc.

The examples can be accessed at https://beta.openai.com/examples.

The API provided by OpenAI is a text-in, text-out interface that can be used for various language
tasks. To use the API, one enters a text prompt: a text-based input, a set of instructions, or a
set of examples provided to the API. Based on the text prompt, the GPT-3 model generates the
completion text in relation to the context and examples given in the prompt.

The GPT-3 model is trained on a variety of data from the internet, including Wikipedia. Training
was stopped in October 2019, so the GPT-3 model may not be aware of events after October 2019.
OpenAI plans to add continuous training in the future.

Three main concepts form the core of the API:

Prompt: Prompt is the text input given to the API.

Completion: Completion is the text generated by the API based on the text input given in the
Prompt.

It is important to know that the completion generated by the API is stochastic in nature, which
means that even though the text prompt is the same, the API might generate slightly different
completions each time you call it.

Tokens: The API turns the input text prompt into tokens (pieces of words) prior to processing. For
example, the word “Descartes” can be broken into three tokens: “Desc”, “art”, “es”. Simple words
like “pear” will not be broken further and are treated as a single token.

Note: One limitation to keep in mind is that the total number of tokens from the combined text
input and completion must be less than 2048 (~1500 words).
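A rough budget check against this limit can be sketched in plain Python. The 4-characters-per-token ratio below is only a commonly cited rule of thumb for English text, not the tokenizer's real output:

```python
def estimate_tokens(text):
    """Very rough token estimate (~4 characters per token for English).
    The real count comes from the model's byte-pair-encoding tokenizer,
    so treat this only as a sanity check."""
    return max(1, round(len(text) / 4))

def fits_in_context(prompt, max_completion_tokens, limit=2048):
    """Check that the prompt plus the requested completion length
    stay under the combined 2048-token limit."""
    return estimate_tokens(prompt) + max_completion_tokens <= limit
```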

Playground: The Playground is a simple text-box-like interface, where one can write the input text
prompt and click the Submit button to generate the text completion.
Prompt design:

There are three basic guidelines for making the best use of prompts:

1. Show and tell: If you want GPT-3 to perform a certain task, such as sentiment classification,
the prompt can contain a clear instruction stating the task, followed by some examples.

2. Quality of data: Make sure the data provided in the examples is clear and free from errors.

3. Check settings: If the answer expected from the GPT-3 model is deterministic (there is only one
right answer), it is recommended to use lower values for temperature and top_p in the settings. If
the expected answer is open-ended, it is recommended to use higher values for temperature and
top_p.
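The "show and tell" guideline can be illustrated by assembling a few-shot prompt in Python. The instruction wording and example sentences below are hypothetical, not taken from the API documentation:

```python
def sentiment_prompt(examples, new_text):
    """Build a 'show and tell' prompt: a clear task instruction,
    followed by labelled examples, ending with the text to classify."""
    lines = ["Classify the sentiment of each sentence as Positive or Negative.", ""]
    for text, label in examples:
        lines.append("Sentence: " + text)
        lines.append("Sentiment: " + label)
        lines.append("")
    lines.append("Sentence: " + new_text)
    lines.append("Sentiment:")  # the model completes from here
    return "\n".join(lines)

prompt = sentiment_prompt(
    [("I loved the movie!", "Positive"), ("The food was awful.", "Negative")],
    "What a wonderful day.",
)
```

Ending the prompt with "Sentiment:" nudges the model to answer with just a label rather than free-form text.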

Engines:

OpenAI offers access to four different engines: Ada, Babbage, Curie and Davinci. Davinci is the
most capable engine and can provide nearly accurate predictions for many language tasks, so it is
recommended during experimentation and development. The other engines have the advantage of lower
latency compared to Davinci and can be chosen if they perform well enough for a particular
application.
Step 2: Word Embedding

 Token IDs are just numbers; they do not represent any meaning. These token IDs are
transformed into fixed-length vectors called word embeddings.

 Word embeddings are useful, but they do not capture contextual information. For example,

Sentence 1: There is a tree next to river bank.

Sentence 2: I deposited some money to my bank account.

 The word “bank” is used in different contexts in the above sentences, but the word
embedding will not capture this contextual information: “bank” will have the same embedding
in both sentences.

 Therefore, to generate context aware vectors, we use a mechanism called attention.
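The limitation above can be made concrete with a toy lookup table in Python. The token IDs and vector values are entirely hypothetical:

```python
# Toy embedding table: each token ID maps to one fixed vector,
# so "bank" gets the same vector in both sentences -- the
# surrounding context plays no role in the lookup.
embeddings = {
    101: [0.2, -0.5, 0.1],   # "bank"  (hypothetical token ID)
    102: [0.7, 0.3, -0.2],   # "river"
    103: [-0.1, 0.4, 0.6],   # "money"
}

sentence1_ids = [102, 101]   # "... river bank"
sentence2_ids = [103, 101]   # "... money ... bank ..."

vec1 = embeddings[sentence1_ids[-1]]
vec2 = embeddings[sentence2_ids[-1]]
# Identical vectors despite different contexts -- this is the gap
# that the attention mechanism fills.
```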

In GPT-3 architecture, before the attention block, there is a step called positional encoding.

Step 3: Positional encoding

Positional encoding is a simple vector addition: positional vectors are added to the word
embeddings, and the resulting vectors have positional information encoded in them.
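A sketch of this step, using the fixed sinusoidal encoding from the original Transformer paper (GPT-3 itself learns its position embeddings during training, but the idea of a position-dependent vector added to each word embedding is the same):

```python
import math

def positional_encoding(position, d_model):
    """Sinusoidal positional encoding: even dimensions use sine,
    odd dimensions cosine, at wavelengths that vary with the index."""
    pe = []
    for i in range(d_model):
        angle = position / (10000 ** (2 * (i // 2) / d_model))
        pe.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
    return pe

def add_position(word_embedding, position):
    """Positional encoding is just vector addition."""
    pe = positional_encoding(position, len(word_embedding))
    return [w + p for w, p in zip(word_embedding, pe)]
```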

Step 4: Attention

Positionally encoded vectors are mapped into different dimensions by matrix multiplication with
the Q, K and V matrices; the model is free to learn these matrices during training. The scaled dot
product of Q and K produces scores. A new context-aware vector is calculated as a weighted sum of
the values (V), where the weights are the scores generated by the dot product of Q and K. Each
word embedding is now represented by a context-aware vector.

In GPT-3, instead of one attention block, there are multiple attention heads with different Q, K, V
matrices, helping the model learn more complex relationships between words. Since it is multi-head
attention, each attention head produces a set of context-aware vectors corresponding to each word
embedding. The context-aware vectors from each head are concatenated together and multiplied by a
weight matrix to generate one context-aware vector per word embedding.
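A single attention head can be sketched in plain Python (real implementations use optimized tensor libraries, and the learned projections that produce Q, K, V are omitted here):

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def matmul(a, b):
    """Multiply matrix a (n x k) by matrix b (k x m), as lists of rows."""
    return [[sum(a[i][t] * b[t][j] for t in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def attention(Q, K, V):
    """Scaled dot-product attention: weights = softmax(QK^T / sqrt(d_k)),
    output = weights @ V. Each output row is a context-aware vector,
    a weighted sum of the value vectors."""
    d_k = len(K[0])
    K_t = [list(col) for col in zip(*K)]          # transpose K
    scores = matmul(Q, K_t)
    scores = [[s / math.sqrt(d_k) for s in row] for row in scores]
    weights = [softmax(row) for row in scores]
    return weights, matmul(weights, V)
```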

Step 5: Add and normalize, FFNN

The context-aware vectors are passed into an add-and-normalize block, where they are added to the
positionally encoded word embeddings and normalized so that there is no information decay. The
output of this block is fed into a feed-forward neural network (FFNN) followed by another
add-and-normalize layer.

The self-attention block, the add-and-normalize block, the FFNN and another add-and-normalize
block together form one transformer block. There can be many such transformer blocks stacked.

The output after many such transformer blocks is a set of context-aware vectors (also called
hidden states). To generate the next word, the hidden state of the last word is used.

How to convert this context aware vector into a next word prediction?

The last word’s hidden state (context-aware vector) is mapped into a vector the size of the
vocabulary by passing it through a FFNN. The resulting vector is passed through a softmax
function, producing another vector of the same size whose values correspond to probabilities for
each word in the vocabulary.

The word (token) with the highest probability can be chosen as the prediction for the next word in
the sentence. The generated word can then be fed back as input to the model to generate the next
few words.
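The final projection and greedy pick can be sketched as follows; the tiny vocabulary and logit values are hypothetical stand-ins for the real 50,000-plus-token vocabulary:

```python
import math

def softmax(logits):
    """Turn raw logits into a probability distribution."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    s = sum(exps)
    return [e / s for e in exps]

# Hypothetical vocabulary and logits for the last word's hidden state
# after the final FFNN projection.
vocab = ["cat", "milk", "street", "tree"]
logits = [1.2, 0.3, 2.1, -0.5]

probs = softmax(logits)
next_word = vocab[probs.index(max(probs))]  # greedy: pick the argmax
```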
As we can see, lower values of temperature produce more deterministic completions. As the
temperature increases, the GPT-3 model takes more risks and produces more varied, interesting
completions.

top_p

This nucleus sampling parameter is an alternative to the temperature parameter. If this parameter
is set to 0.1, the model considers only the tokens comprising the top 10% of probability mass.

It is recommended to change either temperature or top_p but not both.
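The effect of both parameters can be sketched in plain Python; this is an illustration of the sampling idea, not the API's internal implementation:

```python
import math

def softmax_with_temperature(logits, temperature=1.0):
    """Lower temperature sharpens the distribution (more deterministic
    completions); higher temperature flattens it (more varied ones)."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    s = sum(exps)
    return [e / s for e in exps]

def top_p_filter(probs, p=0.9):
    """Nucleus sampling: keep the smallest set of tokens whose cumulative
    probability reaches p, zero out the rest, and renormalize."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept, total = [], 0.0
    for i in order:
        kept.append(i)
        total += probs[i]
        if total >= p:
            break
    out = [0.0] * len(probs)
    for i in kept:
        out[i] = probs[i] / total
    return out
```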

n

Number of completions to be generated for each prompt.


Note: This parameter should be set appropriately, since higher values of n generate many
completions, which quickly consume the 2048-token quota available.

Stream

If set to true, tokens are sent in the response as they become available. Once the response is
complete, the stream is terminated with a data: [DONE] message.

Echo

If set to true, this will echo back the input prompt text along with the generated completion.

stop

Up to 4 sequences where the API will stop generating further tokens. The returned text will not
contain the stop sequence.

Presence penalty

Number between 0 and 1 that penalizes new tokens based on whether they appear in the text so far.
Increases the model's likelihood to talk about new topics.

Frequency penalty

Number between 0 and 1 that penalizes new tokens based on their existing frequency in the text so
far. Decreases the model's likelihood to repeat the same line verbatim.
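The effect of both penalties can be sketched as an adjustment to the logits before sampling. This is a simplified illustration of the idea (the API applies it internally on real BPE tokens, not whitespace-split words):

```python
def apply_penalties(logits, vocab, generated_text,
                    presence_penalty=0.0, frequency_penalty=0.0):
    """Subtract penalties from the logits of tokens already generated:
    the presence penalty once for any token that has appeared at all,
    and the frequency penalty scaled by how often it has appeared."""
    counts = {}
    for tok in generated_text.split():
        counts[tok] = counts.get(tok, 0) + 1
    adjusted = []
    for logit, tok in zip(logits, vocab):
        c = counts.get(tok, 0)
        if c > 0:
            logit -= presence_penalty + frequency_penalty * c
        adjusted.append(logit)
    return adjusted
```

Lowering the logits of repeated tokens makes them less likely to be sampled again, which is why these settings discourage verbatim repetition.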

best_of

GPT-3 generates the number of completions specified in the best_of parameter and returns the best
one, i.e. the completion with the highest log probability per token.

Results generated cannot be streamed.

Note: This parameter should be set appropriately, since higher values of best_of generate many
completions, which quickly consume the available token quota. best_of should be greater than n.

logprobs

Returns the log probabilities of the most likely tokens. For example, if the logprobs parameter is
set to 6, the API response includes the six most likely tokens along with their log probabilities.
