Text generation - OpenAI API

Text generation models


New capabilities launched at DevDay

Text generation models are now capable of JSON mode and Reproducible outputs.
We also launched the Assistants API to enable you to build agent-like experiences on
top of our text-generation models. GPT-4 Turbo is available in preview by specifying
gpt-4-1106-preview as the model name.

OpenAI's text generation models (often called generative pre-trained transformers or
large language models) have been trained to understand natural language, code, and
images. The models provide text outputs in response to their inputs. The inputs to these
models are also referred to as "prompts". Designing a prompt is essentially how you
“program” a large language model, usually by providing instructions or some
examples of how to successfully complete a task.

Using OpenAI's text generation models, you can build applications to:

Draft documents
Write computer code
Answer questions about a knowledge base
Analyze texts
Give software a natural language interface
Tutor in a range of subjects
Translate languages
Simulate characters for games

With the release of gpt-4-vision-preview , you can now build systems that also
process and understand images.

Explore GPT-4 with image inputs


Check out the vision guide for more detail.

GPT-4 Turbo
Try out GPT-4 Turbo in the playground.

To use one of these models via the OpenAI API, you’ll send a request containing the inputs
and your API key, and receive a response containing the model’s output. Our latest
models, gpt-4 and gpt-3.5-turbo , are accessed through the chat completions API
endpoint.

MODEL FAMILIES                     API ENDPOINT

Newer models (2023–)               https://api.openai.com/v1/chat/completions
  gpt-4, gpt-4 turbo, gpt-3.5-turbo

Updated legacy models (2023)       https://api.openai.com/v1/completions
  gpt-3.5-turbo-instruct, babbage-002, davinci-002

Legacy models (2020–2022)          https://api.openai.com/v1/completions
  text-davinci-003, text-davinci-002, davinci, curie, babbage, ada

You can experiment with various models in the chat playground. If you’re not sure which
model to use, then use gpt-3.5-turbo or gpt-4 .

Chat Completions API


Chat models take a list of messages as input and return a model-generated message as
output. Although the chat format is designed to make multi-turn conversations easy, it’s
just as useful for single-turn tasks without any conversation.

An example Chat Completions API call looks like the following:

python

from openai import OpenAI
client = OpenAI()

response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Who won the world series in 2020?"},
        {"role": "assistant", "content": "The Los Angeles Dodgers won the World Series in 2020."},
        {"role": "user", "content": "Where was it played?"}
    ]
)

To learn more, you can view the full API reference documentation for the Chat API.

The main input is the messages parameter. Messages must be an array of message
objects, where each object has a role (either "system", "user", or "assistant") and content.
Conversations can be as short as one message or many back and forth turns.

Typically, a conversation is formatted with a system message first, followed by alternating
user and assistant messages.

The system message helps set the behavior of the assistant. For example, you can modify
the personality of the assistant or provide specific instructions about how it should behave
throughout the conversation. Note, however, that the system message is optional and the
model’s behavior without a system message is likely to be similar to using a generic
message such as "You are a helpful assistant."

The user messages provide requests or comments for the assistant to respond to.
Assistant messages store previous assistant responses, but can also be written by you to
give examples of desired behavior.

Including conversation history is important when user instructions refer to prior
messages. In the example above, the user’s final question of "Where was it played?" only
makes sense in the context of the prior messages about the World Series of 2020.
Because the models have no memory of past requests, all relevant information must be
supplied as part of the conversation history in each request. If a conversation cannot fit
within the model’s token limit, it will need to be shortened in some way.

To mimic the effect seen in ChatGPT where the text is returned iteratively, set
the stream parameter to true.
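
For example, a minimal streaming sketch with the Python SDK (the model and prompt here are illustrative):

python

from openai import OpenAI
client = OpenAI()

# With stream=True, the API returns chunks as tokens are generated.
stream = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Write a haiku about the ocean."}],
    stream=True,
)
for chunk in stream:
    # Each chunk carries an incremental delta; content can be None on some chunks.
    if chunk.choices[0].delta.content is not None:
        print(chunk.choices[0].delta.content, end="")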

Chat Completions response format

An example Chat Completions API response looks as follows:


{
  "choices": [
    {
      "finish_reason": "stop",
      "index": 0,
      "message": {
        "content": "The 2020 World Series was played in Texas at Globe Life Field in Arlington.",
        "role": "assistant"
      }
    }
  ],
  "created": 1677664795,
  "id": "chatcmpl-7QyqpwdfhqwajicIEznoc6Q47XAyW",
  "model": "gpt-3.5-turbo-0613",
  "object": "chat.completion",
  "usage": {
    "completion_tokens": 17,
    "prompt_tokens": 57,
    "total_tokens": 74
  }
}

The assistant’s reply can be extracted with:

python

response.choices[0].message.content

Every response will include a finish_reason . The possible values for finish_reason
are:

stop : API returned complete message, or a message terminated by one of the stop
sequences provided via the stop parameter
length : Incomplete model output due to max_tokens parameter or token limit

function_call : The model decided to call a function

content_filter : Omitted content due to a flag from our content filters

null : API response still in progress or incomplete
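
As a sketch, a caller might branch on this value before using the output (continuing from the response object in the example above):

python

# Hypothetical handling of finish_reason; `response` is from the earlier example.
choice = response.choices[0]
if choice.finish_reason == "stop":
    print(choice.message.content)
elif choice.finish_reason == "length":
    # The reply was cut off by max_tokens or the model's context limit.
    print("Truncated reply:", choice.message.content)
elif choice.finish_reason == "content_filter":
    print("Content was omitted by the content filter.")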


Depending on input parameters, the model response may include different information.

JSON mode New

A common way to use Chat Completions is to instruct the model to always return a JSON
object that makes sense for your use case, by specifying this in the system message.
While this does work in some cases, occasionally the models may generate output that
does not parse to valid JSON objects.

To prevent these errors and improve model performance, when calling gpt-4-1106-preview
or gpt-3.5-turbo-1106 , you can set response_format to { "type": "json_object" }
to enable JSON mode. When JSON mode is enabled, the model is
constrained to only generate strings that parse into a valid JSON object.

Important notes:

When using JSON mode, always instruct the model to produce JSON via some
message in the conversation, for example via your system message. If you don't
include an explicit instruction to generate JSON, the model may generate an
unending stream of whitespace and the request may run continually until it reaches
the token limit. To help ensure you don't forget, the API will throw an error if the string
"JSON" does not appear somewhere in the context.

The JSON in the message the model returns may be partial (i.e. cut off) if
finish_reason is length , which indicates the generation exceeded max_tokens
or the conversation exceeded the token limit. To guard against this, check
finish_reason before parsing the response.

JSON mode will not guarantee the output matches any specific schema, only that it is
valid and parses without errors.

python

from openai import OpenAI
client = OpenAI()

response = client.chat.completions.create(
    model="gpt-3.5-turbo-1106",
    response_format={ "type": "json_object" },
    messages=[
        {"role": "system", "content": "You are a helpful assistant designed to output JSON."},
        {"role": "user", "content": "Who won the world series in 2020?"}
    ]
)
print(response.choices[0].message.content)

In this example, the response includes a JSON object that looks something like the
following:

"content": "{\"winner\": \"Los Angeles Dodgers\"}"`

Note that JSON mode is always enabled when the model is generating arguments as part
of function calling.

Reproducible outputs Beta

Chat Completions are non-deterministic by default (which means model outputs may
differ from request to request). That being said, we offer some control towards
deterministic outputs by giving you access to the seed parameter and the
system_fingerprint response field.

To receive (mostly) deterministic outputs across API calls, you can:

Set the seed parameter to any integer of your choice and use the same value across
requests you'd like deterministic outputs for.
Ensure all other parameters (like prompt or temperature ) are the exact same
across requests.
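
For example, a minimal sketch of this pattern (the seed value and prompt are arbitrary):

python

from openai import OpenAI
client = OpenAI()

response = client.chat.completions.create(
    model="gpt-3.5-turbo-1106",
    messages=[{"role": "user", "content": "Name three prime numbers."}],
    seed=12345,      # reuse the same seed across requests
    temperature=0,   # keep every other parameter identical too
)
# If system_fingerprint differs between two calls, outputs may differ anyway.
print(response.system_fingerprint)
print(response.choices[0].message.content)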

Sometimes, determinism may be impacted due to necessary changes OpenAI makes to
model configurations on our end. To help you keep track of these changes, we expose the
system_fingerprint field. If this value is different, you may see different outputs due to
changes we've made on our systems.

Deterministic outputs
Explore the new seed parameter in the OpenAI cookbook

Managing tokens
Language models read and write text in chunks called tokens. In English, a token can be as
short as one character or as long as one word (e.g., a or apple ), and in some languages
tokens can be even shorter than one character or even longer than one word.


For example, the string "ChatGPT is great!" is encoded into six tokens: ["Chat",
"G", "PT", " is", " great", "!"] .

The total number of tokens in an API call affects:

How much your API call costs, as you pay per token
How long your API call takes, as writing more tokens takes more time
Whether your API call works at all, as total tokens must be below the model’s
maximum limit (4097 tokens for gpt-3.5-turbo )

Both input and output tokens count toward these quantities. For example, if your API call
used 10 tokens in the message input and you received 20 tokens in the message output,
you would be billed for 30 tokens. Note however that for some models the price per token
is different for tokens in the input vs. the output (see the pricing page for more
information).

To see how many tokens are used by an API call, check the usage field in the API
response (e.g., response.usage.total_tokens ).

Chat models like gpt-3.5-turbo and gpt-4 use tokens in the same way as the models
available in the completions API, but because of their message-based formatting, it's
more difficult to count how many tokens will be used by a conversation.

DEEP DIVE

Counting tokens for chat API calls

To see how many tokens are in a text string without making an API call, use OpenAI’s
tiktoken Python library. Example code can be found in the OpenAI Cookbook’s guide on
how to count tokens with tiktoken.
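
For instance, a short sketch with tiktoken that reproduces the token count for the example string above:

python

import tiktoken

# Look up the tokenizer that a given model uses.
encoding = tiktoken.encoding_for_model("gpt-3.5-turbo")
tokens = encoding.encode("ChatGPT is great!")
print(len(tokens))                             # 6
print([encoding.decode([t]) for t in tokens])  # ['Chat', 'G', 'PT', ' is', ' great', '!']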

Each message passed to the API consumes the number of tokens in the content, role, and
other fields, plus a few extra for behind-the-scenes formatting. This may change slightly in
the future.

If a conversation has too many tokens to fit within a model’s maximum limit (e.g., more
than 4097 tokens for gpt-3.5-turbo), you will have to truncate, omit, or otherwise shrink
your text until it fits. Beware that if a message is removed from the messages input, the
model will lose all knowledge of it.
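
One naive way to shrink a conversation, as a sketch (the helper name and message cutoff are arbitrary; a production version would count tokens, e.g. with tiktoken, rather than messages):

python

def trim_history(messages, max_turns=10):
    # Keep any system messages plus only the most recent turns.
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    return system + rest[-max_turns:]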

Note that very long conversations are more likely to receive incomplete replies. For
example, a gpt-3.5-turbo conversation that is 4090 tokens long will have its reply cut off
after just 6 tokens.

Parameter details
Frequency and presence penalties

The frequency and presence penalties found in the Chat Completions API and Legacy
Completions API can be used to reduce the likelihood of sampling repetitive sequences of
tokens. They work by directly modifying the logits (un-normalized log-probabilities) with
an additive contribution.

mu[j] -> mu[j] - c[j] * alpha_frequency - float(c[j] > 0) * alpha_presence

Where:

mu[j] is the logits of the j-th token

c[j] is how often that token was sampled prior to the current position

float(c[j] > 0) is 1 if c[j] > 0 and 0 otherwise

alpha_frequency is the frequency penalty coefficient

alpha_presence is the presence penalty coefficient

As we can see, the presence penalty is a one-off additive contribution that applies to all
tokens that have been sampled at least once and the frequency penalty is a contribution
that is proportional to how often a particular token has already been sampled.

Reasonable values for the penalty coefficients are around 0.1 to 1 if the aim is to just reduce
repetitive samples somewhat. If the aim is to strongly suppress repetition, then one can
increase the coefficients up to 2, but this can noticeably degrade the quality of samples.
Negative values can be used to increase the likelihood of repetition.
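
In the API, these penalties correspond to the frequency_penalty and presence_penalty request parameters. A minimal sketch using the moderate values discussed above:

python

from openai import OpenAI
client = OpenAI()

response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "List some synonyms for 'happy'."}],
    frequency_penalty=0.5,  # penalty grows with how often a token has been sampled
    presence_penalty=0.5,   # flat penalty on any token that has appeared at all
)
print(response.choices[0].message.content)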

Completions API Legacy

The completions API endpoint received its final update in July 2023 and has a different
interface than the new chat completions endpoint. Instead of the input being a list of
messages, the input is a freeform text string called a prompt .

An example API call looks as follows:


python

from openai import OpenAI
client = OpenAI()

response = client.completions.create(
    model="gpt-3.5-turbo-instruct",
    prompt="Write a tagline for an ice cream shop."
)

See the full API reference documentation to learn more.

Token log probabilities

The completions API can provide a limited number of log probabilities associated with the
most likely tokens for each output token. This feature is controlled by using the logprobs
field. This can be useful in some cases to assess the confidence of the model in its output.
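
As a sketch, requesting the top alternatives for each sampled token (the prompt is illustrative):

python

from openai import OpenAI
client = OpenAI()

response = client.completions.create(
    model="gpt-3.5-turbo-instruct",
    prompt="The capital of France is",
    max_tokens=1,
    logprobs=5,  # return log probabilities for the 5 most likely tokens at each position
)
print(response.choices[0].logprobs.top_logprobs)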

Inserting text

The completions endpoint also supports inserting text by providing a suffix in addition to
the standard prompt which is treated as a prefix. This need naturally arises when writing
long-form text, transitioning between paragraphs, following an outline, or guiding the
model towards an ending. This also works on code, and can be used to insert in the middle
of a function or file.
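
A sketch of the request shape, assuming a completions model that accepts the suffix parameter (not all models do):

python

from openai import OpenAI
client = OpenAI()

response = client.completions.create(
    model="gpt-3.5-turbo-instruct",  # assumption: substitute a model that supports suffix
    prompt="def fibonacci(n):\n",
    suffix="\n\nprint(fibonacci(10))",
    max_tokens=64,
)
print(response.choices[0].text)  # completion generated between prompt and suffix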

DEEP DIVE

Inserting text

Completions response format

An example completions API response looks as follows:

{
  "choices": [
    {
      "finish_reason": "length",
      "index": 0,
      "logprobs": null,
      "text": "\n\n\"Let Your Sweet Tooth Run Wild at Our Creamy Ice Cream Shack"
    }
  ],
  "created": 1683130927,
  "id": "cmpl-7C9Wxi9Du4j1lQjdjhxBlO22M61LD",
  "model": "gpt-3.5-turbo-instruct",
  "object": "text_completion",
  "usage": {
    "completion_tokens": 16,
    "prompt_tokens": 10,
    "total_tokens": 26
  }
}

In Python, the output can be extracted with response.choices[0].text .

The response format is similar to the response format of the Chat Completions API but
also includes the optional field logprobs .

Chat Completions vs. Completions


The Chat Completions format can be made similar to the completions format by
constructing a request using a single user message. For example, one can translate from
English to French with the following completions prompt:

Translate the following English text to French: "{text}"

And an equivalent chat prompt would be:

[{"role": "user", "content": 'Translate the following English text to

Likewise, the completions API can be used to simulate a chat between a user and an
assistant by formatting the input accordingly.

The difference between these APIs is the underlying models that are available in each. The
chat completions API is the interface to our most capable model ( gpt-4 ), and our most
cost effective model ( gpt-3.5-turbo ).

Which model should I use?

We generally recommend that you use either gpt-4 or gpt-3.5-turbo . Which of these
you should use depends on the complexity of the tasks you are using the models for.
gpt-4 generally performs better on a wide range of evaluations. In particular, gpt-4 is more
capable at carefully following complex instructions. By contrast, gpt-3.5-turbo is more
likely to follow just one part of a complex multi-part instruction. gpt-4 is less likely than
gpt-3.5-turbo to make up information, a behavior known as "hallucination". gpt-4
also has a larger context window with a maximum size of 8,192 tokens compared to 4,096
tokens for gpt-3.5-turbo . However, gpt-3.5-turbo returns outputs with lower
latency and costs much less per token.

We recommend experimenting in the playground to investigate which models provide the
best price-performance trade-off for your usage. A common design pattern is to use
several distinct query types which are each dispatched to the model appropriate to handle
them.

Prompt engineering
An awareness of the best practices for working with OpenAI models can make a
significant difference in application performance. The failure modes that each exhibit and
the ways of working around or correcting those failure modes are not always intuitive.
There is an entire field related to working with language models which has come to be
known as "prompt engineering", but as the field has progressed its scope has outgrown
merely engineering the prompt into engineering systems that use model queries as
components. To learn more, read our guide on prompt engineering which covers methods
to improve model reasoning, reduce the likelihood of model hallucinations, and more. You
can also find many useful resources including code samples in the OpenAI Cookbook.

FAQ

How should I set the temperature parameter?

Lower values for temperature result in more consistent outputs (e.g. 0.2), while higher
values generate more diverse and creative results (e.g. 1.0). Select a temperature value
based on the desired trade-off between coherence and creativity for your specific
application. The temperature can range from 0 to 2.
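
As a quick sketch of this trade-off (the prompt and values are illustrative):

python

from openai import OpenAI
client = OpenAI()

prompt = [{"role": "user", "content": "Suggest a name for a coffee shop."}]
consistent = client.chat.completions.create(
    model="gpt-3.5-turbo", messages=prompt, temperature=0.2,  # similar output across runs
)
creative = client.chat.completions.create(
    model="gpt-3.5-turbo", messages=prompt, temperature=1.0,  # more varied output
)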

Is fine-tuning available for the latest models?

Yes, for some. Currently, you can only fine-tune gpt-3.5-turbo and our updated base
models ( babbage-002 and davinci-002 ). See the fine-tuning guide for more details on
how to use fine-tuned models.

Do you store the data that is passed into the API?


As of March 1st, 2023, we retain your API data for 30 days but no longer use your data sent
via the API to improve our models. Learn more in our data usage policy. Some endpoints
offer zero retention.

How can I make my application safer?


If you want to add a moderation layer to the outputs of the Chat API, you can follow our
moderation guide to prevent content that violates OpenAI’s usage policies from being
shown.

Should I use ChatGPT or the API?


ChatGPT offers a chat interface to the models in the OpenAI API and a range of built-in
features such as integrated browsing, code execution, plugins, and more. By contrast,
using OpenAI’s API provides more flexibility.
