How ChatGPT and Our Language Models Are Developed - OpenAI Help Center

ChatGPT and other OpenAI language models are developed using information from three sources: publicly available internet sources, licensed third-party information, and information from users and trainers. Models are trained by reading vast amounts of publicly available text to learn word associations, without copying or storing the actual text. Personal information may be included incidentally in public texts used for training, but is not actively sought out and is not used to profile or target individuals. OpenAI takes steps to ensure the lawful and responsible use of personal information in model training in compliance with privacy laws.

Uploaded by

Nandika Khator

13/12/2023, 10:35 How ChatGPT and Our Language Models Are Developed | OpenAI Help Center

How ChatGPT and Our Language Models Are Developed

Written by Michael Schade
Updated over a week ago

OpenAI’s large language models, including the models that power ChatGPT, are
developed using three primary sources of information: (1) information that is publicly
available on the internet, (2) information that we license from third parties, and (3)
information that our users or our human trainers provide.
This article provides an overview of the publicly available information we use to help
develop our models and how we collect and use that information in compliance with
privacy laws. To understand how we collect and use information from users of our
services, including how to opt out of having ChatGPT conversations used to help teach
our models, please see our Privacy Policy and this help center article.
What is ChatGPT, and how does it work?
ChatGPT is an artificial intelligence-based service that you can access via the internet.
You can use ChatGPT to organize or summarize text, or to write new text. ChatGPT has
been developed in a way that allows it to understand and respond to user questions and
instructions. It does this by “reading” a large amount of existing text and learning how
words tend to appear in context with other words. It then uses what it has learned to
predict the next most likely word that might appear in response to a user request, and
each subsequent word after that. This is similar to auto-complete capabilities on search
engines, smartphones, and email programs.
As an example, during the model learning process (called “training”), we might have a
model try to complete the sentence: “instead of turning left, she turned ___.” Before
training, the model will respond with random words, but as it reads and learns from
many lines of text, it better understands this type of sentence and can predict the next
word more accurately. It then repeats this process across a very large number of
sentences.
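The idea of learning which words tend to follow others can be illustrated with a very small sketch. This is not how ChatGPT is built: it replaces a neural network with a simple table of word-pair counts, and the three-sentence corpus and the function names `train` and `predict_next` are invented for this illustration.

```python
from collections import Counter, defaultdict

def train(sentences):
    """Count how often each word follows each other word."""
    follows = defaultdict(Counter)
    for sentence in sentences:
        words = sentence.lower().split()
        for prev, nxt in zip(words, words[1:]):
            follows[prev][nxt] += 1
    return follows

def predict_next(follows, word):
    """Return the most frequent follower of `word`, or None if unseen."""
    counts = follows.get(word.lower())
    if not counts:
        return None
    return counts.most_common(1)[0][0]

# Hypothetical toy corpus, invented for this illustration.
corpus = [
    "instead of turning left she turned right",
    "he turned right at the corner",
    "she turned around and waved",
]
model = train(corpus)
print(predict_next(model, "turned"))  # prints "right"
```

With more text, the counts shift and the predictions improve, which is the same general pattern the article describes at a far larger scale.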
Because there are many possible words that could come next in this sentence (e.g.,
instead of turning left, she turned “right,” “around,” or “back”), there is an element of
randomness in the way a model can respond, and in many cases our models will answer
the same question in different ways.
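One way to picture this randomness is as a weighted draw among candidate words. The sketch below assumes made-up probabilities for the three words in the example; the numbers and the helper `sample_next` are hypothetical and are not taken from any real model.

```python
import random

# Hypothetical probabilities for the word after "she turned"; invented numbers.
next_word_probs = {"right": 0.6, "around": 0.25, "back": 0.15}

def sample_next(probs, rng):
    """Draw one word at random, weighted by its probability."""
    words = list(probs)
    weights = [probs[w] for w in words]
    return rng.choices(words, weights=weights, k=1)[0]

rng = random.Random(0)  # fixed seed only so the run is repeatable
completions = [sample_next(next_word_probs, rng) for _ in range(5)]
print(completions)
```

Because each draw is random, repeated calls can return different words for the same prompt, with more probable words appearing more often.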
Machine learning models are made up of large strings of numbers, called “weights” or
“parameters,” and code that interprets and executes those numbers. Models do not
contain or store copies of information that they learn from. Instead, as a model learns,
some of the numbers that make up the model change slightly to reflect what it has
learned. In the example above, the model read information that helped it improve from
predicting random incorrect words to predicting more accurate words, but all that
actually happened in the model itself was that the numbers changed slightly. The model
did not store or copy the sentences that it read.
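A rough sketch of "the numbers changed slightly" is a few steps of gradient descent on three scores, one per candidate word. This is a deliberately tiny stand-in, assuming a softmax over three candidates and a learning rate chosen for illustration; real models have vastly more parameters and a different architecture.

```python
import math

def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

candidates = ["right", "banana", "purple"]  # hypothetical candidate words
weights = [0.0, 0.0, 0.0]  # untrained: every candidate equally likely
target = candidates.index("right")
learning_rate = 0.5  # assumed value for this sketch

before = softmax(weights)[target]
for _ in range(20):
    probs = softmax(weights)
    # Cross-entropy gradient for softmax: probability minus one-hot target.
    for i in range(len(weights)):
        grad = probs[i] - (1.0 if i == target else 0.0)
        weights[i] -= learning_rate * grad
after = softmax(weights)[target]

print(weights)        # still just three numbers
print(before, after)  # probability of "right" before and after training
```

After the updates, the only thing that differs is the values in `weights`; the training sentence itself appears nowhere in them.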
What type of information is used to teach ChatGPT?
As noted above, ChatGPT and our other services are developed using (1) information
that is publicly available on the internet, (2) information that we license from third
parties, and (3) information that our users or human trainers provide. This article
focuses on the first set: information that is publicly available on the internet.
For this set of information, we use only information that is freely and openly available
on the internet. For example, we do not seek information behind paywalls or from the
“dark web.” We apply filters and remove information that we do not
want our models to learn from or output, such as hate speech, adult content, sites that
primarily aggregate personal information, and spam. We then use the information to
teach our models.
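As a purely illustrative sketch of category-based filtering, the code below drops pages that carry any excluded label. The article does not describe how the filters are implemented, so everything here is an assumption: the `Page` type, the label names, and the idea that labels come from some upstream classifier are all hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Page:
    url: str
    text: str
    labels: frozenset  # hypothetical category labels from an upstream classifier

# Categories the article says are filtered out; the label names are invented.
EXCLUDED = frozenset(
    {"hate_speech", "adult_content", "personal_info_aggregator", "spam"}
)

def keep(page):
    """Keep a page only if it carries none of the excluded labels."""
    return page.labels.isdisjoint(EXCLUDED)

pages = [
    Page("https://example.com/recipes", "How to bake bread", frozenset()),
    Page("https://example.com/people-search", "Find anyone's address",
         frozenset({"personal_info_aggregator"})),
    Page("https://example.com/win-now", "Click here!!!", frozenset({"spam"})),
]
kept = [p.url for p in pages if keep(p)]
print(kept)  # prints ['https://example.com/recipes']
```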
As mentioned in the previous section, ChatGPT does not copy or store training
information in a database. Instead, it learns about associations between words, and
those learnings help the model update its numbers/weights. The model then uses those
weights to predict and generate new words in response to a user request. It does not
“copy and paste” training information – much like a person who has read a book and
sets it down, our models do not have access to training information after they have
learned from it.
Is personal information used to teach ChatGPT?
A large amount of data on the internet relates to people, so our training information
does incidentally include personal information. We don’t actively seek out personal
information to train our models.
We use training information only to help our models learn about language and how
to understand and respond to it. We do not and will not use any personal
information in training information to build profiles about people, to contact them,
to advertise to them, to try to sell them anything, or to sell the information itself.
Our models may learn from personal information to understand how things like names
and addresses fit within language and sentences, or to learn about famous people and
public figures. This makes our models better at providing relevant responses.
How does the development of ChatGPT comply with privacy laws?
We use training information lawfully. Large language models have many applications
that provide significant benefits and are already helping people create content, improve
customer service, develop software, customize education, support scientific research,
and much more. These benefits cannot be realized without a large amount of
information to teach the models. In addition, our use of training information is not meant
to negatively impact individuals, and the sources of this training information are already
publicly available. For these reasons, we base our collection and use of personal
information that is included in training information on legitimate interests according to
privacy laws like the GDPR. To fulfill our compliance obligations, we have also
completed a data protection impact assessment to help ensure we are collecting and
using this information legally and responsibly.
We respond to objection requests and similar rights. As a result of learning
language, ChatGPT responses may sometimes include personal information about
individuals whose personal information appears multiple times on the public internet
(for example, public figures). Individuals in certain jurisdictions can object to the
processing of their personal information by our models by filling out this form.
Individuals also may have the right to access, correct, restrict, delete, or transfer their
personal information that may be included in our training information. You can exercise
these rights by reaching out to [email protected].

Please be aware that, in accordance with privacy laws, some rights may not be
absolute. We may decline a request if we have a lawful reason for doing so. However,
we strive to prioritize the protection of personal information and comply with all
applicable privacy laws. If you feel we have not adequately addressed an issue, you
have the right to lodge a complaint with your local supervisory authority.
We protect training information and limit how it is used and shared. To keep this
information safe, we use commercially reasonable technical, physical, and
administrative measures like access controls, audit logs, read-only permissions, and
encrypting stored data. For more information on our security practices, please visit
https://www.openai.com/security.
We also take steps to reduce the processing of personal information when training our
models. For example, we remove websites that aggregate large volumes of personal
information and we try to train our models to reject requests for private or sensitive
information about people.
We do not sell training information to third parties, and only disclose portions of the
information when necessary and consistent with our Privacy Policy.
We only keep this information for as long as we need it to serve its intended
purpose. How long we keep this information hinges on factors like its quantity, type,
and sensitivity, the risk of harm from unauthorized use or sharing, whether the
information is still necessary or useful to train or update our models, and any legal
requirements.
Our data controller under the GDPR is OpenAI OpCo, LLC at 3180 18th Street, San
Francisco, CA, United States. For information about our EEA and UK representative for
data protection matters, please see our Privacy Policy. Our Data Protection Officer can
be contacted at [email protected].

https://help.openai.com/en/articles/7842364-how-chatgpt-and-our-language-models-are-developed
