OpenAI O1 and New Tools For Developers - OpenAI
OpenAI O1 and New Tools For Developers - OpenAI
Today we’re introducing more capable models, new tools for customization, and
upgrades that improve performance, flexibility, and cost-efficiency for developers
building with AI. This includes:
• OpenAI o1 in the API, with support for function calling, developer messages,
Structured Outputs, and vision capabilities.
• Realtime API updates, including simple WebRTC integration, a 60% price reduction
for GPT-4o audio, and support for GPT-4o mini at one-tenth of previous audio rates.
OpenAI o1, our reasoning model designed to handle complex multi-step tasks with
advanced accuracy, is rolling out to developers on usage tier 5in the API. o1 is the
successor to OpenAI o1-preview, which developers have already used to build agentic
applications to streamline customer support, optimize supply chain decisions, and
forecast complex financial trends.
• Structured Outputs: Generate responses that reliably adhere to your custom JSON
Schema.
• Developer messages: Specify instructions or context for the model to follow, such as
defining tone, style and other behavioral guidance.
• Vision capabilities: Reason over images to unlock many more applications in science,
manufacturing, or coding, where visual inputs matter.
• Lower latency: o1 uses on average 60% fewer reasoning tokens than o1-preview for a
given request.
• A new `reasoning_effort` API parameter allows you to control how long the model
thinks before answering.
0.9
0.8
0.7
0.6
Accuracy
0.5
0.4
0.3
0.2
0.1
0.0
s
ts
ng
4
ut
02
lin
pu
di
tp
al
-2
ut
co
ou
22
-o
n-
-
ch
d-
20
ed
tio
re
en
ur
nc
E
tu
eb
M
ct
-fu
uc
AI
ru
liv
tr
al
st
-s
rn
d-
al
te
an
rn
in
g-
te
in
lin
al
c
n-
tio
nc
-fu
al
rn
te
in
We are rolling out access incrementally while working to expand access to additional
usage tiers and ramping up rate limits. To get started, check out the API documentation.
WebRTC support
We’re introducing WebRTCsupport for the Realtime API. WebRTC is an open standard
that makes it easier to build and scale real-time voice products across platforms—
whether for browser-based apps, mobile clients, IoT devices, or direct server-to-server
setups.
With WebRTC, you can now add Realtime capabilities with just a handful of lines of
Javascript:
JavaScript
We’re also bringing GPT-4o mini to the Realtime API beta as gpt-4o-mini-realtime-
preview-2024-12-17 . GPT-4o mini is our most cost-efficient small model and brings
the same rich voice experiences to the Realtime API as GPT-4o. GPT-4o mini audio price
is $10/1M input tokens and $20/1M output tokens. Text tokens are priced at $0.60/1M
input tokens and $2.40/1M output tokens. Cached audio and text both cost $0.30/1M
tokens.
These snapshots are available in the Realtime APIand also in the Chat Completions API
as gpt-4o-audio-preview-2024-12-17 and gpt-4o-mini-audio-preview-2024-12-
17 .
We’re shipping the following features to the Realtime API to make it easier to deliver
exceptional voice-driven experiences:
• Custom input contextto specify which conversation items to include as model input.
For example, run a moderation check on just the user’s last utterance or re-use a past
response without permanently altering the session state.
• Controlled response timingto use server-side Voice Activity Detection (VAD) without
automatically triggering a response. For instance, gather necessary data such as
account details and add it to the model’s context before manually initiating a voice
reply, offering more control over timing and accuracy.
Preference Fine-Tuning
The fine-tuning API now supports Preference Fine-Tuningto make it easy to customize
models based on user and developer preferences. This method uses Direct Preference
Optimization (DPO)to compare pairs of model responses, teaching the model to
distinguish between preferred and non-preferred outputs. By learning from pairwise
comparisons rather than fixed targets, Preference Fine-Tuning is especially effective for
subjective tasks where tone, style, and creativity matter.
There are some key differences between Preference Fine-Tuning and Supervised Fine-
Tuning, as shown below.
Training data Exact input and output Pairs of preferred and non-
pairs preferred model output, via
human annotation, A/B
testing, or synthetic data
generation
We started testing Preference Fine-Tuning with trusted partners who have seen
promising results so far. For example, Rogo AIis building an AI assistant for financial
analysts that breaks down complex queries into sub-queries. Using their expert-built
benchmark, Rogo-Golden, they found that while Supervised Fine-Tuning faced
challenges with out-of-distribution query expansion—such as missing metrics like ARR
for queries like “how fast is company X growing”—Preference Fine-Tuning resolved
these issues, improving performance from 75% accuracy in the base model to over 80%.
Preference Fine-Tuning will roll out today for gpt-4o-2024-08-06 and will be available
for gpt-4o-mini-2024-07-18 soon. It will be available at the same price per trained
token as Supervised Fine-Tuning, with support for our newest models coming early next
year. For more information, visit our fine-tuning guidein the API documentation.
Finally, we’re introducing two new official SDKs for Goand Javain beta, in addition to our
existing official Python, Node.js and .NET libraries. Our goal is for OpenAI APIs to be
easy to use, no matter what programming language you choose.
Go is a statically typed language ideal for handling concurrency and building scalable
APIs and backend systems. The OpenAI Go SDK makes it easy to interact with OpenAI
models in your Go code.
Go
1 client := openai.NewClient()
2 ctx := context.Background()
3 prompt := "Write me a haiku about Golang."
4
5 completion, err := client.Chat.Completions.New(
6 ctx,
7 openai.ChatCompletionNewParams{
8 Messages: openai.F(
9 []openai.ChatCompletionMessageParamUnion{
10 openai.UserMessage(prompt),
11 },
12 ),
13 Model: openai.F(openai.ChatModelGPT4o),
14 },
15 )
For more information on the Go SDK, check out the README on GitHub.
Java has been a staple of enterprise software development, favored for its type system
and massive ecosystem of open-source libraries. The OpenAI Java SDK provides typed
request and response objects, and helpful utilities to manage API requests.
Java
For more information on the Java SDK, check out the README on GitHub.
Conclusion
We’re excited to see what you’ll build with these updates—whether it’s new voice apps,
fine-tuned models, or agentic applications that push the boundaries of what’s possible.
Check out the detailed guides for o1, Realtime API, WebRTC integration, and Preference
Fine-Tuningin our API documentation to dive deeper and start experimenting today.
Have questions? Connect with our team on the OpenAI Developer Forum.
Announcements
Authors
OpenAI
Our research
Overview
Index
Latest advancements
OpenAI o1
OpenAI o1-mini
GPT-4
GPT-4o mini
DALL·E 3
Sora
ChatGPT
For Everyone
For Teams
For Enterprises
ChatGPT login
Download
API
Platform overview
Pricing
Documentation
API login
Explore more
Stories
Safety overview
Safety overview
Company
About us
News
Our Charter
Security
Residency
Careers
Terms of use
Privacy policy
Brand guidelines
Other policies
English (US)
OpenAI © 2015–2025