
OpenAI o1 and new tools for developers

Introducing OpenAI o1, Realtime API improvements, a new fine-tuning method and more for developers.

Today we’re introducing more capable models, new tools for customization, and
upgrades that improve performance, flexibility, and cost-efficiency for developers
building with AI. This includes:

• OpenAI o1 in the API, with support for function calling, developer messages, Structured Outputs, and vision capabilities.

• Realtime API updates, including simple WebRTC integration, a 60% price reduction for GPT-4o audio, and support for GPT-4o mini at one-tenth of previous audio rates.

• Preference Fine-Tuning, a new model customization technique that makes it easier to tailor models based on user and developer preferences.

• New Go and Java SDKs, available in beta.

OpenAI o1 in the API

OpenAI o1, our reasoning model designed to handle complex multi-step tasks with advanced accuracy, is rolling out to developers on usage tier 5 in the API. o1 is the successor to OpenAI o1-preview, which developers have already used to build agentic applications to streamline customer support, optimize supply chain decisions, and forecast complex financial trends.

o1 is production-ready with key features to enable real-world use cases, including:

• Function calling: Seamlessly connect o1 to external data and APIs.

• Structured Outputs: Generate responses that reliably adhere to your custom JSON Schema.

• Developer messages: Specify instructions or context for the model to follow, such as defining tone, style, and other behavioral guidance.

• Vision capabilities: Reason over images to unlock many more applications in science, manufacturing, or coding, where visual inputs matter.

• Lower latency: o1 uses on average 60% fewer reasoning tokens than o1-preview for a given request.

• A new `reasoning_effort` API parameter that lets you control how long the model thinks before answering (see the sketch below).
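To make these features concrete, here is a minimal sketch using the Node SDK that combines a developer message, the `reasoning_effort` parameter, and a Structured Outputs schema. The prompt and schema are illustrative, not from the announcement:

```javascript
import OpenAI from "openai";

const client = new OpenAI(); // reads OPENAI_API_KEY from the environment

const completion = await client.chat.completions.create({
  model: "o1-2024-12-17",
  reasoning_effort: "medium", // "low" | "medium" | "high"
  messages: [
    // Developer messages carry instructions for o1, analogous to system messages.
    { role: "developer", content: "You are a terse supply-chain analyst." },
    { role: "user", content: "Which of these suppliers is the bottleneck? ..." },
  ],
  // Structured Outputs: the reply must conform to this illustrative schema.
  response_format: {
    type: "json_schema",
    json_schema: {
      name: "bottleneck_report",
      strict: true,
      schema: {
        type: "object",
        properties: {
          supplier: { type: "string" },
          reason: { type: "string" },
        },
        required: ["supplier", "reason"],
        additionalProperties: false,
      },
    },
  },
});

console.log(completion.choices[0].message.content); // JSON matching the schema
```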

The snapshot of o1 we're shipping today, o1-2024-12-17, is a new post-trained version of the model we released in ChatGPT two weeks ago. It improves on areas of model behavior based on feedback, while maintaining the frontier capabilities we evaluated in our o1 System Card. We're also updating o1 in ChatGPT to this version soon. The evaluations we're sharing below reflect the performance of this new snapshot, ensuring developers have up-to-date benchmarks for this version.

o1-2024-12-17 sets new state-of-the-art results on several benchmarks, improving cost-efficiency and performance.

| Category   | Eval                | o1-2024-12-17 | o1-preview |
|------------|---------------------|---------------|------------|
| General    | GPQA diamond        | 75.7          | 73.3       |
| General    | MMLU (pass@1)       | 91.8          | 90.8       |
| Coding     | SWE-bench Verified  | 48.9          | 41.3       |
| Coding     | LiveBench (Coding)  | 76.6          | 52.3       |
| Math       | MATH (pass@1)       | 96.4          | 85.5       |
| Math       | AIME 2024 (pass@1)  | 79.2          | 42.0       |
| Math       | MGSM (pass@1)       | 89.3          | 90.8       |
| Vision     | MMMU (pass@1)       | 77.3          | —          |
| Vision     | MathVista (pass@1)  | 71.0          | —          |
| Factuality | SimpleQA            | 42.6          | 42.4       |
| Agents     | TAU-bench (retail)  | 73.5          | —          |
| Agents     | TAU-bench (airline) | 54.2          | —          |
[Figure: Model Evaluation Accuracy Across Different Metrics. Bar chart of accuracy (0.0–1.0) for gpt-4o-2024-11-20, o1-preview, o1-2024-12-17, and o1 with Structured Outputs on internal function-calling and structured-outputs evals.]
Additionally, we have observed that o1-2024-12-17 significantly outperforms gpt-4o in our function calling and Structured Outputs testing.

We are rolling out access incrementally while working to expand access to additional usage tiers and ramping up rate limits. To get started, check out the API documentation.

Improvements to the Realtime API

The Realtime API enables developers to create low-latency, natural conversational experiences. It's ideal for voice assistants, live translation tools, virtual tutors, interactive customer support systems, or even your own virtual Santa. Today we're releasing changes to address some of the most common requests from developers: a direct WebRTC integration, reduced pricing, and more control over responses.

WebRTC support

We’re introducing WebRTC⁠support for the Realtime API. WebRTC is an open standard
that makes it easier to build and scale real-time voice products across platforms—
whether for browser-based apps, mobile clients, IoT devices, or direct server-to-server
setups.

Our WebRTC integration is designed to enable smooth and responsive interactions in real-world conditions, even with variable network quality. It handles audio encoding, streaming, noise suppression, and congestion control.

With WebRTC, you can now add Realtime capabilities with just a handful of lines of JavaScript:

```javascript
async function createRealtimeSession(localStream, remoteAudioEl, token) {
  const pc = new RTCPeerConnection();
  // Play the model's audio track through the page's <audio> element.
  pc.ontrack = e => remoteAudioEl.srcObject = e.streams[0];
  // Send the user's microphone track to the model.
  pc.addTrack(localStream.getTracks()[0]);
  const offer = await pc.createOffer();
  await pc.setLocalDescription(offer);
  // POST the SDP offer to the Realtime API, then apply the SDP answer.
  const headers = { Authorization: `Bearer ${token}`, 'Content-Type': 'application/sdp' };
  const opts = { method: 'POST', body: offer.sdp, headers };
  const resp = await fetch('https://api.openai.com/v1/realtime', opts);
  await pc.setRemoteDescription({ type: 'answer', sdp: await resp.text() });
  return pc;
}
```
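As a usage sketch (the element id and `EPHEMERAL_TOKEN` are assumptions; in practice the token is a short-lived key minted by your server, as described in the API documentation):

```javascript
// Capture the microphone, pick an <audio autoplay> element, and connect.
const localStream = await navigator.mediaDevices.getUserMedia({ audio: true });
const remoteAudioEl = document.getElementById("assistant-audio"); // assumed element
const pc = await createRealtimeSession(localStream, remoteAudioEl, EPHEMERAL_TOKEN);
```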

Learn more about our WebRTC integration in the API documentation.


New GPT-4o and GPT-4o mini realtime snapshots at lower cost

We’re releasing gpt-4o-realtime-preview-2024-12-17 as part of the Realtime API


beta with improved voice quality, more reliable input (especially for dictated numbers),
and reduced costs. Due to our efficiency improvements, we’re dropping the audio token
price by 60% to $40/1M input tokens and $80/1M output tokens. Cached audio input
costs are reduced by 87.5% to $2.50/1M input tokens.

We’re also bringing GPT-4o mini to the Realtime API beta as gpt-4o-mini-realtime-
preview-2024-12-17 . GPT-4o mini is our most cost-efficient small model and brings
the same rich voice experiences to the Realtime API as GPT-4o. GPT-4o mini audio price
is $10/1M input tokens and $20/1M output tokens. Text tokens are priced at $0.60/1M
input tokens and $2.40/1M output tokens. Cached audio and text both cost $0.30/1M
tokens.

These snapshots are available in the Realtime API⁠and also in the Chat Completions API⁠
as gpt-4o-audio-preview-2024-12-17 and gpt-4o-mini-audio-preview-2024-12-
17 .
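As a hedged sketch of calling the mini audio snapshot through Chat Completions (the Node SDK client, voice choice, and prompt here are illustrative):

```javascript
const resp = await client.chat.completions.create({
  model: "gpt-4o-mini-audio-preview-2024-12-17",
  modalities: ["text", "audio"],            // request spoken output alongside text
  audio: { voice: "alloy", format: "wav" }, // output voice and container format
  messages: [{ role: "user", content: "Greet the caller in one sentence." }],
});
// resp.choices[0].message.audio.data holds base64-encoded WAV audio.
```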

More control over responses

We’re shipping the following features to the Realtime API to make it easier to deliver
exceptional voice-driven experiences:

• Concurrent out-of-band responses⁠to enable background tasks such as content


moderation or classification to run without interrupting the user’s voice interaction.

• Custom input context⁠to specify which conversation items to include as model input.
For example, run a moderation check on just the user’s last utterance or re-use a past
response without permanently altering the session state.

• Controlled response timing⁠to use server-side Voice Activity Detection (VAD) without
automatically triggering a response. For instance, gather necessary data such as
account details and add it to the model’s context before manually initiating a voice
reply, offering more control over timing and accuracy.

• Increased maximum session length⁠from 15 to 30 min.
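Here is a sketch of the first two features combined, sent as a client event over the WebRTC data channel; the `dataChannel` variable and `lastUserItemId` are assumptions, and the event shape follows the Realtime API reference:

```javascript
// Out-of-band response with custom input context: classify only the user's
// last utterance, without adding anything to the ongoing conversation.
dataChannel.send(JSON.stringify({
  type: "response.create",
  response: {
    conversation: "none",                 // out-of-band: skips session state
    metadata: { purpose: "moderation" },  // tag the result for routing
    input: [{ type: "item_reference", id: lastUserItemId }], // custom context
    instructions: "Label the referenced message as 'safe' or 'flagged'.",
  },
}));
```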

Preference Fine-Tuning

The fine-tuning API now supports Preference Fine-Tuning to make it easy to customize models based on user and developer preferences. This method uses Direct Preference Optimization (DPO) to compare pairs of model responses, teaching the model to distinguish between preferred and non-preferred outputs. By learning from pairwise comparisons rather than fixed targets, Preference Fine-Tuning is especially effective for subjective tasks where tone, style, and creativity matter.

There are some key differences between Preference Fine-Tuning and Supervised Fine-
Tuning, as shown below.

| | Supervised Fine-Tuning (SFT) | Preference Fine-Tuning (PFT) |
|---|---|---|
| Objective | Encourage the model to generate correct outputs by replicating labeled outputs | Optimize the model to favor desired behavior by reinforcing preferred responses and reducing the likelihood of unpreferred ones |
| Training data | Exact input and output pairs | Pairs of preferred and non-preferred model output, via human annotation, A/B testing, or synthetic data generation |
| Use cases | Tasks where an ideal output is easy to prepare, such as custom code format, and strict correctness is needed | Tasks where "better" responses are subjective, such as creative writing or summarization |
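Concretely, preference data for the fine-tuning API is JSONL, with each record pairing a preferred and a non-preferred completion for the same input. The content below is illustrative, and the record is wrapped here for readability; in a real file each record occupies one line:

```json
{"input": {"messages": [{"role": "user", "content": "Summarize the meeting notes in two sentences."}]},
 "preferred_output": [{"role": "assistant", "content": "A crisp two-sentence summary."}],
 "non_preferred_output": [{"role": "assistant", "content": "A rambling, five-paragraph summary."}]}
```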

We started testing Preference Fine-Tuning with trusted partners, who have seen promising results so far. For example, Rogo AI is building an AI assistant for financial analysts that breaks down complex queries into sub-queries. Using their expert-built benchmark, Rogo-Golden, they found that while Supervised Fine-Tuning struggled with out-of-distribution query expansion (such as missing metrics like ARR for queries like "how fast is company X growing"), Preference Fine-Tuning resolved these issues, improving performance from 75% accuracy in the base model to over 80%.

Preference Fine-Tuning is rolling out today for gpt-4o-2024-08-06 and will be available for gpt-4o-mini-2024-07-18 soon. It is offered at the same price per trained token as Supervised Fine-Tuning, with support for our newest models coming early next year. For more information, visit our fine-tuning guide in the API documentation.
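Assuming a preference-pair file has already been uploaded, kicking off a DPO job through the Node SDK might look like the sketch below; the file id and `beta` value are placeholders:

```javascript
const job = await client.fineTuning.jobs.create({
  model: "gpt-4o-2024-08-06",
  training_file: "file-abc123", // placeholder id for the uploaded preference JSONL
  method: {
    type: "dpo",
    dpo: { hyperparameters: { beta: 0.1 } }, // weight of the preference signal
  },
});
console.log(job.id, job.status);
```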

Go and Java SDKs in beta

Finally, we’re introducing two new official SDKs for Go⁠and Java⁠in beta, in addition to our
existing official Python, Node.js and .NET libraries⁠. Our goal is for OpenAI APIs to be
easy to use, no matter what programming language you choose.

Go is a statically typed language ideal for handling concurrency and building scalable
APIs and backend systems. The OpenAI Go SDK makes it easy to interact with OpenAI
models in your Go code.
```go
package main

import (
	"context"
	"fmt"
	"log"

	"github.com/openai/openai-go"
)

func main() {
	client := openai.NewClient() // reads OPENAI_API_KEY from the environment
	ctx := context.Background()
	prompt := "Write me a haiku about Golang."

	completion, err := client.Chat.Completions.New(ctx,
		openai.ChatCompletionNewParams{
			Messages: openai.F([]openai.ChatCompletionMessageParamUnion{
				openai.UserMessage(prompt),
			}),
			Model: openai.F(openai.ChatModelGPT4o),
		},
	)
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println(completion.Choices[0].Message.Content)
}
```
For more information on the Go SDK, check out the README on GitHub.

Java has been a staple of enterprise software development, favored for its type system
and massive ecosystem of open-source libraries. The OpenAI Java SDK provides typed
request and response objects, and helpful utilities to manage API requests.

```java
import com.openai.client.OpenAIClient;
import com.openai.client.okhttp.OpenAIOkHttpClient;
import com.openai.models.*;
import java.util.List;

// Build the client from environment variables (e.g. OPENAI_API_KEY).
OpenAIClient client = OpenAIOkHttpClient.fromEnv();

ChatCompletionCreateParams params = ChatCompletionCreateParams
    .builder()
    .message(List.of(
        ChatCompletionMessageParam.ofChatCompletionUserMessageParam(
            ChatCompletionUserMessageParam
                .builder()
                .role(ChatCompletionUserMessageParam.Role.USER)
                .content(
                    ChatCompletionUserMessageParam.Content.ofTextContent(
                        "What is the origin of Java's Duke mascot?"
                    )
                )
                .build()
        )
    ))
    .model(ChatModel.O1_PREVIEW)
    .build();

ChatCompletion chatCompletion = client.chat().completions().create(params);
```

For more information on the Java SDK, check out the README on GitHub.

Conclusion

We’re excited to see what you’ll build with these updates—whether it’s new voice apps,
fine-tuned models, or agentic applications that push the boundaries of what’s possible.
Check out the detailed guides for o1⁠, Realtime API⁠, WebRTC integration⁠, and Preference
Fine-Tuning⁠in our API documentation to dive deeper and start experimenting today.

Have questions? Connect with our team on the OpenAI Developer Forum⁠.
