0% found this document useful (0 votes)

9 views6 pages

Computer Use Documentation

The document outlines the beta feature of computer use with the upgraded Claude 3.5 Sonnet model, detailing its capabilities and associated risks. It provides guidelines for safe implementation, including using virtual machines, avoiding sensitive data access, and confirming critical decisions with human oversight. Additionally, it describes the process for integrating computer use tools, optimizing model performance, and the limitations and pricing structure for the API requests.

Uploaded by

SlimeDiaz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views6 pages

Computer Use Documentation

Uploaded by

SlimeDiaz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

You are on page 1/ 6

Build with Claude

Computer use (beta)

The upgraded Claude 3.5 Sonnet model is capable of interacting with tools that can
manipulate a computer desktop environment.

Computer use is a beta feature. Please be aware that computer use poses unique
risks that are distinct from standard API features or chat interfaces. These risks
are heightened when using computer use to interact with the internet. To minimize
risks, consider taking precautions such as:

Use a dedicated virtual machine or container with minimal privileges to prevent

direct system attacks or accidents.
Avoid giving the model access to sensitive data, such as account login information,
to prevent information theft.
Limit internet access to an allowlist of domains to reduce exposure to malicious
content.
Ask a human to confirm decisions that may result in meaningful real-world
consequences as well as any tasks requiring affirmative consent, such as accepting
cookies, executing financial transactions, or agreeing to terms of service.
In some circumstances, Claude will follow commands found in content even if it
conflicts with the user’s instructions. For example, Claude instructions on
webpages or contained in images may override instructions or cause Claude to make
mistakes. We suggest taking precautions to isolate Claude from sensitive data and
actions to avoid risks related to prompt injection.

Finally, please inform end users of relevant risks and obtain their consent prior
to enabling computer use in your own products.

Computer use reference implementation

Get started quickly with our computer use reference implementation that includes a
web interface, Docker container, example tool implementations, and an agent loop.

Please use this form to provide feedback on the quality of the model responses, the
API itself, or the quality of the documentation - we cannot wait to hear from you!

Here’s an example of how to provide computer use tools to Claude using the Messages
API:

Shell

Python

TypeScript

How computer use works

1. Provide Claude with computer use tools and a user prompt

Add Anthropic-defined computer use tools to your API request.

Include a user prompt that might require these tools, e.g., “Save a picture of a
cat to my desktop.”
2. Claude decides to use a tool

Claude loads the stored computer use tool definitions and assesses if any tools can
help with the user’s query.
If yes, Claude constructs a properly formatted tool use request.
The API response has a stop_reason of tool_use, signaling Claude’s intent.
3. Extract tool input, evaluate the tool on a computer, and return results

On your end, extract the tool name and input from Claude’s request.
Use the tool on a container or Virtual Machine.
Continue the conversation with a new user message containing a tool_result content
block.
4. Claude continues calling computer use tools until it's completed the task

Claude analyzes the tool results to determine if more tool use is needed or the
task has been completed.
If Claude decides it needs another tool, it responds with another tool_use
stop_reason and you should return to step 3.
Otherwise, it crafts a text response to the user.
We refer to the repetition of steps 3 and 4 without user input as the “agent loop”
- i.e., Claude responding with a tool use request and your application responding
to Claude with the results of evaluating that request.

How to implement computer use

Start with our reference implementation

We have built a reference implementation that includes everything you need to get
started quickly with computer use:

A containerized environment suitable for computer use with Claude

Implementations of the computer use tools
An agent loop that interacts with the Anthropic API and executes the computer use
tools
A web interface to interact with the container, agent loop, and tools.
We recommend trying the reference implementation out before reading the rest of
this documentation.

Optimize model performance with prompting

Here are some tips on how to get the best quality outputs:

Specify simple, well-defined tasks and provide explicit instructions for each step.
Claude sometimes assumes outcomes of its actions without explicitly checking their
results. To prevent this you can prompt Claude with After each step, take a
screenshot and carefully evaluate if you have achieved the right outcome.
Explicitly show your thinking: "I have evaluated step X..." If not correct, try
again. Only when you confirm a step was executed correctly should you move on to
the next one.
Some UI elements (like dropdowns and scrollbars) might be tricky for Claude to
manipulate using mouse movements. If you experience this, try prompting the model
to use keyboard shortcuts.
For repeatable tasks or UI interactions, include example screenshots and tool calls
of successful outcomes in your prompt.
If you repeatedly encounter a clear set of issues or know in advance the tasks
Claude will need to complete, use the system prompt to provide Claude with explicit
tips or instructions on how to do the tasks successfully.

System prompts
When one of the Anthropic-defined tools is requested via the Anthropic API, a
computer use-specific system prompt is generated. It’s similar to the tool use
system prompt but starts with:

You have access to a set of functions you can use to answer the user’s question.
This includes access to a sandboxed computing environment. You do NOT currently
have the ability to inspect files or interact with external resources, except by
invoking the below functions.

As with regular tool use, the user-provided system_prompt field is still respected
and used in the construction of the combined system prompt.

Understand Anthropic-defined tools

As a beta, these tool definitions are subject to change.

We have provided a set of tools that enable Claude to effectively use computers.
When specifying an Anthropic-defined tool, description and tool_schema fields are
not necessary or allowed.

Anthropic-defined tools are user executed

Anthropic-defined tools are defined by Anthropic but you must explicitly evaluate
the results of the tool and return the tool_results to Claude. As with any tool,
the model does not automatically execute the tool.

We currently provide 3 Anthropic-defined tools:

{ "type": "computer_20241022", "name": "computer" }

{ "type": "text_editor_20241022", "name": "str_replace_editor" }
{ "type": "bash_20241022", "name": "bash" }
The type field identifies the tool and its parameters for validation purposes, the
name field is the tool name exposed to the model.
If you want to prompt the model to use one of these tools, you can explicitly refer
the tool by the name field. The name field must be unique within the tool list; you
cannot define a tool with the same name as an Anthropic-defined tool in the same
API call.

We do not recommend defining tools with the names of Anthropic-defined tools. While
you can still redefine tools with these names (as long as the tool name is unique
in your tools block), doing so may result in degraded model performance.

Computer tool

Text editor tool

Bash tool

Combine computer use with other tools

You can combine regular tool use with the Anthropic-defined tools for computer use.

Shell

Python

TypeScript

curl https://api.anthropic.com/v1/messages \
-H "content-type: application/json" \
-H "x-api-key: $ANTHROPIC_API_KEY" \
-H "anthropic-version: 2023-06-01" \
-H "anthropic-beta: computer-use-2024-10-22" \
-d '{
"model": "claude-3-5-sonnet-20241022",
"max_tokens": 1024,
"tools": [
{
"type": "computer_20241022",
"name": "computer"
"display_width_px": 1024,
"display_height_px": 768,
"display_number": 1
},
{
"type": "text_editor_20241022",
"name": "str_replace_editor"
},
{
"type": "bash_20241022",
"name": "bash"
},
{
"name": "get_weather",
"description": "Get the current weather in a given location",
"input_schema": {
"type": "object",
"properties": {
"location": {
"type": "string",
"description": "The city and state, e.g. San Francisco, CA"
},
"unit": {
"type": "string",
"enum": ["celsius", "fahrenheit"],
"description": "The unit of temperature, either 'celsius' or
'fahrenheit'"
}
},
"required": ["location"]
}
},
],
"messages": [
{
"role": "user",
"content": "Find flights from San Francisco to a place with warmer
weather."
}
]
}'

Build a custom computer use environment

The reference implementation is meant to help you get started with computer use. It
includes all of the components needed have Claude use a computer. However, you can
build your own environment for computer use to suit your needs. You’ll need:

A virtualized or containerized environment suitable for computer use with Claude

An implementation of at least one of the Anthropic-defined computer use tools
An agent loop that interacts with the Anthropic API and executes the tool_use
results using your tool implementations
An API or UI that allows user input to start the agent loop

Understand computer use limitations

The computer use functionality is in beta. While Claude’s capabilities are cutting
edge, developers should be aware of its limitations:

Latency: the current computer use latency for human-AI interactions may be too slow
compared to regular human-directed computer actions. We recommend focusing on use
cases where speed isn’t critical (e.g., background information gathering, automated
software testing) in trusted environments.
Computer vision accuracy and reliability: Claude may make mistakes or hallucinate
when outputting specific coordinates while generating actions.
Tool selection accuracy and reliability: Claude may make mistakes or hallucinate
when selecting tools while generating actions or take unexpected actions to solve
problems. Additionally, reliability may be lower when interacting with niche
applications or multiple applications at once. We recommend that users prompt the
model carefully when requesting complex tasks.
Scrolling reliability: Scrolling may be unreliable in the current experience, and
the model may not reliably scroll to the bottom of a page. Scrolling-like behavior
can be improved via keystrokes (PgUp/PgDown).
Spreadsheet interaction: Mouse clicks for spreadsheet interaction are unreliable.
Cell selection may not always work as expected. This can be mitigated by prompting
the model to use arrow keys.
Account creation and content generation on social and communications platforms:
While Claude will visit websites, we are limiting its ability to create accounts or
generate and share content or otherwise engage in human impersonation across social
media websites and platforms. We may update this capability in the future.
Vulnerabilities: Vulnerabilities like jailbreaking or prompt injection may persist
across frontier AI systems, including the beta computer use API. In some
circumstances, Claude will follow commands found in content, sometimes even in
conflict with the user’s instructions. For example, Claude instructions on webpages
or contained in images may override instructions or cause Claude to make mistakes.
We recommend: a. Limiting computer use to trusted environments such as virtual
machines or containers with minimal privileges b. Avoiding giving computer use
access to sensitive accounts or data without strict oversight c. Informing end
users of relevant risks and obtaining their consent before enabling or requesting
permissions necessary for computer use features in your applications
Inappropriate or illegal actions: Per Anthropic’s terms of service, you must not
employ computer use to violate any laws or our Acceptable Use Policy.
Always carefully review and verify Claude’s computer use actions and logs. Do not
use Claude for tasks requiring perfect precision or sensitive user information
without human oversight.

Pricing
See the tool use pricing documentation for a detailed explanation of how Claude
Tool Use API requests are priced.

As a subset of tool use requests, computer use requests are priced the same as any
other Claude API request.

We also automatically include a special system prompt for the model, which enables
computer use.

Model Tool choice System prompt token count

Claude 3.5 Sonnet (new) auto
any, tool 466 tokens
499 tokens
In addition to the base tokens, the following additional input tokens are needed
for the Anthropic-defined tools:

Tool Additional input tokens

computer_20241022 683 tokens
text_editor_20241022 700 tokens
bash_20241022 245 tokens

(WFE) SRT Onboarding - VPN Setup and SRT Accounts
No ratings yet
(WFE) SRT Onboarding - VPN Setup and SRT Accounts
16 pages
Hypermodern Python Tooling
No ratings yet
Hypermodern Python Tooling
501 pages
Week 1
No ratings yet
Week 1
223 pages
Claude 4
No ratings yet
Claude 4
17 pages
Master Art of Aayush
No ratings yet
Master Art of Aayush
221 pages
Distributed Computing UnitII
No ratings yet
Distributed Computing UnitII
102 pages
PyGen A Collaborative Human-AI Approach To Python
No ratings yet
PyGen A Collaborative Human-AI Approach To Python
33 pages
Anthropic Claude Code Best Practices 1745281865
100% (1)
Anthropic Claude Code Best Practices 1745281865
30 pages
10 2 Appendix Tool Use
No ratings yet
10 2 Appendix Tool Use
12 pages
Computer Organization and Architecture Notes 1 - TutorialsDuniya
No ratings yet
Computer Organization and Architecture Notes 1 - TutorialsDuniya
119 pages
Introduction PP Ppt2
No ratings yet
Introduction PP Ppt2
29 pages
Chapter 1 - Introduction: Dept. of Electronics and Communication Engineering 1
0% (1)
Chapter 1 - Introduction: Dept. of Electronics and Communication Engineering 1
38 pages
GROUP-9-ppt 1-1
No ratings yet
GROUP-9-ppt 1-1
85 pages
Unit 4
No ratings yet
Unit 4
36 pages
NEOS Server 4.0 Administrative Guide: Argonne National Laboratory 9700 South Cass Avenue Argonne, IL 60439
No ratings yet
NEOS Server 4.0 Administrative Guide: Argonne National Laboratory 9700 South Cass Avenue Argonne, IL 60439
45 pages
Sysvabi 64
No ratings yet
Sysvabi 64
38 pages
Distributed Systems Architecture and Models
No ratings yet
Distributed Systems Architecture and Models
58 pages
RSPP En-Us SG M05 Pythonintro
No ratings yet
RSPP En-Us SG M05 Pythonintro
22 pages
8.deployment Diagram
No ratings yet
8.deployment Diagram
5 pages
Chatbot
0% (1)
Chatbot
41 pages
Chatbot Synopsis Report
No ratings yet
Chatbot Synopsis Report
18 pages
Lesson Plan - Unit No. 1 - Lesson No. 2 - Grade 10
No ratings yet
Lesson Plan - Unit No. 1 - Lesson No. 2 - Grade 10
9 pages
Claude 3 Model Card
No ratings yet
Claude 3 Model Card
64 pages
ICRES 2022 Proceeding
No ratings yet
ICRES 2022 Proceeding
305 pages
Lecture 14 PDC Bcs 6ef Smi Spring 2025
No ratings yet
Lecture 14 PDC Bcs 6ef Smi Spring 2025
32 pages
Computational Artefacts
No ratings yet
Computational Artefacts
21 pages
Anthropic
No ratings yet
Anthropic
12 pages
Claude - System Prompt
No ratings yet
Claude - System Prompt
16 pages
Claude AI - CLI Tools, Safety Architecture, and Known Limitations
No ratings yet
Claude AI - CLI Tools, Safety Architecture, and Known Limitations
5 pages
7SENG012C - Software Development Environments (IIT Sri Lanka) 2024-25v
No ratings yet
7SENG012C - Software Development Environments (IIT Sri Lanka) 2024-25v
5 pages
Unit - 2
No ratings yet
Unit - 2
163 pages
Claude Sonnet 3.7 New
No ratings yet
Claude Sonnet 3.7 New
18 pages
EPDA
No ratings yet
EPDA
25 pages
Pharmacology Dr. Dinesh Atf
No ratings yet
Pharmacology Dr. Dinesh Atf
35 pages
Marvel Demo
100% (1)
Marvel Demo
11 pages
System Analysis and Design
75% (4)
System Analysis and Design
2 pages
HND L300 Course Outline Corrected
No ratings yet
HND L300 Course Outline Corrected
10 pages
Claude Sonnet 3.7
No ratings yet
Claude Sonnet 3.7
4 pages
Process Instrumentation
100% (1)
Process Instrumentation
12 pages
Under The Guidance of Dr. Kota Solomon Raju Principal Scientist
No ratings yet
Under The Guidance of Dr. Kota Solomon Raju Principal Scientist
42 pages
Ai Enabled Programming Networking and Cybersecurity Reduced
No ratings yet
Ai Enabled Programming Networking and Cybersecurity Reduced
28 pages
Network Programmability Foundation PDF
No ratings yet
Network Programmability Foundation PDF
141 pages
XHD Skid Mounted CTU
No ratings yet
XHD Skid Mounted CTU
31 pages
Nazi
No ratings yet
Nazi
14 pages
Software Tools and Environments
No ratings yet
Software Tools and Environments
4 pages
Claude Code Best Practices - Anthropic
No ratings yet
Claude Code Best Practices - Anthropic
23 pages
Tools For Config
No ratings yet
Tools For Config
14 pages
Robotics Thesis Title
100% (3)
Robotics Thesis Title
6 pages
Distributing Computing: Introduction To Python Remote Objects (Pyro)
No ratings yet
Distributing Computing: Introduction To Python Remote Objects (Pyro)
8 pages
CH 4 Distributed System
No ratings yet
CH 4 Distributed System
6 pages
3
No ratings yet
3
39 pages
Biogrid Application Toolkit: A Grid-Based Problem Solving Environment Tool For Biomedical Data Analysis
No ratings yet
Biogrid Application Toolkit: A Grid-Based Problem Solving Environment Tool For Biomedical Data Analysis
13 pages
Infromation System1
No ratings yet
Infromation System1
47 pages
Computer Science Homework 1.2.1 - 1.2.5
No ratings yet
Computer Science Homework 1.2.1 - 1.2.5
4 pages
EC Master - V3.2 Python
No ratings yet
EC Master - V3.2 Python
17 pages
Deployment Diagram 1
No ratings yet
Deployment Diagram 1
17 pages
Claude (Language Model)
No ratings yet
Claude (Language Model)
5 pages
Cursor Agent
No ratings yet
Cursor Agent
5 pages
Cursor Chat
No ratings yet
Cursor Chat
5 pages
Commands As AI Conversations: Adventures in Code
No ratings yet
Commands As AI Conversations: Adventures in Code
4 pages
CSCE455/855 Distributed Operating Systems: Dr. Ying Lu
No ratings yet
CSCE455/855 Distributed Operating Systems: Dr. Ying Lu
40 pages
Episode
No ratings yet
Episode
8 pages
2013luv Supercomputers
No ratings yet
2013luv Supercomputers
12 pages
我被指派的任务
100% (2)
我被指派的任务
10 pages
Week 5 Module 5 Graded Quiz
No ratings yet
Week 5 Module 5 Graded Quiz
4 pages
1Z0 1112 2 Demo
No ratings yet
1Z0 1112 2 Demo
4 pages
Chapter - I
No ratings yet
Chapter - I
36 pages
Sekolah Menengah Kebangsaan Tanjung Gemok Information and Communication Technology 3 7 6 5 / 2 Sijil Pelajaran Malaysia (2 0 1 2 / 2 0 1 3)
No ratings yet
Sekolah Menengah Kebangsaan Tanjung Gemok Information and Communication Technology 3 7 6 5 / 2 Sijil Pelajaran Malaysia (2 0 1 2 / 2 0 1 3)
10 pages
The Latest Open Source Software Available and The Latest Development in ICT
No ratings yet
The Latest Open Source Software Available and The Latest Development in ICT
8 pages
AI Code Generators Article - Part 2 0623
No ratings yet
AI Code Generators Article - Part 2 0623
4 pages
Software Tools and Environments
No ratings yet
Software Tools and Environments
5 pages
Syahir Punyalah!!
No ratings yet
Syahir Punyalah!!
9 pages
Final Year Project Format
No ratings yet
Final Year Project Format
11 pages
Visvesvaraya Technological University: "Car Rental Management System"
No ratings yet
Visvesvaraya Technological University: "Car Rental Management System"
31 pages
Project 4 Design Presentation 22
No ratings yet
Project 4 Design Presentation 22
6 pages
Student Name: Bhumika Shrestha TP Number: NP000194 Performance Criteria: REPORT (30%) Very Poor Poor Adequate Good Excellent
No ratings yet
Student Name: Bhumika Shrestha TP Number: NP000194 Performance Criteria: REPORT (30%) Very Poor Poor Adequate Good Excellent
4 pages
Vaibhav Word 1
No ratings yet
Vaibhav Word 1
2 pages
Admin Panel V9
No ratings yet
Admin Panel V9
25 pages
(IHS) Grid Computing at IHS
No ratings yet
(IHS) Grid Computing at IHS
4 pages
Sentipack
No ratings yet
Sentipack
11 pages
External
No ratings yet
External
15 pages
Task Management For Soft Real-Time Applications Based On General Purpose Operating Systems
No ratings yet
Task Management For Soft Real-Time Applications Based On General Purpose Operating Systems
11 pages
19me21p1 PDF
No ratings yet
19me21p1 PDF
2 pages
Denon ASD-3N
No ratings yet
Denon ASD-3N
7 pages
Sodapdf
No ratings yet
Sodapdf
7 pages
ME569 - Project - Fall 2022
No ratings yet
ME569 - Project - Fall 2022
9 pages
(Ebook) Get Programming With JavaScript by John R. Larsen ISBN 9781617293108, 1617293105 Instant Download
100% (4)
(Ebook) Get Programming With JavaScript by John R. Larsen ISBN 9781617293108, 1617293105 Instant Download
63 pages
Aditya Resume 2
No ratings yet
Aditya Resume 2
2 pages
SQL Cheat Sheet Bascis - MD
No ratings yet
SQL Cheat Sheet Bascis - MD
1 page

Computer Use Documentation

Uploaded by

Computer Use Documentation

Uploaded by

Build with Claude

Computer use (beta)

Use a dedicated virtual machine or container with minimal privileges to prevent

Computer use reference implementation

How computer use works

Add Anthropic-defined computer use tools to your API request.

How to implement computer use

Start with our reference implementation

A containerized environment suitable for computer use with Claude

Optimize model performance with prompting

Understand Anthropic-defined tools

Anthropic-defined tools are user executed

We currently provide 3 Anthropic-defined tools:

{ "type": "computer_20241022", "name": "computer" }

Text editor tool

Combine computer use with other tools

Build a custom computer use environment

A virtualized or containerized environment suitable for computer use with Claude

Understand computer use limitations

Model Tool choice System prompt token count

Tool Additional input tokens

You might also like