Anthropic Computer Use on Modal

Anthropic's recent release of Computer Use is fantastic, but spinning up random Docker images didn't fit into our production workflow. Luckily, we use Modal and they have all the primitives we need to implement the Computer Use API.

If you're curious about why this library exists, check out this blog post. This repo has a reimplementation of the Computer Use Tools from ~scratch, with a focus on using distributed primitives and state management.

This library provides an out-of-the-box implementation that can be deployed into your Modal environment. Everything runs in a Sandbox, with tool calls being translated into Modal API calls. It may or may not spectacularly explode. It's also quite slow at the moment. Caveat emptor!

Features

Deploys into its own app that can be called from your existing apps
Sandboxes scale to zero and are resumable
VNC tunnel to each sandbox for debugging
One NFS per sandbox, available for inspection
Image processing outside the sandbox, greatly speeding up screenshot generation
Fuzzy matching for the Edit tool, since the model often misses a newline or two
Hardware-accelerated browsing in the sandbox
Pre-warming of the sandbox for faster startup times
Tools for the LLM to work faster, such as apt-fast

Installation

You can install this library without cloning the repo by running:

pip install computer-use-modal

To use it in your own project, simply deploy it once:

modal deploy computer_use_modal

Then you can use it in your app like this:

from modal import Cls

server = Cls.lookup("anthropic-computer-use-modal", "ComputerUseServer")
response = server.messages_create.remote.aio(
    request_id=uuid4().hex,
    user_messages=[{"role": "user", "content": "What is the weather in San Francisco?"}],
)
print(response)

{
    "role": "assistant",
    "content": [
        BetaTextBlock(
            text="According to the National Weather Service, the current weather in San Francisco is:\n\nTemperature: 65°F (18°C)\nHumidity: 53%\nDewpoint: 48°F (9°C)\nLast update: October 23, 2:43 PM PDT\n\nThe website shows the forecast details as well. Would you like me to provide the extended forecast for the coming days?",
            type="text",
        )
    ]
}

You can also watch the progress with a VNC tunnel:

manager = Cls.lookup("anthropic-computer-use-modal", "SandboxManager")
urls = manager.debug_urls.remote()
print(urls["vnc"])

"https://x2xzanmu4yg.r9.modal.host"

If you want to stream the responses, you can use ComputerUseServer.messages_create_gen.

Demo

You can clone this repo and run two demos locally.

CLI Demo

This demo will deploy an ephemeral Modal app, and ask the LLM to browse the web to fetch the weather in San Francisco. Screenshots will be shown in your terminal so you can follow along!

git clone https://github.com/yasyf/anthropic-tool-use-modal
cd anthropic-tool-use-modal
uv sync
modal run computer_use_modal.demo

Streamlit Demo

This demo deploys the app persistently to its own namespace in your Modal account, then starts a Streamlit app that can interact with it.

git clone https://github.com/yasyf/anthropic-tool-use-modal
cd anthropic-tool-use-modal
uv sync --dev
modal deploy computer_use_modal
python -m streamlit run computer_use_modal/streamlit.py

Thanks

Thanks to the Anthropic team for the awesome starting point!

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
computer_use_modal		computer_use_modal
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
demo.png		demo.png
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Anthropic Computer Use on Modal

Features

Installation

Demo

CLI Demo

Streamlit Demo

Thanks

About

Releases

Packages

Languages

yasyf/anthropic-computer-use-modal

Folders and files

Latest commit

History

Repository files navigation

Anthropic Computer Use on Modal

Features

Installation

Demo

CLI Demo

Streamlit Demo

Thanks

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages