AgentRun: Run AI Generated Code Safely

This is a fork of the AgentRun library with added customizations and optimizations, specifically:

A "full service" implementation. Both the Python runner and API images use a REST API with a full set of services needed for running Python programs.

Customizable install commands.

Fixed issues with "uv" installer.

Improved management of cached dependencies.

Option to allow certain unsafe functions (due to false positives).

Option to ignore certain dependencies.

AgentRun is a Python library that makes it easy to run Python code safely from large language models (LLMs) with a single line of code. Built on top of the Docker Python SDK and RestrictedPython, it provides a simple, transparent, and user-friendly API to manage isolated code execution.

AgentRun automatically installs and uninstalls dependencies with optional caching, limits resource consumption, checks code safety, and sets execution timeouts. It has 100% test coverage with full static typing and only two dependencies.

Documentation
Get started in minutes

Note

Looking for a state-of-the-art RAG API? Check out ColiVara, also from us.

Why?

Giving code execution ability to LLMs is a massive upgrade. Consider the following user query: "What is 12345 * 54321?" or even something more ambitious like "What is the average daily move of Apple stock during the last week?" With code execution, LLMs can answer both accurately by running code instead of guessing.

However, executing untrusted code is dangerous and full of potential footguns. For instance, without proper safeguards, an LLM might generate harmful code like this:

import os
# deletes all files and directories
os.system('rm -rf /')

This package gives code execution ability to any LLM in a single line of code, while preventing and guarding against dangerous code.
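
To illustrate the kind of pre-execution check described above, here is a minimal sketch that walks the AST of a generated snippet and flags imports and calls from a small denylist. This is not AgentRun's actual checker (the library builds on RestrictedPython); the function name `find_unsafe_usage` and both denylists are hypothetical and chosen only for illustration.

```python
import ast

# Hypothetical denylists for illustration; a real policy would be far more thorough.
BLOCKED_MODULES = {"os", "subprocess", "shutil", "socket"}
BLOCKED_CALLS = {"eval", "exec", "compile", "__import__", "open"}


def find_unsafe_usage(source: str) -> list:
    """Return human-readable findings for blocked imports and calls in `source`."""
    findings = []
    tree = ast.parse(source)
    for node in ast.walk(tree):
        if isinstance(node, ast.Import):
            # `import os` or `import os.path`: compare the top-level module name.
            for alias in node.names:
                if alias.name.split(".")[0] in BLOCKED_MODULES:
                    findings.append(f"line {node.lineno}: import of '{alias.name}'")
        elif isinstance(node, ast.ImportFrom):
            # `from os import system`
            if (node.module or "").split(".")[0] in BLOCKED_MODULES:
                findings.append(f"line {node.lineno}: import from '{node.module}'")
        elif isinstance(node, ast.Call) and isinstance(node.func, ast.Name):
            # Direct calls to blocked builtins such as eval(...)
            if node.func.id in BLOCKED_CALLS:
                findings.append(f"line {node.lineno}: call to '{node.func.id}()'")
    return findings


if __name__ == "__main__":
    snippet = "import os\nos.system('rm -rf /')\n"
    for finding in find_unsafe_usage(snippet):
        print(finding)
```

A static check of this kind is easy to bypass on its own, which is why it makes sense to combine it with container isolation, resource limits, and timeouts.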

Key Features

Safe code execution: AgentRun checks the generated code for dangerous elements before execution.
Isolated Environment: Code is executed in a fully isolated Docker container.
Configurable Resource Management: You can set how much compute the code can consume, with sane defaults.
Timeouts: Set time limits on how long a script can take to run.
Dependency Management: Complete control over which dependencies are allowed to be installed.
Dependency Caching: AgentRun lets you cache any dependency in the Docker container in advance to optimize performance.
Automatic Cleanups: AgentRun cleans up any artifacts created by the generated code.
Dual Protocol Support: Access via REST API or Model Context Protocol (MCP) for seamless LLM integration.
REST API & MCP Server: Both protocols are available simultaneously on the same server and share session state.
Transparent Exception Handling: AgentRun returns exactly the same output as running Python on your system, exceptions and tracebacks included. No cryptic Docker messages.

If you want to use your own Docker configuration, install this package with pip and simply initialize AgentRun with a running Docker container. Additionally, you can use an already configured Docker Compose setup and API that is ready for self-hosting by cloning this repo.

Unless you are comfortable with Docker, we highly recommend using the REST API with the preconfigured Docker setup as a standalone service.

Getting Started

Clone the GitHub repository and start immediately with a standalone REST API.

git clone https://github.com/rvijayc/agentrun
cd agentrun_plus
./docker-compose.sh dev up --build

This starts the AgentRunPlus server, which you can talk to through the AgentRun REST API.

You can also interact with the server using curl if you prefer, but it might get a bit cumbersome.

Using the MCP (Model Context Protocol) Server

AgentRunPlus now supports the Model Context Protocol (MCP), allowing LLMs like Claude to use it as a server with tools for code execution. Both REST API and MCP endpoints are available simultaneously on the same server.

MCP Endpoint

Once the server is running, the MCP endpoint is available at:

http://localhost:8000/mcp

Available MCP Tools

AgentRunPlus exposes 9 MCP tools:

create_session() - Create a new isolated session
execute_code(session_id, code, ...) - Execute Python code in a session
upload_file(session_id, filename, content_base64) - Upload a file to session
download_file(session_id, src_path) - Download a file from session
list_sessions() - List all active sessions
get_session_info(session_id) - Get session details
close_session(session_id) - Close and cleanup a session
get_packages() - Get list of installed Python packages (no session required)
get_health() - Check server health status
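
For readers who want to see what a tool invocation looks like on the wire, the sketch below builds the JSON-RPC 2.0 request body that MCP uses for `tools/call`. The helper name `build_tool_call` is hypothetical, and the argument names `session_id` and `code` are taken from the tool list above. The sketch only constructs the payload: a real MCP client must also perform the initialization handshake and handle transport details, so in practice an MCP client library or the AgentRunMCPClient is the easier route.

```python
import json
from itertools import count
from typing import Optional

# JSON-RPC request ids should be unique within a connection.
_request_ids = count(1)


def build_tool_call(tool_name: str, arguments: Optional[dict] = None) -> dict:
    """Build an MCP `tools/call` request as a JSON-RPC 2.0 message (hypothetical helper)."""
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments or {}},
    }


if __name__ == "__main__":
    request = build_tool_call(
        "execute_code",
        # "<session-id>" is a placeholder; use the id returned by create_session.
        {"session_id": "<session-id>", "code": "print('Hello World!')"},
    )
    print(json.dumps(request, indent=2))
```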

Using with Claude Desktop

Add to your Claude Desktop MCP configuration (~/Library/Application Support/Claude/claude_desktop_config.json on macOS):

{
  "mcpServers": {
    "agentrun": {
      "url": "http://localhost:8000/mcp",
      "transport": "http"
    }
  }
}
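
If the configuration file already lists other servers, the entry has to be merged rather than pasted over the file. The following is a minimal sketch of such a merge, assuming the entry shape shown above; `add_mcp_server` is a hypothetical helper, not part of AgentRunPlus, and the demo writes to a temporary file instead of the real Claude Desktop configuration.

```python
import json
import tempfile
from pathlib import Path


def add_mcp_server(config_path: Path, name: str, url: str) -> dict:
    """Add or update one entry under "mcpServers", keeping all other settings."""
    config = {}
    if config_path.exists():
        text = config_path.read_text(encoding="utf-8")
        if text.strip():
            config = json.loads(text)
    servers = config.setdefault("mcpServers", {})
    # Entry shape copied from the snippet above.
    servers[name] = {"url": url, "transport": "http"}
    config_path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "claude_desktop_config.json"
        # Simulate an existing config with an unrelated, made-up server entry.
        path.write_text('{"mcpServers": {"other": {"command": "some-server"}}}', encoding="utf-8")
        add_mcp_server(path, "agentrun", "http://localhost:8000/mcp")
        print(path.read_text(encoding="utf-8"))
```

Back up the real file before pointing a script like this at it.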

Using the AgentRunMCPClient

The AgentRunMCPClient provides a drop-in replacement for AgentRunAPIClient that communicates over MCP instead of REST. Both clients share the same interface.

from agentrun_plus import AgentRunMCPClient

client = AgentRunMCPClient("http://localhost:8000")

# Check what packages are available (no session required)
packages = client.get_packages()
print(f"{packages['count']} packages available: {packages['packages'][:5]}...")

# Create a session
session = client.create_session()

# Execute some Python code
result = client.execute_code(session.session_id, """
import pandas as pd
import matplotlib.pyplot as plt

data = {'x': [1, 2, 3, 4], 'y': [10, 20, 25, 30]}
df = pd.DataFrame(data)
print(df.describe())
""")
print(result["output"])

# Close the session
client.close_session(session.session_id)
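
In the example above, the session is closed explicitly, so an exception raised between `create_session` and `close_session` would leave it open. One way to guard against that is a small context manager, sketched below. `open_session` is a hypothetical helper that relies only on the `create_session` and `close_session` calls shown above; `FakeClient` is a stand-in so the sketch runs without a server.

```python
from contextlib import contextmanager
from types import SimpleNamespace


@contextmanager
def open_session(client):
    """Create a session on `client` and always close it when the block exits."""
    session = client.create_session()
    try:
        yield session
    finally:
        client.close_session(session.session_id)


class FakeClient:
    """Stand-in with the same method names as the clients above (demo only)."""

    def __init__(self):
        self.closed = []

    def create_session(self):
        return SimpleNamespace(session_id="demo-session")

    def close_session(self, session_id):
        self.closed.append(session_id)


if __name__ == "__main__":
    client = FakeClient()
    try:
        with open_session(client) as session:
            raise RuntimeError("simulated failure while using the session")
    except RuntimeError:
        pass
    print(client.closed)  # the session was closed despite the error
```

With a real client, the body of the `with` block would hold the `execute_code` calls.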

File Operations via MCP Client

from agentrun_plus import AgentRunMCPClient

client = AgentRunMCPClient("http://localhost:8000")
session = client.create_session()

# Upload a file (handles base64 encoding automatically)
client.upload_file(session.session_id, "data.csv")

# Execute code that reads the upload and saves output to artifacts/
# Note: src/ and artifacts/ are pre-created - no need to mkdir them
client.execute_code(session.session_id, """
import pandas as pd
df = pd.read_csv('src/data.csv')
df.describe().to_csv('artifacts/summary.csv')
print('Done!')
""")

# Download the result (handles base64 decoding automatically)
client.download_file(session.session_id, "artifacts/summary.csv", "/tmp")

client.close_session(session.session_id)
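
The client hides the base64 step, but anyone calling the `upload_file` and `download_file` MCP tools directly has to do it themselves. The sketch below shows that encoding and decoding with the standard library. `encode_upload` and `decode_download` are hypothetical helpers; the `filename` and `content_base64` keys follow the tool signature listed earlier, while the shape of the download response is not documented here, so the decoder simply takes the base64 string.

```python
import base64
import tempfile
from pathlib import Path


def encode_upload(path: Path) -> dict:
    """Build the file-related arguments for upload_file (session_id not included)."""
    return {
        "filename": path.name,
        "content_base64": base64.b64encode(path.read_bytes()).decode("ascii"),
    }


def decode_download(content_base64: str, dest: Path) -> Path:
    """Write base64-encoded file content to `dest` and return the path."""
    dest.write_bytes(base64.b64decode(content_base64))
    return dest


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        src = Path(tmp) / "data.csv"
        src.write_text("x,y\n1,10\n2,20\n", encoding="utf-8")
        args = encode_upload(src)
        print(args["filename"], len(args["content_base64"]))
        # Round trip: decoding yields the original bytes.
        copy = decode_download(args["content_base64"], Path(tmp) / "copy.csv")
        print(copy.read_bytes() == src.read_bytes())
```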

Interoperability with REST API

Sessions created via MCP are accessible via the REST API and vice versa. Both protocols share the same backend and the clients have the same interface:

from agentrun_plus import AgentRunMCPClient, AgentRunAPIClient

mcp_client = AgentRunMCPClient("http://localhost:8000")
api_client = AgentRunAPIClient("http://localhost:8000")

# Create session via MCP
session = mcp_client.create_session()

# Execute code via REST API on the same session
api_client.execute_code(session.session_id, 'print("Hello from REST!")')

# Both clients can list packages the same way
mcp_packages = mcp_client.get_packages()
rest_packages = api_client.get_packages()
assert mcp_packages["packages"] == rest_packages["packages"]

mcp_client.close_session(session.session_id)

pip install

Install AgentRunPlus via pip:

For library usage (core AgentRun backend):

pip install agentrun_plus

For running the API server (includes REST API + MCP server):

pip install "agentrun_plus[api]"

Note: If using docker-compose (recommended), dependencies are managed automatically in the container.

Here is a simple example of using the MCP client to connect to a running server:

~/llm/AgentRun$ python3
Python 3.13.2 | packaged by Anaconda, Inc. | (main, Feb  6 2025, 18:56:02) [GCC 11.2.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> from agentrun_plus import AgentRunMCPClient
>>> client = AgentRunMCPClient('http://localhost:8000')
>>> client.get_packages()
{'packages': ['numpy', 'pandas', 'matplotlib', ...], 'count': 42}
>>> session = client.create_session()
>>> client.execute_code(session.session_id, 'print("Hello World!")')
{'success': True, 'output': 'Hello World!\n'}
>>> client.close_session(session.session_id)
{'success': True, 'message': 'Session 978c8bb329cd473d99cd3b596c7b557a closed successfully'}
>>>
