Open-Operator: Open-Source Version of OpenAI Operator

open-operator

This project aims to provide the open-source community with an easy-to-use system for building, self-hosting, and evaluating web agent computer-use models. Our goal is to offer an alternative to the $200/month ChatGPT Pro and cloud-based, uncontrolled execution environments.

With open-operator, you can:

Annotate your web trajectory data.
Export the data for further processing.
Prepare the data for supervised fine-tuning (SFT).
Host and deploy the model to interact with live websites.
Automatically evaluate the model’s performance.

We believe in empowering developers to have complete control over their web agents, from training to deployment and evaluation.

Roadmap

Briefly describe the roadmap of the project. Green part will be included in this repo.

Run your Base Agent Using Open-Operator

Prepare the environment

conda create -n open-operator python=3.11
pip install -r requirements.txt

For the browser environment, you can use browserbase to setup the following environment variables.

export BROWSERBASE_API_KEY=your_api_key

Initialize the base agent

python inference/app.py

You can select the base model you want to use in the dropdown menu.(From Anthropic, Google, OpenAI, etc.)

Then start your first experience with Open-Operator!

Data Annotation and Downloading

Follow the step wise instruction below:

Download the latest iMean builder extension here: iMean Builder
Install the extension on your browser.
Record your web trajectory data you want to train your model on in the natural way you interact with the website. Edit the title of each data.
Create a private channel on iMean Builder Platform and move all the data into that channel. -> How to: Docs
Create a private challenge on WebCanvas website and connect it with the channel in the last step. -> How to: Docs
Get the challenge id and use it to download all the data from the iMean Builder Platform.

Set the challenge id, iMean Builder username, password in configs/config.yaml.

Just run python main.py to download the data. Now you can download some sample data by default challenge id.

If you log in iMean Builder with Google account, you can set the password on the profile page.

Data Pre-processing

For Dom Tree mode, Just run python main.py

For Vision mode, code coming soon.

Native Agent Model Training

coming soon

Native Agent Model Evaluation

coming soon

TODO

Instruction on how to annotate your web trajectory data
Data downloading
Pre-process the data to be SFT-ready - DOM Tree
Pre-process the data to be SFT-ready - Vision
Host the local model and inference on live websites
Automatically evaluation using WebCanvas framework

Previous Solutions

For reference on web agent evaluation, you can check out the WebCanvas repo: WebCanvas

For more information on open-source GUI agent research projects and collaborations, check out WebAgentLab (WebAgentLab Homepage).

Stay tuned!

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.gradio		.gradio
configs		configs
data_processing		data_processing
inference		inference
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Open-Operator: Open-Source Version of OpenAI Operator

open-operator

Roadmap

Run your Base Agent Using Open-Operator

Prepare the environment

Initialize the base agent

Data Annotation and Downloading

Data Pre-processing

Native Agent Model Training

Native Agent Model Evaluation

TODO

Previous Solutions

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

iMeanAI/open-source-operator

Folders and files

Latest commit

History

Repository files navigation

Open-Operator: Open-Source Version of OpenAI Operator

open-operator

Roadmap

Run your Base Agent Using Open-Operator

Prepare the environment

Initialize the base agent

Data Annotation and Downloading

Data Pre-processing

Native Agent Model Training

Native Agent Model Evaluation

TODO

Previous Solutions

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages