AgentsNet

This is the code for our preprint AgentsNet: Coordination and Collaborative Reasoning in Multi-Agent LLMs. We benchmark networks of LLM agents on five fundamental problems from distributed computing to assess their collaborative reasoning capabilities!

Make sure to also check out our blog post (https://agentsnet.graphben.ch) with an interactive demo!

Installation

To set up the project, follow these steps:

1. Create a Conda Environment

conda create -n agentsnet python=3.11 -y
conda activate agentsnet

2. Install Dependencies

Install the required Python packages:

pip install datasets langchain langgraph langchain-openai langchain-ollama langchain-google-genai langchain-anthropic pandas scipy networkx numpy==1.26.4

Running experiments

First make sure that you set the API key for the provider you want to use. For google genai, this would be:

export GOOGLE_API_KEY=<INSERT-API-KEY-HERE>

For openai:

export OPENAI_API_KEY=<INSERT-API-KEY-HERE>

For anthropic:

export ANTHROPIC_API_KEY=<INSERT-API-KEY-HERE>

Then, you can start a run:

python main.py --graph_size 16 --task coloring --rounds 8 --samples_per_graph_model 3 --model gemini-2.0-flash

This runs 12 instances of the coloring task of 16 nodes for 8 rounds, with 4 different graph classes and 3 samples per graph class each, with gemini-2.0-flash as the model. See main.sh to run a complete run of the benchmark for one particular model.

Running more extensive experiments

To run the AgentsNet benchmark on all tasks with 12 graphs each (of size 16), you can use:

./main.sh gemini-2.0-flash

This will run with gemini-2.0-flash. You can choose from a series of OpenAI, Anthropic and Gemini models.

Running the chat tool

We have a simple chat tool to read transcripts from completed runs. Simply run

python chat_tool.py --file [FILE]

where [FILE] is a results file produced by main.py. Providing --agents agent1 agent2 will print the chat between agent1 and agent2. Only providing --agents agent1 will print the transcript of agent1.

Dataset

All graph instances used in our benchmark are publicly available on the Hugging Face Hub: https://huggingface.co/datasets/disco-eth/AgentsNet

The dataset consists of synthetic graphs generated using various random graph models. It serves as the input for all experiments in the benchmark.

Citation

@misc{grötschla2025agentsnetcoordinationcollaborativereasoning,
      title={AgentsNet: Coordination and Collaborative Reasoning in Multi-Agent LLMs}, 
      author={Florian Grötschla and Luis Müller and Jan Tönshoff and Mikhail Galkin and Bryan Perozzi},
      year={2025},
      eprint={2507.08616},
      archivePrefix={arXiv},
      primaryClass={cs.MA},
      url={https://arxiv.org/abs/2507.08616}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
LICENSE		LICENSE
LiteralMessagePassing.py		LiteralMessagePassing.py
README.md		README.md
ablations.sh		ablations.sh
chat_tool.py		chat_tool.py
generate_graphs.py		generate_graphs.py
header.png		header.png
main.py		main.py
main.sh		main.sh
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

AgentsNet

Installation

1. Create a Conda Environment

2. Install Dependencies

Running experiments

Running more extensive experiments

Running the chat tool

Dataset

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

shayancoin/AgentsNet

Folders and files

Latest commit

History

Repository files navigation

AgentsNet

Installation

1. Create a Conda Environment

2. Install Dependencies

Running experiments

Running more extensive experiments

Running the chat tool

Dataset

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages