Delegation fixes #6165

enyst · 2025-01-09T07:30:37Z

End-user friendly description of the problem this fixes or functionality that this introduces

Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below
Fix agent delegation; use events for communication between parent and delegates.
Fix the lockup when the model returns a message instead of a tool call.

Give a summary of what the PR does, explaining any non-trivial design decisions

Delegation broke after we made the agent loop rely exclusively on controller-as-observer logic. This PR proposes a simple fix: forwarding events to the delegate.

Refactor the current logic (unsubscribing the parent when the delegate starts, and vice versa): now ONLY the parent is subscribed and stays subscribed, and it forwards events to the delegate when it has one.
should_step on MessageActions from both 'user' and 'agent', except when waiting for user input is explicitly set.
should_step on DelegateAction too; it creates a MessageAction to kickstart the delegate.
Refactor ending conditions.
Add integration tests for DelegatorAgent.
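
A minimal sketch of the forwarding and should_step rules listed above, for readers who have not opened the diff. Every class and field name here (Event, Controller, waiting_for_user_input, seen) is a hypothetical stand-in rather than the actual OpenHands code, and the handling of agent messages reflects one reading of the description.

```python
from dataclasses import dataclass, field
from typing import List, Optional


# Hypothetical stand-ins for illustration only; not the real OpenHands classes.
@dataclass
class Event:
    source: str  # 'user' or 'agent'


@dataclass
class MessageAction(Event):
    content: str = ""


@dataclass
class DelegateAction(Event):
    task: str = ""


@dataclass
class Controller:
    name: str
    waiting_for_user_input: bool = False  # assumed flag for the "explicitly set" case
    delegate: Optional["Controller"] = None
    seen: List[Event] = field(default_factory=list)

    def should_step(self, event: Event) -> bool:
        if isinstance(event, MessageAction):
            # User messages always trigger a step; agent messages do too,
            # unless the controller is explicitly waiting for user input.
            if event.source == "user":
                return True
            return not self.waiting_for_user_input
        if isinstance(event, DelegateAction):
            # Stepping here lets the parent kickstart the delegate.
            return True
        # Simplification for this sketch: other event types do not trigger a step.
        return False

    def on_event(self, event: Event) -> None:
        # Only the parent is subscribed; it forwards to the delegate if it has one.
        if self.delegate is not None:
            self.delegate.on_event(event)
            return
        self.seen.append(event)
```

In this sketch the delegate never subscribes to anything itself: it only sees events that the parent passes along through on_event.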

Also:

the delegate starts with a MessageAction
and ends with an AgentDelegateObservation/ErrorObservation
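
The start and end markers of a delegate run could be sketched as follows. This is an illustration under assumptions: the three event classes are simplified stand-ins that share only their names with the description above, and run_delegate and its work callback are hypothetical helpers, not functions from the PR.

```python
from dataclasses import dataclass
from typing import Callable, List


# Hypothetical, simplified event types; the real classes carry more fields.
@dataclass
class MessageAction:
    content: str


@dataclass
class AgentDelegateObservation:
    outputs: dict


@dataclass
class ErrorObservation:
    content: str


def run_delegate(task: str, work: Callable[[str], dict]) -> List[object]:
    """Return the events that bracket one delegate run, in order."""
    # The delegate starts with a MessageAction carrying its task.
    events: List[object] = [MessageAction(content=task)]
    try:
        # On success the run ends with an AgentDelegateObservation.
        events.append(AgentDelegateObservation(outputs=work(task)))
    except Exception as exc:
        # On failure the run ends with an ErrorObservation instead.
        events.append(ErrorObservation(content=str(exc)))
    return events
```

Whichever way the run ends, the parent has exactly one closing event to react to.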

The code is ready for review, or at least this delegation logic is.
(please ignore the print() stuff, will clean up later)

Link of any specific issues this addresses
Fix #6162

To run this PR locally, use the following command:

docker run -it --rm   -p 3000:3000   -v /var/run/docker.sock:/var/run/docker.sock   --add-host host.docker.internal:host-gateway   -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:3df4d30-nikolaik   --name openhands-app-3df4d30   docker.all-hands.dev/all-hands-ai/openhands:3df4d30

enyst · 2025-01-09T08:30:22Z

@openhands-agent Read the diff of this PR carefully. Understand what it tries to achieve. Then, we have two things to do:

The diff has added debug prints. We need them! And we need to enhance them:

all events have an event.id, add it to the print() statements after the class name, like '({event.id})'

The unit tests in test_agent_controller.py are outdated and failing. Update them to the new behavior. Understand the difference in the context of the changes of this PR and fix them.

Important:
You don't need to test the rest. Just this test file.
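
For reference, the print format requested in the first item could look like this sketch. CmdRunAction is a hypothetical stand-in for any event class that carries an id, and describe is an illustrative helper, not a function in the PR.

```python
from dataclasses import dataclass


@dataclass
class CmdRunAction:  # hypothetical stand-in for any event class with an id
    id: int
    command: str


def describe(event) -> str:
    """Format an event as 'ClassName (id)' for debug output."""
    return f"{type(event).__name__} ({event.id})"


print(describe(CmdRunAction(id=7, command="ls")))  # CmdRunAction (7)
```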

github-actions · 2025-01-09T16:47:31Z

Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly.

github-actions · 2025-01-15T03:02:23Z

Hi! I started running the integration tests on your PR. You will receive a comment with the results shortly.

github-actions · 2025-01-15T03:10:45Z

Trigger by: Pull Request (integration-test label on PR #6165)
Commit: f2dcc79
Integration Tests Report (Haiku)
Haiku LLM Test Results:
Success rate: 100.00% (6/6)

Total cost: USD 0.00

instance_id	success	reason	cost	error_message
t02_add_bash_hello	True		0
t06_github_pr_browsing	True		0
t03_jupyter_write_file	True		0
t01_fix_simple_typo	True		0
t05_simple_browsing	True		0
t04_git_staging	True		0

Integration Tests Report (DeepSeek)
DeepSeek LLM Test Results:
Success rate: 83.33% (5/6)

Total cost: USD 0.00

instance_id	success	reason
t02_add_bash_hello	True
t03_jupyter_write_file	True
t04_git_staging	True
t01_fix_simple_typo	True
t06_github_pr_browsing	True
t05_simple_browsing	False	The answer is not found in any message. Total messages: 2.

Integration Tests Report Delegator (Haiku)
Success rate: 50.00% (1/2)

Total cost: USD 0.00

instance_id	success	reason	cost	error_message
t02_add_bash_hello	True		0	nan
t01_fix_simple_typo	False	File not fixed: This is a silly typo. Really! No more typos. Enjoy!	0	RuntimeError: Agent reached maximum iteration in headless mode. Current iteration: 30, max iteration: 30

Integration Tests Report Delegator (DeepSeek)
Success rate: 100.00% (2/2)

Total cost: USD 0.00

instance_id	success	reason	cost	error_message
t02_add_bash_hello	True		0
t01_fix_simple_typo	True		0

Download testing outputs (includes both Haiku and DeepSeek results): Download

diwu-sf · 2025-01-15T06:23:52Z

Thanks!

enyst added 6 commits January 9, 2025 08:27

refactor delegation

1c5b982

handle terminal states

588b25d

fix close

f1a4ac4

refactor delegate end

382f94c

debug

a6501b7

workflows

a1331b1

enyst marked this pull request as draft January 9, 2025 07:30

enyst added the integration-test label Jan 9, 2025

enyst commented Jan 9, 2025

.github/workflows/integration-runner.yml (resolved)

enyst added the fix-me-experimental label Jan 9, 2025

enyst added 2 commits January 9, 2025 16:42

add missing haiku section

ea3ab1d

more clear trace

fca199a

All-Hands-AI deleted a comment from openhands-agent Jan 9, 2025

All-Hands-AI deleted a comment from github-actions bot Jan 9, 2025

fix delegate end

0393f9b

All-Hands-AI deleted a comment from openhands-agent Jan 9, 2025

enyst added integration-test and removed integration-test labels Jan 9, 2025

enyst marked this pull request as ready for review January 9, 2025 16:47

enyst commented Jan 9, 2025

openhands/controller/agent_controller.py (outdated, resolved)

enyst requested a review from rbren January 9, 2025 17:01

enyst commented Jan 9, 2025

openhands/controller/agent_controller.py (resolved)

enyst requested a review from li-boxuan January 9, 2025 17:10

enyst added 2 commits January 9, 2025 18:35

Update openhands/controller/agent_controller.py

84fb5c5

fix delegate too early

f62b249

enyst added 2 commits January 13, 2025 12:44

Merge branch 'main' into enyst/delegation

b2a55cb

Merge branch 'main' into enyst/delegation

b8fe210

enyst added integration-test and removed integration-test labels Jan 15, 2025

All-Hands-AI deleted a comment from github-actions bot Jan 15, 2025

enyst added 2 commits January 15, 2025 03:01

tweak, comments

2f83984

clean up debugging

0f74014

All-Hands-AI deleted a comment from github-actions bot Jan 15, 2025

enyst added 2 commits January 15, 2025 03:07

Revert "temporary run faster"

0295753

This reverts commit d07f225.

Revert "temporary - quicker run"

4db521a

This reverts commit 8d7382c.

enyst added integration-test and removed integration-test labels Jan 15, 2025

fix parameter order

2efc6cb

enyst added integration-test and removed integration-test labels Jan 15, 2025

All-Hands-AI deleted a comment from github-actions bot Jan 15, 2025

remove messing with history for now

2b1a2eb

enyst added integration-test and removed integration-test labels Jan 15, 2025

All-Hands-AI deleted a comment from github-actions bot Jan 15, 2025

remove debug

3df4d30

enyst enabled auto-merge (squash) January 15, 2025 03:11

enyst merged commit b9a70c8 into main Jan 15, 2025
13 checks passed

enyst deleted the enyst/delegation branch January 15, 2025 03:24

csmith49 pushed a commit to csmith49/OpenHands that referenced this pull request Jan 19, 2025

Delegation fixes (All-Hands-AI#6165)

1bc509e
