Setup model routing config and plan routing to o1 #6189

Open. Wants to merge 25 commits into base: main.
Conversation

@ryanhoangt (Contributor) commented on Jan 10, 2025:

End-user friendly description of the problem this fixes or functionality that this introduces

  • Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below

Give a summary of what the PR does, explaining any non-trivial design decisions

This PR is to:

  • Set up config for model routing-related features.
  • Implement a prototype for routing to reasoning models if appropriate. The criteria are based on this paper.
[Screenshot: 2025-01-10 at 17:22:40]
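For illustration, here is a minimal sketch of the plan-routing idea described above: a cheap judge model classifies the prompt and, if a step-by-step plan seems needed, the request is routed to a reasoning model. The prompt text, model names, and function are illustrative assumptions, not the code in this PR.

```python
# Hypothetical sketch of plan-based routing, not the PR's actual implementation.
import litellm

JUDGE_MODEL = 'gpt-4o'      # model that classifies prompt complexity
REASONING_MODEL = 'o1'      # model used when a detailed plan is needed
DEFAULT_MODEL = 'claude-3-5-sonnet-20241022'

JUDGE_PROMPT = (
    'Does the following request require a detailed step-by-step plan to solve? '
    'Answer only YES or NO.\n\n'
    '=== BEGIN USER MESSAGE ===\n{message}\n=== END USER MESSAGE ==='
)

def route_model(message: str) -> str:
    """Return the model name to use for this user message."""
    judgment = litellm.completion(
        model=JUDGE_MODEL,
        messages=[{'role': 'user', 'content': JUDGE_PROMPT.format(message=message)}],
    )
    answer = judgment.choices[0].message.content.strip().upper()
    return REASONING_MODEL if answer.startswith('YES') else DEFAULT_MODEL
```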

Link of any specific issues this addresses

@xingyaoww (Collaborator) left a comment:
Awesome! This is a great start for model routing and LGTM!

Router that routes the prompt that is judged by a LLM as complex and requires a step-by-step plan.
"""

JUDGE_MODEL = 'gpt-4o'
Collaborator:
It would be interesting to see if we can experiment with a cheaper model for that 🤔

* Translating high-level requirements into detailed implementation steps and ensuring consistency.

=== BEGIN USER MESSAGE ===
{message}
Collaborator:
We could also experiment with sending o1 the last 5/10 actions/observations 🤔 in case there's some deep reasoning required to figure out the error, etc.
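A rough sketch of that suggestion, assuming the recent history is available as a plain list of (action, observation) strings; the helper name and prompt format are made up for illustration:

```python
# Illustrative only: fold the last N action/observation pairs into the prompt
# sent to the reasoning model, so it can reason over recent errors.
def build_reasoning_prompt(task: str, history: list[tuple[str, str]], n: int = 5) -> str:
    recent = history[-n:]
    steps = [f'ACTION: {action}\nOBSERVATION: {observation}' for action, observation in recent]
    return task + '\n\nRecent steps:\n' + '\n\n'.join(steps)
```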

)

# Replace the model with the reasoning model
kwargs['model'] = self.model_routing_config.reasoning_model
Collaborator:
Is model enough, or also: custom provider, base URL?
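For illustration, a hedged sketch of what overriding more than the model name could look like; the extra routing-config fields below are assumptions, not fields that exist in this PR:

```python
# Hypothetical: also carry provider details when routing to the reasoning model.
def apply_reasoning_overrides(kwargs: dict, routing_config) -> dict:
    kwargs['model'] = routing_config.reasoning_model
    if getattr(routing_config, 'reasoning_base_url', None):   # assumed optional field
        kwargs['base_url'] = routing_config.reasoning_base_url
    if getattr(routing_config, 'reasoning_api_key', None):    # assumed optional field
        kwargs['api_key'] = routing_config.reasoning_api_key
    return kwargs
```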

Collaborator:
We could design the reasoning model not as a part of an LLM instance, but as a second LLM instance in the agent?
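A minimal sketch of that alternative, with placeholder class and attribute names rather than the actual OpenHands types:

```python
# Sketch of the "second LLM instance in the agent" idea; names are placeholders.
class SketchAgent:
    def __init__(self, llm, reasoning_llm=None):
        self.llm = llm                      # default model for ordinary steps
        self.reasoning_llm = reasoning_llm  # optional reasoning model (e.g. o1)

    def complete(self, prompt: str, needs_plan: bool):
        # Fall back to the default LLM when no reasoning model is configured.
        target = self.reasoning_llm if (needs_plan and self.reasoning_llm) else self.llm
        return target.completion(messages=[{'role': 'user', 'content': prompt}])
```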

@ryanhoangt (Contributor, Author) replied on Jan 12, 2025:
> Is model enough, or also: custom provider, base URL?

Yeah, I think we also need to allow users to set these, especially if they aren't using an LLM proxy 🤔

Collaborator:
Using [llm.reasoning_model] will do it implicitly!

@ryanhoangt (Contributor, Author):
@enyst I've refactored it to use separate LLM instances, can you have a look again to see if it matches your suggestion?

(Outdated review comment on config.template.toml, resolved.)
@enyst (Collaborator) left a comment:
I'm so happy to see this, thank you! I do think we are missing some minimal framework to experiment with reasoning models.

About the way to choose another model:
We already have the ability to choose, configure, and use an arbitrary model, for example in evals: we can write the model configuration in toml, in a custom named LLM config section such as [llm.o1], load it with a utility function, and instantiate an LLM from it.

We can use that here. Names are user-defined, and we can, if we want, set in stone a particular name for the reasoning model, e.g. [llm.reasoning_model], or [llm.oh_reasoning_model], or [llm.blueberry] (or strawberry for that matter), whatever name.
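Roughly what that flow could look like, assuming a helper along the lines of the existing utility for loading named LLM configs; the import paths and names below are assumptions and may differ from the actual code:

```python
# Given a named section in config.toml such as:
#
#   [llm.reasoning_model]
#   model = "o1"
#   api_key = "..."
#
# the named config can be loaded and turned into a separate LLM instance.
from openhands.core.config import get_llm_config_arg  # assumed helper name
from openhands.llm.llm import LLM                      # assumed import path

reasoning_llm_config = get_llm_config_arg('reasoning_model')
reasoning_llm = LLM(config=reasoning_llm_config)
```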

@mamoodi (Collaborator) commented on Feb 3, 2025:
@ryanhoangt gentle ping here in case this fell off the radar.


[llm.reasoning_model]
model = "o1"
api_key = ""
Collaborator:
I think we might have here a little too much configurability 😅

It's perfectly fine if we reserve some names for our own features. So the names (of the llm configs) don't need to be configurable themselves, they mean what we say they mean.

We did that with draft_llm:

We can reserve the names reasoning_model and reasoning_judge_model for the reasoning model routing feature, and use them freely as necessary in the code. So we don't need these lines:

reasoning_llm_config_name = 'reasoning_model'
judge_llm_config_name = 'judge_model'

That will also simplify the code below, starting from reading these configs in llm_config.py: I think we don't need to do anything there? They'll be read like any other named configs. And it will save us quite a bit of code complexity elsewhere too?

@ryanhoangt (Contributor, Author):

> I think we might have here a little too much configurability 😅

Yeah, I'm feeling this too. While implementing this I've also been thinking about how to support the Notdiamond router, where we can train a custom router on a set of selected LLMs, so the llm config names are not fixed like the two above. But in this case a reserved name would indeed make more sense, and we probably don't need it to be configurable. I'll try to change that.

@enyst (Collaborator) replied on Feb 4, 2025:

Yes, I hear you. I'm thinking about Notdiamond too, and about the litellm option: there's a routing feature in litellm that we have tried to use twice in the past, and it proved too much complexity given that it doesn't actually support one of the most important things we wanted from it (fallbacks/retries by reading the providers' retry-after headers). Maybe we will look at it again (third time's the charm?) or not.

Anyway, there seem to be two ways we can take on the idea of future routing:

  • ignore it. We have a full feature here, we implement it as necessary, nice enough but we don't necessarily need all the building blocks we're guessing we will need for the most generalizable thing. Cross that bridge when we come to it. (we don't even know exactly what they will need, do we?)
  • keep an abstract class for routing config, and not a lot more (a rough sketch of this option follows below). Maybe the way we took with condensers is a relevant example here: the configs for that do share an ABC, but the subclasses don't have the same attributes, and that's fine. Each will be configured as it needs, maybe with nothing in its config (the NoOpCondenser), or maybe with a bunch of attributes (max_size, whatever, for some specialized condensers which really need a config of their own). Again, cross that last bridge when we come to it.

OK, I started by saying there are two ways, but idk, maybe they're almost the same today. 😅

This routing feature, in this PR, does need to be enabled, so as long as it's enabled, it can do its thing IMHO.
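A hedged sketch of the second option above, mirroring how condenser configs share a base class while each subclass carries only its own fields; pydantic and all names here are illustrative assumptions, not classes from this PR:

```python
# Illustrative only: routing configs share a small base class, and each concrete
# router config declares just the fields it actually needs.
from pydantic import BaseModel

class RouterConfig(BaseModel):
    """Base class for routing configs; subclasses add their own fields."""
    type: str = 'noop'

class NoOpRouterConfig(RouterConfig):
    type: str = 'noop'                        # always use the default LLM

class LLMJudgeRouterConfig(RouterConfig):
    type: str = 'llm_judge'
    judge_llm_config: str = 'judge_model'           # name of the [llm.*] section
    reasoning_llm_config: str = 'reasoning_model'   # routed-to reasoning model
```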
